[Halld-offline] Hall-D Computing plan and storage of simulated data.

Graham Heyes heyes at jlab.org
Fri Feb 11 17:40:31 EST 2011


As Sandy pointed out, I had a typo in my last mail. I meant 64-bit nodes, of course.

Graham

On Feb 11, 2011, at 3:37 PM, Graham Heyes <heyes at jlab.org> wrote:

> Curtis,
>    Chip asked for SpecInt numbers. This time around I am asking the questions, and I am asking for "time to process one event" normalized to our current-generation 32-bit farm nodes. This should be much easier to measure or estimate. I have spreadsheets that can take those values, estimate performance in terms of standardized benchmarks, and then estimate how many nodes we need.
> 
>    Regards to all,
>        Graham
> 
> On Feb 11, 2011, at 3:27 PM, Curtis A. Meyer wrote:
> 
>> Thanks David -
>> 
>>   I concur with Richard that the grid infrastructure is critical and needs
>> to be placed into the plan somewhere. Also, in going through the numbers in
>> the plan, I did not fully understand all the SpecInt numbers that were there.
>> It would be useful to have a meeting where we go through these numbers before
>> submitting the final plan. I am sure that they are right, but I am just "slow".
>> 
>>  Curtis
>> On 2/11/11 1:38 PM, David Lawrence wrote:
>>> Hi All,
>>> 
>>> Just a follow-up to the discussion at the meeting on Wednesday. I'd
>>> mentioned that Mark had submitted the document I wrote up in 2008 when
>>> filling out Chip's original spreadsheet. The document titled "12GeV
>>> Computing Plan" is GlueX-doc-1431. As to the specific issue of storing
>>> simulated data, it has this in the second paragraph on page 2:
>>> 
>>> "Hall-D currently plans to follow a computing model that stores very
>>> little simulated data on tape. The expectation is that it would be
>>> nearly as fast or faster to reproduce the data as it would be to read it
>>> off of tape and reprocess it. The “DST” files produced by the simulation
>>> are therefore only expected to need 10% of the disk space used for the
>>> raw, simulated data that will exist only on the farm nodes."
>>> 
>>> This was not backed up by any hard numbers, but I still suspect it is
>>> true. It refers only to what is needed from JLab resources and
>>> not to what may be implemented outside of JLab (and JLab's budget). To my
>>> knowledge, this represents our current Computing Plan for Hall-D.
>>> 
>>> Regards,
>>> -David
>>> _______________________________________________
>>> Halld-offline mailing list
>>> Halld-offline at jlab.org
>>> https://mailman.jlab.org/mailman/listinfo/halld-offline
>>> 
>> 
>> 
>> -- 
>> Prof. Curtis A. Meyer        Department of Physics
>> Phone:    (412) 268-2745        Carnegie Mellon University
>> Fax:    (412) 681-0648        Pittsburgh PA 15213-3890
>> cmeyer at ernest.phys.cmu.edu    http://www.curtismeyer.com/
>> 
>> 
>> 
> 
> 


