[g13] tarring skim files

Thu Jul 22 16:35:57 EDT 2010

Can you tell me what the data expansion factor is? for each input byte, how many bytes of output are there?
Dennis

On Jul 22, 2010, at 3:57 PM, Paul Mattione wrote:

> Each cooking job cooks a single raw file, and the resulting cooked  
> file is then skimmed.  This produces a lot of output files for the  
> single job (skims, dsts, ntuples, anahists, gflux, etc).  It's easier  
> to just to group all of these files together since they're all  
> produced by the same job, than to try to combine things across 50+ jobs.
> 
>  - Paul
> 
> On Jul 22, 2010, at 3:48 PM, slava at jlab.org wrote:
> 
> You lost me at "From the standpoint...". Why is it easiest?
> 
> Slava
> 
>> From the standpoint of cooking, I think the easiest way of writing
>> our skims, dsts, monitoring histograms, etc to tape is to combine them
>> all into a single tar file for each cooked file.
>> 
>> However, this means that in order to extract your skim you have to
>> cache much more (all) of the data from the tapes than if you just had
>> a single tar file for a given skim for a given run.  But if you
>> combine things by run, if anything needs to be re-cooked (e.g. a
>> cooking job fails), you have to re-make all of the tar files
>> associated with that run.
>> 
>> How do you guys think we should do it?
>> 
>> - Paul
>> 
>> 
>> _______________________________________________
>> g13 mailing list
>> g13 at jlab.org
>> https://mailman.jlab.org/mailman/listinfo/g13
>> 
> 
> 
> _______________________________________________
> g13 mailing list
> g13 at jlab.org
> https://mailman.jlab.org/mailman/listinfo/g13

--
Dennis Weygand
weygand at jlab.org
(757) 269-5926