[Halld-offline] zero field
Matthew Shepherd
mashephe at indiana.edu
Fri Jul 18 11:52:55 EDT 2014
Great -- thanks.
It was going to catch someone at some point. (I'm
assuming I'm not the only one that hits ctrl-c when I realize
that I've just executed a long program with
incorrect arguments.) The field download takes
a while and happens right at the time you realize you
made an execution mistake.
For such a large system with so many users and
possible permutations of use it is really important
to make every sanity check possible and abort
immediately when something doesn't seem right.
Matt
---------------------------------------------------------------------
Matthew Shepherd, Associate Professor
Department of Physics, Indiana University, Swain West 265
727 East Third Street, Bloomington, IN 47405
Office Phone: +1 812 856 5808
On Jul 18, 2014, at 10:23 AM, David Lawrence <davidl at jlab.org> wrote:
>
> Hi Matt,
>
> Sorry this has cost you so much time. You’re the first person to
> report this as an issue in the 6 months since it was deployed. I’ll
> go ahead and put in the code to generate and check the md5
> checksum whenever a program is started so that an error can
> be flagged if there is a mismatch.
>
> Regards,
> -David
>
> On Jul 18, 2014, at 10:15 AM, Matthew Shepherd <mashephe at indiana.edu> wrote:
>
>>
>> With bits and pieces from 3 different people, I think
>> I've figured this out... gggrrrrrr!
>>
>> Mike Staib suggested my log seemed to indicate my field doesn't have
>> enough z points.
>>
>> It seems like jana is using JANA_RESOURCE_DIR to cache fields. (I got
>> this from Paul.. I've never set this, but grep and reading jana source
>> tells me it is set by default to /tmp/username/resources.)
>>
>> If I go to that directory and delete it and rerun bfiled2root, I get
>> a full field.
>>
>> What happened? Here's my theory:
>>
>> I ran a job and that job chose to download the field. This
>> is the first time consuming thing that happens in a job. If,
>> during the field download you kill the job with ctrl-c then
>> you are left with a partial field.
>> (I tested this out and was able to repeat it.)
>>
>> I must have been unlucky and hit ctrl-c on the job that
>> tried to download the field. Note this is normal for must users.
>> It is easy to execute a command accidentally or realize just
>> after you press return that you didn't specify all the arguments
>> that you wanted.
>>
>> This is a really nasty behavior because every subsequent job
>> then just reads this partial field and never prints any message
>> or error.
>>
>> Can we put some sort of check in the field? Write the number
>> of points in the file first. And then on read back when there
>> isn't than many points abort with an error.
>>
>> nasty nasty nasty... that cost Paul and me a ton of time
>> this week
>>
>> Matt
>>
>> ---------------------------------------------------------------------
>> Matthew Shepherd, Associate Professor
>> Department of Physics, Indiana University, Swain West 265
>> 727 East Third Street, Bloomington, IN 47405
>>
>> Office Phone: +1 812 856 5808
>>
>> On Jul 18, 2014, at 7:18 AM, David Lawrence <davidl at jlab.org> wrote:
>>
>>>
>>> Hi Matt,
>>>
>>> The output looks right. What happens if you run the bfield2root utility? This should
>>> be built as part of the default sim-recon build. (Source is in $HALLD_HOME/src/programs/Utilities/bfield2root)
>>>
>>> Draw the field map in ROOT using:
>>>
>>>> bfield2root
>>>> root bfield.root
>>> root [1] Bz_vs_r_vs_z->Draw("colz")
>>>
>>> Also, how are you checking that the field map is returning zeros?
>>>
>>> Regards,
>>> -David
>>>
>>> On Jul 17, 2014, at 9:49 PM, Matthew Shepherd <mashephe at indiana.edu> wrote:
>>>
>>>>
>>>> Hi all,
>>>>
>>>> Paul and I have been trying to understand why I cannot
>>>> get the example analysis software working and we seem
>>>> to have traced the problem down to the fact that the
>>>> magnetic field map is returning a zero field.
>>>>
>>>> Does anyone know how to debug this? The startup
>>>> of the job suggests all is OK (I think):
>>>>
>>>> JANA >>URL: sqlite:////home/s4/mashephe/gluex/ccdb.sqlite
>>>> JANA >>context: default
>>>> JANA >>Reading Magnetic field map from Magnets/Solenoid/solenoid_1350_poisson_20
>>>> 130925 ...
>>>> Nx=251 Ny=1 Nz=43 ) at 0x7f46720a0ce0
>>>> Fine-mesh evio file does not exist.
>>>> Constructing the fine-mesh B-field map...
>>>> rmin: 0 rmax: 88.5 dr: 0.1 zmin: 0 zmax: 600 dz: 0.1vg.: 0.0Hz)
>>>> Number of points in z = 6000
>>>> Number of points in r = 885
>>>> JANA >>10599 entries found (Created Magnetic field map of type DMagneticFieldMap
>>>>
>>>> I'm using:
>>>>
>>>> sim-recon-2014-06-30
>>>> ccdb_1.02
>>>> hdds-2.1
>>>> jana_0.7.1p3
>>>>
>>>> and a ccdb.sqlite file from July 14 (copied from Mark's web page link that day). I've also
>>>> tried the ccdb.sqlite file from dc2 conditions.
>>>>
>>>> Matt
>>>>
>>>>
>>>> ---------------------------------------------------------------------
>>>> Matthew Shepherd, Associate Professor
>>>> Department of Physics, Indiana University, Swain West 265
>>>> 727 East Third Street, Bloomington, IN 47405
>>>>
>>>> Office Phone: +1 812 856 5808
>>>>
>>>>
>>>> _______________________________________________
>>>> Halld-offline mailing list
>>>> Halld-offline at jlab.org
>>>> https://mailman.jlab.org/mailman/listinfo/halld-offline
>>>
>>
>
More information about the Halld-offline
mailing list