[Clas_offline] Fwd: unresolved issue with user_ana and Centos62 batch farm
Paul Mattione
pmatt at jlab.org
Thu Nov 20 13:25:22 EST 2014
I dunno if this fix ever got sent out to the listserv, and I lost the original email I sent, but here's one a few emails later.
- Paul
Begin forwarded message:
> From: Paul Mattione <pmatt at jlab.org>
> Date: September 9, 2013 5:05:52 PM EDT
> To: Dennis Weygand <weygand at jlab.org>
> Cc: Johann Goetz <theodore.goetz at gmail.com>
> Subject: Re: [Clas_offline] unresolved issue with user_ana and Centos62 batch farm
>
> Well, the line it crashes on is the "read" line below (recsis/retcl_yield.F):
>
> if (nbyt .gt. 0) then
> read(*,13, end=999)cline
>
> So it's whatever fortran "read" with "*" as the first argument means (I thought it was command line, but maybe it is stdin, I dunno).
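On the question of what "*" means here: in a Fortran READ, "*" as the unit is the default input unit, which is standard input, not the command line. The runtime error quoted further down says the same thing (unit = 5, file = 'stdin'). Below is a minimal standalone sketch, not taken from recsis, that reads stdin the same way but uses iostat= so that end-of-file is reported back to the program instead of aborting it. The program name and the printed messages are made up for illustration; only the a200 format mirrors retcl_yield.F.

```fortran
program read_star_demo
  use, intrinsic :: iso_fortran_env, only: input_unit, iostat_end
  implicit none
  character(len=200) :: cline
  integer :: ios

  ! "*" in the read below is the default input unit, i.e. standard input.
  print '(a,i0)', 'default input unit = ', input_unit

  do
     read (*, '(a200)', iostat=ios) cline
     if (ios == iostat_end) then
        ! End of stdin: leave the loop instead of crashing.
        print '(a)', 'reached end of stdin'
        exit
     else if (ios /= 0) then
        ! Any other I/O error condition.
        print '(a,i0)', 'read error, iostat = ', ios
        exit
     end if
     print '(a)', 'got: ' // trim(cline)
  end do
end program read_star_demo
```

Built with a Fortran 2003 compiler and run with stdin redirected from /dev/null, this should go straight to the end-of-stdin branch.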
>
> - Paul
>
> On Sep 9, 2013, at 5:02 PM, Dennis Weygand wrote:
>
>> Hello Paul,
>>
>> Did you mean 'command line'? or stdin?
>> Dennis
>>
>>
>> On Sep 9, 2013, at 4:54 PM, Paul Mattione wrote:
>>
>>> I have the same problem as well, and I think I may have found a kludge to avoid it (it doesn't "fix" it though).
>>>
>>> My understanding is that in this part of the code, recsis tries to read in tcl arguments from the command line. Since we don't want to do this in a batch job anyway, in recsis/retcl_yield.F, just before the crash at line 100, I changed the lines (in my personal build):
>>>
>>> IF (LINTER) THEN
>>> call asynck_c(nbyt)
>>> else
>>> nbyt = 1
>>> endif
>>>
>>> to:
>>>
>>> IF (LINTER) THEN
>>> call asynck_c(nbyt)
>>> else
>>> nbyt = 0
>>> endif
>>>
>>> This skips the check-the-command-line section of the code entirely, and appears to be working for me (although I have some other issues to iron out on my monte-carlo so I'm not completely sure yet).
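A possible alternative to hard-coding nbyt = 0, sketched as a standalone program: keep attempting the read, but use iostat= and remember when standard input has run out, so that the unit is never read again after that. This is only a sketch under assumptions. The names stdin_guard, poll_stdin and stdin_done are hypothetical and do not exist in recsis, and none of this has been tried inside user_ana.

```fortran
module stdin_guard
  implicit none
  ! Hypothetical flag (not in recsis): set once stdin has hit EOF or an error.
  logical, save :: stdin_done = .false.
contains
  subroutine poll_stdin(cline, got_line)
    character(len=*), intent(out) :: cline
    logical, intent(out) :: got_line
    integer :: ios

    cline = ' '
    got_line = .false.
    if (stdin_done) return        ! never touch unit * again after EOF

    read (*, '(a)', iostat=ios) cline
    if (ios /= 0) then
       stdin_done = .true.        ! EOF (ios < 0) or error (ios > 0)
    else
       got_line = .true.
    end if
  end subroutine poll_stdin
end module stdin_guard

program guard_demo
  use stdin_guard
  implicit none
  character(len=200) :: cline
  logical :: got
  integer :: i

  ! Poll several times, the way an event loop would call the routine repeatedly.
  do i = 1, 5
     call poll_stdin(cline, got)
     if (got) then
        print '(a)', 'line: ' // trim(cline)
     else
        print '(a)', 'no input available'
     end if
  end do
end program guard_demo
```

Run with stdin redirected from /dev/null, every pass should report that no input is available rather than stopping the job. Unlike nbyt = 0, this would still pick up tcl commands when something is actually piped in.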
>>>
>>> I realize this isn't an ideal fix, but maybe it'll get things working for other people. Let me know if it does.
>>>
>>> - Paul
>>>
>>> On Sep 9, 2013, at 9:14 AM, Silvia Niccolai wrote:
>>>
>>>> Dear Dennis and John,
>>>> we have managed to run the whole set of executables that are needed for
>>>> our MC jobs (event generator, GSIM, GPP, user_ana) on the **interactive**
>>>> Centos62 machine. Everything seems to be working fine and we are getting
>>>> consistent results with what was obtained running the full simulation
>>>> chain on Centos53.
>>>>
>>>> However, there are problems when moving onto the batch farm. My
>>>> student has met the same problem that was reported a week or so ago by
>>>> Kijun Park (and Kijun has confirmed to us that his problem has not been
>>>> solved yet). The original email from Kijun is reproduced below. Did any of
>>>> you have a chance to look into this?
>>>>
>>>> It seems like user_ana crashes while the routine retcl_yield.F tries to
>>>> read the tcl file. Why does this crash happen only when running on the
>>>> batch farm and not on the interactive one? Is there any difference between
>>>> ifarm1102 and the batch nodes? I have no idea...
>>>>
>>>> Line 100 of retcl_yield.F reads:
>>>>
>>>> read(*,13, end=999)cline
>>>> 13 format(a200)
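A guess at why this fails only in batch: there stdin is presumably empty or not attached to a terminal, so the first read hits end-of-file and jumps to label 999, and a later call then reads the same unit again. The end= specifier only handles the end-of-file condition itself. gfortran appears to treat a further sequential read after that as an error, which only err= or iostat= would catch. A minimal reproducer sketch under that assumption (the program name and the loop are invented; only the read and format lines mirror retcl_yield.F):

```fortran
program eof_twice
  implicit none
  character(len=200) :: cline
  integer :: i

  do i = 1, 2
     ! Same form as line 100 of retcl_yield.F: only end= is handled.
     read (*, 13, end=999) cline
13   format (a200)
     print '(a)', 'got: ' // trim(cline)
     cycle
999  print '(a,i0)', 'end of file on pass ', i
  end do
end program eof_twice
```

Run as ./eof_twice < /dev/null. With gfortran the expectation is that pass 1 takes the end= branch and pass 2 aborts with the "Sequential READ or WRITE not allowed after EOF marker" error; other compilers may behave differently.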
>>>>
>>>> My FORTRAN knowledge is a bit too stale lately to start looking into
>>>> this in a useful way.
>>>> And it seems like Kijun and Nerses have already spent quite some time
>>>> trying to look into this without finding a solution. If anyone has any
>>>> idea, it would be great. It could be another step towards
>>>> actually managing to use the Centos62 farm (as of today, it is used at
>>>> only 40%, while the "old" farm is overloaded...).
>>>>
>>>> Thank you all and best regards,
>>>> Silvia
>>>>
>>>>
>>>>>
>>>>> On 08/30/2013 03:52 PM, KiJun Park wrote:
>>>>>> Hello All,
>>>>>>
>>>>>> Do you have any idea how to resolve the following issue?
>>>>>>
>>>>>> This is an error from "user_ana" running on the batch farm.
>>>>>> Meanwhile, I don't have any problem with the same code on ifarm1102 (interactively).
>>>>>> I have checked the environment setup in both cases and found nothing wrong so far...
>>>>>> Thank you in advance for any help.
>>>>>>
>>>>>> At line 100 of file reconstruction/recsis/retcl_yield.F (unit = 5, file = 'stdin')
>>>>>> Fortran runtime error: Sequential READ or WRITE not allowed after EOF marker, possibly use REWIND or BACKSPACE
>>>>>> cp: cannot stat `clas.A00': No such file or directory
>>>>>> cp: cannot stat `clas.A00': No such file or directory
>>>>>> cp: cannot stat `eventtag': No such file or directory
>>>>>> cp: cannot stat `eventtag': No such file or directory
>>>>>> tput: No value for $TERM and no -T specified
>>>>>> stty: standard input: Invalid argument
>>>>>>
>>>>>> Sincerely yours,
>>>>>> Kijun
>>>> _______________________________________________
>>>> Clas_offline mailing list
>>>> Clas_offline at jlab.org
>>>> https://mailman.jlab.org/mailman/listinfo/clas_offline
>>>
>>
>> --
>> Dennis Weygand
>> weygand at jlab.org
>> (757) 269-5926 (office)
>> (757) 870-4844 (cell)
>>
>>
>>
>>
>