[Clas_offline] gsim_bat does not run in batch farm
Nick Tyler
tylern at jlab.org
Mon Oct 1 14:35:42 EDT 2018
Hello Jixie and Zhiwen,
I have a working container available on the farm which can be used with singularity. It would be interesting to see if the problem persists in the container environment.
You should be able to test it by loading singularity with:
module load singularity
And then running:
singularity shell /work/clas/clase1/tylern/clas6.img
This will give you a shell with all the clas software available in the path.
I agree though there is some work that needs to be done if this is going to be the solution. At the moment I’m using a precompiled version of cernlib and have not been able to compile cernlib in the container. If anyone has any knowledge on compiling cernlib that would be greatly appreciated.
Nick
> On Oct 1, 2018, at 2:19 PM, Zhiwen Zhao <zwzhao at jlab.org> wrote:
>
> Hi, Jixie
>
> I think the debugging and interpreting the result on new farm using old software is a never ending process.
>
> Nick made a clas6 software docker image which could be the ultimate way of running old software and have the same result as the old farm did
> https://github.com/tylern4/docker-clas6/blob/master/README.md <https://github.com/tylern4/docker-clas6/blob/master/README.md>
> There are some work needed if the collaboration wants to make this the official way.
>
> Zhiwen
>
> On 9/30/2018 5:24 AM, Jixie Zhang wrote:
>> Hi CLAS members,
>> gsim_bat give me some trouble that I could not understand. I am wondering
>> if any of you have similar experience. The clas_package version I am using is
>> a pretty old version: release-4-14, but I made necessary modification to have it
>> compiled and run in RHEL7.
>> gsim_bat does not run in batch farm since August 2018, before that the same executable
>> ran smoothly.
>> gsim_bat encountered with "segmentation fault" when run in batch farm.
>> It runs normally in interactive farm (ifarm1401 and ifarm1402) but only
>> in some specified work disk. I recompiled the whole CLAS PACKAGE but this
>> problem stays.
>> After careful investigation, I found that gsim_bat can not be run in the following path:
>> /work/halla/solid/*
>> /work/hallc/sane/*
>> /scratch/jixie/*
>> /cache/halla/solid/*
>> /home/jixie/*
>> (I only tested the above disk|drive. the list could be even longer...)
>> The error message is:
>> --------------error message start---------------------
>> Program received signal SIGSEGV: Segmentation fault - invalid memory reference.
>> Backtrace for this error:
>> #0 0x7FB9ADAC3467
>> #1 0x7FB9ADAC3AAE
>> #2 0x7FB9ACFCA66F
>> #3 0x7FB9AD02A13F
>> #4 0x625AD5 in ctimef_
>> #5 0x41AA65 in gsimpar_2_bos_
>> #6 0x4076CF in uginit_
>> #7 0x4058D3 in MAIN__ at gsim_bat.F:?
>> Segmentation fault
>> ifarm1401.jlab.org <http://ifarm1401.jlab.org/> <http://ifarm1401.jlab.org <http://ifarm1401.jlab.org/>>>
>> --------------error message end---------------------
>> The error message told that the problem happen at calling ctimef() subroutine, which comes from cernlib. I do not trust this error message too much.
>> However, the same executable can run in
>> /work/clas/claseg4/jixie/*
>> /work/halla/g2p/disk1/jixie/*
>> Surprisingly, gsim_bat_debug works well in everywhere. gsim_bat_debug was compiled with -DEBUG defined, which means the source code is totally different.
>> See more details of my test here: https://userweb.jlab.org/~jixie/gsim_bat_problem.txt <https://userweb.jlab.org/~jixie/gsim_bat_problem.txt> <https://userweb.jlab.org/%7Ejixie/gsim_bat_problem.txt <https://userweb.jlab.org/%7Ejixie/gsim_bat_problem.txt>>
>> --
>> With Best Regards,
>> Jixie Zhang
>> 757-269-7735 <tel:757-269-7735>
>> _______________________________________________
>> _______________________________________________
>> Clas_offline mailing list
>> Clas_offline at jlab.org <mailto:Clas_offline at jlab.org>
>> https://mailman.jlab.org/mailman/listinfo/clas_offline <https://mailman.jlab.org/mailman/listinfo/clas_offline>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mailman.jlab.org/pipermail/clas_offline/attachments/20181001/d736671b/attachment.html>
More information about the Clas_offline
mailing list