[Clas_offline] gsim_bat does not run in batch farm

Zhiwen Zhao zwzhao at jlab.org
Mon Oct 1 14:19:08 EDT 2018


Hi, Jixie

I think debugging and interpreting results from old software on the new 
farm is a never-ending process.

Nick made a clas6 software docker image, which could be the ultimate way 
of running old software and getting the same results as the old farm did:
https://github.com/tylern4/docker-clas6/blob/master/README.md
Some work is needed if the collaboration wants to make this the 
official way.

Zhiwen

On 9/30/2018 5:24 AM, Jixie Zhang wrote:
> Hi CLAS members,
>    gsim_bat is giving me some trouble that I cannot understand. I am 
> wondering if any of you have had a similar experience. The clas_package 
> version I am using is a pretty old one, release-4-14, but I made the 
> modifications necessary to compile and run it on RHEL7.
> 
>    gsim_bat has not run on the batch farm since August 2018; before that 
> the same executable ran smoothly.
> gsim_bat hits a "segmentation fault" when run on the batch farm.
> It runs normally on the interactive farm (ifarm1401 and ifarm1402), but only
> on some specific work disks. I recompiled the whole CLAS package, but the
> problem remains.
> 
> After careful investigation, I found that gsim_bat cannot be run under the 
> following paths:
> 
> /work/halla/solid/*
> /work/hallc/sane/*
> /scratch/jixie/*
> /cache/halla/solid/*
> /home/jixie/*
> (I only tested the disks above; the list could be even longer.)
> The error message is:
> 
> --------------error message start---------------------
> Program received signal SIGSEGV: Segmentation fault - invalid memory reference.
> 
> Backtrace for this error:
> #0  0x7FB9ADAC3467
> #1  0x7FB9ADAC3AAE
> #2  0x7FB9ACFCA66F
> #3  0x7FB9AD02A13F
> #4  0x625AD5 in ctimef_
> #5  0x41AA65 in gsimpar_2_bos_
> #6  0x4076CF in uginit_
> #7  0x4058D3 in MAIN__ at gsim_bat.F:?
> Segmentation fault
> ifarm1401.jlab.org>
> 
> --------------error message end---------------------
> 
> The error message says the problem happens in the call to the ctimef() 
> subroutine, which comes from cernlib. I do not trust this error message 
> too much.
> 
> However, the same executable does run under:
> /work/clas/claseg4/jixie/*
> /work/halla/g2p/disk1/jixie/*
> 
> Surprisingly, gsim_bat_debug works everywhere. gsim_bat_debug 
> was compiled with -DEBUG defined, which means the source code is totally 
> different.
> 
> See more details of my test here: 
> https://userweb.jlab.org/~jixie/gsim_bat_problem.txt 
> 
> 
> -- 
> With Best Regards,
> Jixie Zhang
> 757-269-7735
> 
> _______________________________________________
> Clas_offline mailing list
> Clas_offline at jlab.org
> https://mailman.jlab.org/mailman/listinfo/clas_offline
> 
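
The directory-by-directory test described in the quoted message could be 
repeated systematically with a small script. What follows is only a sketch 
under stated assumptions, not the procedure Jixie actually used: the helper 
names describe_exit and probe_directories are made up for illustration, the 
executable is assumed to be given by absolute path (a relative path would be 
looked up relative to each test directory), and it is assumed that the crash 
ends the process with SIGSEGV, as the "Segmentation fault" line in the log 
suggests.

```python
import signal
import subprocess
import sys
from pathlib import Path


def describe_exit(returncode):
    """Translate a subprocess return code into a short label."""
    if returncode == 0:
        return "ok"
    if returncode < 0:
        # On POSIX, a negative return code means the child was killed
        # by the signal with that number.
        try:
            return "killed by " + signal.Signals(-returncode).name
        except ValueError:
            return "killed by signal %d" % -returncode
    return "exit code %d" % returncode


def probe_directories(command, directories, timeout=600):
    """Run `command` once with each directory as the working directory.

    `command` is an argument list, e.g. ["/abs/path/to/gsim_bat", ...];
    the arguments the program needs are left to the caller.
    Returns a dict mapping each directory to the outcome of its run.
    """
    results = {}
    for directory in directories:
        path = Path(directory)
        if not path.is_dir():
            results[str(path)] = "missing directory"
            continue
        try:
            proc = subprocess.run(
                command,
                cwd=path,
                timeout=timeout,
                stdout=subprocess.DEVNULL,
                stderr=subprocess.DEVNULL,
            )
            results[str(path)] = describe_exit(proc.returncode)
        except subprocess.TimeoutExpired:
            results[str(path)] = "timed out"
        except OSError as exc:
            results[str(path)] = "could not start: %s" % exc
    return results


# Example call (the executable path and directory list are placeholders):
# for d, outcome in probe_directories(["/abs/path/to/gsim_bat"],
#                                     ["/scratch/jixie", "/home/jixie"]).items():
#     print("%-40s %s" % (d, outcome))
```

Running the same command list against every candidate disk, on both an 
interactive node and inside a batch job, would give one table of outcomes 
per node type and make it easier to see whether the failure follows the 
directory or the node.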