[Halld-offline] Offline Software Meeting Minutes, June 29, 2018

Mark Ito marki at jlab.org
Fri Jun 29 15:52:53 EDT 2018


Folks,

Please find the minutes below and at

https://halldweb.jlab.org/wiki/index.php/GlueX_Offline_Meeting,_June_29,_2018#Minutes

   -- Mark

_______________________


    GlueX Offline Meeting, June 29, 2018, Minutes

Present:

  * *CMU: * Naomi Jarvis
  * *FIU: * Mahmoud Kamel
  * *FSU: * Sean Dobbs
  * *JLab: * Alex Austregesilo, Mark Ito (chair), David Lawrence, Simon
    Taylor, Beni Zihlmann
  * *W&M: * Justin Stevens
  * *Yerevan: * Hrach Marukyan

There is a recording of this meeting <https://bluejeans.com/s/Gl5u7/> on 
the BlueJeans site. Use your JLab credentials to access it.


      Announcements

 1. Sim-Recon 2.26.0
    <https://mailman.jlab.org/pipermail/halld-offline/2018-June/003228.html>.
    A periodic release. See the release notes for changes from the last
    such release.
 2. GlueX Root Analysis 0.3
    <https://github.com/JeffersonLab/gluex_root_analysis/releases/tag/0.3>.
    There has not been a new release for about a year. This one was overdue.


      Review of minutes from the June 15 meeting

We went over the minutes 
<https://halldweb.jlab.org/wiki/index.php/GlueX_Offline_Meeting,_June_15,_2018#Minutes>. 


  * The Experimental Readiness Review for GlueX II was held last Monday.
    David noted that Eugene Chudakov has already circulated the
    preliminary report of the committee.
  * David mentioned that we may want to revisit our estimate of the
    amount of Monte Carlo we need. It accounts for nearly half of our
    current projections. This is in line with what we have always said,
    but bears re-examination.


      Coming Computing Resources

Mark flipped through Chip Watson's talk at the June 28 Round Table 
<https://www.jlab.org/indico/event/247/session/5/contribution/14/material/slides/0.pdf>. 


Increases to the core count on the farm coming this summer were listed 
on slide 6:

  * Current system: 3.5k cores (scaled to Broadwell)
  * Major farm upgrade due in July: 88 dual 20 core Skylake compute
    nodes (farm18), adds 3.5k cores (100% gain)
  * Retiring LQCD cluster to be shared for 6 months, 250 dual 8 core
    (2012 Sandy Bridge) compute nodes, adds 2.4k cores
  * Size for the Fall run: 9.4k cores (up 2.7x)
  * Note: 2.7k cores go end of life mid way through FY2019, and we might

add only 1.8k new, dropping onsite capacity to 8.5k cores, still up 150%.

For all of the talks presented at the Round Tables see the Indico site 
<https://www.jlab.org/indico/event/247/>.


      Report from the June 28 SciComp Meeting

Mark and David gave the report.

  * SciComp has replaced PBS with Slurm <https://slurm.schedmd.com/> as
    the underlying batch scheduler for the LQCD cluster and are looking
    to do the same for the Experimental Nuclear Physics (ENP) cluster.
    We now have Auger sitting on top of PBS and SWIF sitting on top of
    Auger. They are looking at eliminating the Auger layer.
  * SciComp is planning on allowing LQCD jobs to run on the ENP cluster
    at low priority. The reverse will not be allowed. Jobs submitted to
    ENP will be able to pre-empt LQCD jobs (kill them to make room if no
    job slots are free).


      Missing CDC hits in recent bggen launch

Alex A. reported on the problem. All CDC hits are apparently missing. 
Cause of the problem is not known. See his presentation of the evidence 
starting at 40:40 in the recording <https://bluejeans.com/s/Gl5u7/>.

We will have to re-run.


      Splitting up Sim-Recon

Mark reported that there was still some remaining work to get to a 
production system.

 1. *Environment variable changes*. Rather than HALLD_HOME for
    sim-recon, we will have HALLD_SIM_HOME for the halld_sim repository
    (originally gluex_sim) and HALLD_RECON_HOME for the halld_recon
    repository (originally gluex_recon). We decided that the "halld"
    name reflected the fact that non-GlueX experiments will be using the
    same software.
 2. *Preserving branches in halld_recon*. There were some details to be
    worked out to have the branches from sim-recon transmit to halld_recon.
 3. *Removing sim pieces from gluex_recon*. This had not been done.
    Removal necessitates changes in the build system.
 4. *Dealing with build_scripts*. Build_scripts has not been modified
    for the new configuration. All work so far has been in getting a
    working build.


      HDGeant4 Meeting

At the last collaboration meeting Richard suggested having a meeting 
dedicated to issues related to the Geant4 version of our simulation. The 
first meeting will be Friday, July 6 at 10 pm. We will likely meet 
bi-weekly in the slot opposite the Offline Software meetings.


      Review of Pull Requests and Software Help Topics

We went over recent pull requests 
<https://github.com/JeffersonLab/sim-recon/pulls?q=is%3Aclosed+is%3Apr> 
and recent discussion on the GlueX Software Help List 
<https://groups.google.com/forum/#%21forum/gluex-software> without major 
discussions.

-- 
Mark Ito, marki at jlab.org, (757)269-5295

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mailman.jlab.org/pipermail/halld-offline/attachments/20180629/f062c5dd/attachment-0001.html>


More information about the Halld-offline mailing list