[Halld-offline] Software Meeting Minutes, January 21, 2020

Mark Ito marki at jlab.org
Wed Jan 22 19:13:04 EST 2020


Colleagues,

Please find the meeting minutes here 
<https://halldweb.jlab.org/wiki/index.php/GlueX_Software_Meeting,_January_21,_2020#Minutes> 
and below.

   -- Mark


    GlueX Software Meeting, January 21, 2020, Minutes

Present:

  * *CMU *: Naomi Jarvis
  * *FSU *: Sean Dobbs
  * *JLab *: Alex Austregesilo, Mark Ito (chair), Igal Jaegle, David
    Lawrence, Keigo Mizutani, Justin Stevens, Simon Taylor
  * *ODU *: Nilanga Wickramaarachchi

There is a recording of his meeting <https://bluejeans.com/s/ARHwq/> on 
the BlueJeans site. Use your JLab credentials to access it.


      Announcements

Mark reminded us about the recent upgrade release for halld_sim and 
hdgeant4 (version_4.13.0.xml) 
<https://mailman.jlab.org/pipermail/halld-offline/2020-January/003873.html>, 
and the corresponding recon launch releases 
<https://mailman.jlab.org/pipermail/halld-offline/2020-January/003874.html>. 



      Review of Minutes from the Last Software Meeting

We went over the minutes from the meeting on January 7 
<https://halldweb.jlab.org/wiki/index.php/GlueX_Software_Meeting,_January_7,_2020#Minutes>. 
As part of the deployment of the new Lustre-based disk space, we will be 
moving our volatile partition to a home on a new system and doubling its 
size, from 60 TB to 120 TB. Mark will send out an announcement on how 
this will work. The change is scheduled for February 4.


      Review of Minutes from the Last HDGeant4 Meeting

We went over the minutes from January 14 
<https://halldweb.jlab.org/wiki/index.php/HDGeant4_Meeting,_January_14,_2020#Minutes>. 
Peter Pauli posted a study under Issue #93: Calorimeter timing mismatch 
between g3 and g4 <https://github.com/JeffersonLab/HDGeant4/issues/93>. 
He sees a significant difference in timing between G3 and G4 for the 
slow kaon in his reaction, not unlike that seen in other channels.


      Default CCDB server on farm: switch back to sqlite

Mark described the problem that led to the switch back to using SQLite 
files 
<https://mailman.jlab.org/pipermail/halld-offline/2020-January/003866.html> 
for farm jobs. The root problem is when a couple thousand jobs start on 
the farm at the same time. The MySQL servers get CPU-bound and jobs have 
to wait on their constants (delays of an hour or so when things are 
bad). Under lesser loads, there is no problem. There are three paths 
being pursued to fix the problem:

 1. Marty Wise of the Computer Center has called a meeting for tomorrow
    to discuss adding more servers to hallddb-farm (a DNS alias for a
    combination of hallddb-a and hallddb-b.jlab.org).
 2. CCDB 2.0 has a new feature: creation of an intermediate table that
    does the join needed for calibration constant retrieval in advance.
    Dmitry Romanov and Mark are working on getting this version deployed.
 3. Mark has started looking at the idea David proposed some time ago of
    making a smaller database that services only a subset of run
    numbers. Mark succeeded in making a version of CCDB, valid for a
    single run, with an "assignments" table a factor of 20 smaller than
    nominal and about a quarter of the disk footprint. Performance
    improvements have not been measured. This effort is in its early stages.

We will likely settle on some combination of these approaches.

  * Alex pointed out that his launches use a
    only-used-for-launches-SQLite file and thus present no load to
    either the database servers or the SQLite mirrors on the work disk.
  * Justin noted that we could separate the load on the servers on a
    project-by-project basis: certain launches could use hallddb-farm
    while the individual user gets directed to the SQLite mirrors.


      Review of recent issues and pull requests

  * halld_sim Pull Request #111 Ijaegle primex
    <https://github.com/JeffersonLab/halld_sim/pull/111>. Sean expressed
    concern about a generator, built by default, that requires execution
    of shell scripts with non-generic paths and a gcc version greater or
    equal to 5.0. Igal explained that the non-default-ness would not
    hamper builds of the master branch, but may not give a usable
    product for amateurs.
  * *RCDB is_dirc_production queries*. These do not work at present on
    the website. Dmitry Romanov has been contacted via email.


      Review of recent discussion on the GlueX Software Help List

We reviewed the list 
<https://groups.google.com/forum/#!forum/gluex-software> without comment.


      Action Item Review

 1. Fix is_dirc_production on the RCDB webpage (Dmitry)
 2. Get Igal's PrimEx generator working on the farm. (Igal, Mark)
 3. Fix the database server so that CCDB access is not an issue (Mark et
    al.)
 4. Announce the move to a new volatile Lustre system (Mark)

Retrieved from 
"https://halldweb.jlab.org/wiki/index.php?title=GlueX_Software_Meeting,_January_21,_2020&oldid=95902"

  * This page was last modified on 22 January 2020, at 19:08.

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mailman.jlab.org/pipermail/halld-offline/attachments/20200122/46d525b2/attachment-0001.html>


More information about the Halld-offline mailing list