[Halld-offline] Offline Software Meeting Minutes, April 6, 2018
Mark Ito
marki at jlab.org
Tue Apr 10 17:56:26 EDT 2018
Folks,
Please find the minutes below and at
https://halldweb.jlab.org/wiki/index.php/GlueX_Offline_Meeting,_April_6,_2018#Minutes
.
-- Mark
___________________________________
GlueX Offline Meeting, April 6, 2018, Minutes
Present:
* *JLab: * Alex Austregesilo, Amber Boehnlein, Thomas Britton, Mark
Ito (chair), David Lawrence, Simon Taylor, Beni Zihlmann
There is a recording of this meeting <https://bluejeans.com/s/meuB8/> on
the BlueJeans site. Use your JLab credentials to access it.
Announcements
1. *New tag of sim-recon for simulation: recon-2017_01-ver02-sim_ver01
<https://mailman.jlab.org/pipermail/halld-offline/2018-March/003137.html>.*
This tag is meant for use by simulations.
2. *Meeting on Containers, 11:30 today, A110.* Note that future
containers meetings will be in A110 as well.
3. *Launches.* Alex A. is starting a new analysis launch on 2016 data
for Lubomir Pentchev.
4. *Software Review, Summer 2018.* Mark reported that there are plans
for another software review, perhaps early in the Summer.
5. *SQLiteCpp and SQLite libraries.* Mark reports that there are still
some residual problems with these libraries in the builds. He is
working on a solution and a new version should be coming out soon
with fixes. One item holding up the release is checking whether the
recent fix to the memory leak problem in tracking does the job.
Review of minutes from the March 9 meeting
We went over the minutes
<https://halldweb.jlab.org/wiki/index.php/GlueX_Offline_Meeting,_March_9,_2018#Minutes>
without much comment.
Report from the GlueX Containers meeting on March 30
Mark reviewed his notes from the meeting
<https://halldweb.jlab.org/wiki/index.php/GlueX_Containers_Meeting,_March_30,_2018>.
Two major topics from that meeting:
1. Mark reviewed the talk he gave at the OSG All-Hands meeting on
March ??? He emphasized the new user-friendly access to the OSG for
GlueX collaborators and raised the idea of doing raw data
reconstruction on the OSG.
2. Richard reported on work he has done using XROOTD to stream data
over the network as input to jobs running on the OSG. This solves
the problem of getting our data to a remote grid node when the
identity/location of the node is unknown until the job starts. It is
the solution that ATLAS and CMS have been using for years and relies
on mature technology. We hope to set up a system on the Submit Host
in the near future.
For all of the details, see the notes at the link given above.
Progress on Running Simulation on the OSG
Thomas described the latest go-around submitting jobs to the OSG using
MCwrapper.
* There was no wait for jobs to start up. They were running minutes
after they were submitted.
* We observed use of a large fraction (30-40%) of the 1 Gb interface
on the Submit Node at multi-job-start-up time.
* Amber mentioned that in the interim report to the Office of Nuclear
Physics, the ability to plan large simulation campaigns, outside of
peak demand relief, was not mentioned. Neither was the ability to
reconstruct raw data.
* Amber reported that NERSC is committed to devoting resources to GlueX
reconstruction as part of their strategic agenda. They need to have
specific deliverables and so we need to identify appropriate
data-sets/code-versions for running there. We hope to do this this
summer.
* Alex mentioned bookkeeping as an issue when running on the OSG.
Indeed, when the data challenges were run, the OSG component was not
tracked with a database; only the JLab-resident jobs were, using
a database developed by Mark. On the batch farm, using SWIF, Alex is
tracking jobs with the SWIF database (although we do not have
SQL-query level access to the SWIF database). Mark thought that
bookkeeping is actually where most of the GlueX development manpower
will be spent in getting these large-scale campaigns under control.
Functions include basic job info, resource use, job status,
evaluation of job success, re-submission on failure, and permanent
storage and cataloging of output files. Amber mentioned a Fermilab
project that we might want to look at. She remarked that the need is
global to all groups at the lab and beyond and a common solution
would avoid duplication of effort.
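
As an illustration only, the bookkeeping functions listed above (basic
job info, resource use, job status, evaluation of success, re-submission
on failure, cataloging of output files) could be backed by a single job
table. The following is a minimal sketch, not a description of any
existing GlueX or SWIF database: the table layout, the function names,
the retry limit, and the success criterion (zero exit code plus an
output file) are all assumptions made for the example.

```python
import sqlite3

# Hypothetical schema: all table and column names are assumptions
# made for illustration only.
SCHEMA = """
CREATE TABLE jobs (
    job_id      INTEGER PRIMARY KEY,
    site        TEXT NOT NULL,
    status      TEXT NOT NULL,
    attempts    INTEGER NOT NULL DEFAULT 1,
    cpu_seconds REAL,
    output_file TEXT
);
"""

MAX_ATTEMPTS = 3  # assumed retry limit


def open_catalog(path=":memory:"):
    """Create the job table in a new SQLite database and return the connection."""
    conn = sqlite3.connect(path)
    conn.executescript(SCHEMA)
    return conn


def submit(conn, job_id, site):
    """Record basic job info at submission time."""
    conn.execute(
        "INSERT INTO jobs (job_id, site, status) VALUES (?, ?, 'submitted')",
        (job_id, site),
    )


def finish(conn, job_id, exit_code, cpu_seconds, output_file=None):
    """Evaluate job success and record resource use and output location."""
    ok = exit_code == 0 and output_file is not None
    conn.execute(
        "UPDATE jobs SET status = ?, cpu_seconds = ?, output_file = ? "
        "WHERE job_id = ?",
        ("succeeded" if ok else "failed", cpu_seconds,
         output_file if ok else None, job_id),
    )


def resubmit_failed(conn):
    """Mark failed jobs under the retry limit as resubmitted; return their ids."""
    rows = conn.execute(
        "SELECT job_id FROM jobs WHERE status = 'failed' AND attempts < ? "
        "ORDER BY job_id",
        (MAX_ATTEMPTS,),
    ).fetchall()
    ids = [row[0] for row in rows]
    conn.executemany(
        "UPDATE jobs SET status = 'submitted', attempts = attempts + 1 "
        "WHERE job_id = ?",
        [(job_id,) for job_id in ids],
    )
    return ids
```

A campaign driver would call submit() once per job, finish() when a job
reports back, and resubmit_failed() periodically. Keeping the state in
an SQL table also gives the query-level access noted above as missing
for the SWIF database.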
HDvis Update
Thomas has added new features to HDvis:
1. Scrubber bar: used to click and drag time-in-event.
2. Camera snap-to's: Fixed views of the detector, such as "barrel top"
or "TOF".
3. Context menus for choosing mass hypothesis: right click and choose
particle type for charged tracks. Tracks redrawn with chosen hypothesis.
He is working on a method for implementing a "next event" that will work
for multiple users running independent browsers on the same node. See
the demo displaying a J/ψ event at ??? in the recording.
Monitoring Dashboard
Sean sent email proposing a monitoring dashboard (with colors) that
would display all relevant monitoring results at a glance with links for
digging deeper when necessary.
* Alex would like to see things like ρ yield.
* We might want to add Monte Carlo data to the recon_test.
* There is a simulation test that runs twice a week. Not everyone is on
the email list for results.
* David is looking at InfluxDB as a way to look at selected quantities
as a function of time or run number.
* We will meet with Sean to flesh out issues when he is here next week.
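
To make the InfluxDB idea above concrete: InfluxDB ingests points in a
text "line protocol" of the form measurement, tags, fields, timestamp.
The sketch below formats one monitored quantity tagged by run number.
The measurement name, the tag and field names, and the numbers in the
usage note are hypothetical and chosen only for illustration. The
helper assumes that names and tag values contain no spaces, commas, or
equals signs, so it does no escaping.

```python
def _format_field(value):
    """Render a field value: booleans as true/false, integers with an 'i' suffix."""
    if isinstance(value, bool):
        return "true" if value else "false"
    if isinstance(value, int):
        return f"{value}i"
    if isinstance(value, float):
        return repr(value)
    raise TypeError(f"unsupported field type: {type(value).__name__}")


def to_line_protocol(measurement, tags, fields, timestamp_ns):
    """Format one point as an InfluxDB line-protocol string.

    Assumption: no name or tag value needs escaping. Tags and fields
    are emitted in sorted key order. The timestamp is in nanoseconds.
    """
    if not fields:
        raise ValueError("at least one field is required")
    tag_part = "".join(f",{key}={tags[key]}" for key in sorted(tags))
    field_part = ",".join(
        f"{key}={_format_field(fields[key])}" for key in sorted(fields)
    )
    return f"{measurement}{tag_part} {field_part} {timestamp_ns}"
```

For example, to_line_protocol("recon_monitor", {"run": 30000},
{"rho_yield": 1234.5}, 1523000000000000000) gives the string
"recon_monitor,run=30000 rho_yield=1234.5 1523000000000000000". Storing
the run number as a tag lets the quantity be selected or grouped by
run; the timestamp gives the view as a function of time.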
Action Items
1. Put MC into recon_test.
2. Schedule a monitoring dashboard meeting.
3. Put together a new release that fixes the SQLite issue, MC treatment
of the CDC, and the memory leak.
* This page was last modified on 10 April 2018, at 17:54.
--
Mark Ito, marki at jlab.org, (757)269-5295