[Halld-online] Requirements for the conditions database
Elliott Wolin
wolin at jlab.org
Fri Nov 11 16:01:17 EST 2011
Dear Mark and Dmitry,
Below is an attempt to specify requirements for the conditions
database. Please get back to me if you need additional information...we
can iterate as required.
Thanks,
Elliott
*_Requirements for Hall D Conditions Database_*
Elliott Wolin
11-Nov-2011
_General Remarks_
The Hall D conditions database needs to store a wide variety of types of
data relevant to configuring and running the apparatus and taking data.
The information in the conditions database will be used by the offline
reconstruction and analysis effort, and might further be used for
debugging of online problems. Further, it is important that this data
be easily accessible to the analysis programs, both on- and
off-site. The data must be tagged appropriately, and the schema shown
by Mark at the April 2011 Online meeting seems like it should work.
Note that the conditions database need not duplicate the function of the
EPICS archiver, which records a vast quantity of fine-grained data from
a large number of sensors. Most of this data is not needed by the
offline effort, but it is difficult to predict in advance which data
will or will not be needed. It does make sense, however, to record in
the conditions database selected EPICS data that will surely be used
by the offline effort, for example the solenoid setpoint current at the
beginning of each run.
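As a rough illustration of recording only selected EPICS values at the start of a run, here is a minimal Python sketch. The PV name, the read_pv callable, and the record layout are all assumptions made for illustration; none of them reflects an actual Hall D or EPICS interface.

```python
from typing import Callable, Dict, List, Sequence

# Hypothetical list of EPICS process variables worth keeping for offline use.
# The name below is a placeholder, not a real Hall D PV.
SELECTED_PVS = ("SOLENOID:setpoint_current",)


def snapshot_at_run_start(
    run: int,
    read_pv: Callable[[str], float],
    pv_names: Sequence[str] = SELECTED_PVS,
) -> List[Dict[str, object]]:
    """Read each selected PV once and return one record per PV, tagged by run."""
    return [
        {"run": run, "name": name, "value": read_pv(name)}
        for name in pv_names
    ]


if __name__ == "__main__":
    # Stand-in reader with a made-up value; a real one would query EPICS.
    fake_values = {"SOLENOID:setpoint_current": 0.0}
    for record in snapshot_at_run_start(1, fake_values.__getitem__):
        print(record)
```

Passing the reader in as a callable keeps the selection logic independent of whichever EPICS client library is eventually used.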
_Input Requirements_
Data will be input to the conditions database in a variety of ways
during experiment operations. These include input from C++ and Java
programs and from Python and shell (and perhaps Perl) scripts. Some
input will come from long-running programs or scripts, some from
short-lived programs or scripts launched as part of a run control
transition, and some from cron jobs.
Entry size will vary from single numbers or short strings to long
strings containing many thousands of characters.
Entry frequency varies widely, with flurries of activity expected at the
beginning and end of data taking runs and relatively few entries
during steady-state data taking.
Entry rates will peak within about 20 seconds after the start of a new
run, with something like 10-20 processes making about 100-200 entries
with a maximum aggregate size of less than 1 MB. Note that the time
these entries take to complete is not critical, as they will be made by
programs and scripts running in the background.
API requirements are not particularly strict, as only a few experienced
members of the online group will be involved in making entries into the
conditions database.
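To make the input side concrete, here is a minimal Python sketch of what an entry call could look like. It uses SQLite from the standard library purely as a stand-in; the table layout (run, name, value, created) and the function names open_db and add_entry are assumptions for illustration, not the schema shown by Mark or any existing API.

```python
import sqlite3
import time


def open_db(path: str = ":memory:") -> sqlite3.Connection:
    """Open (and if needed create) a stand-in conditions table."""
    # A generous busy timeout suits background writers that can afford to wait
    # while several processes write at the start of a run.
    conn = sqlite3.connect(path, timeout=30.0)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS conditions ("
        " run INTEGER NOT NULL,"
        " name TEXT NOT NULL,"
        " value TEXT NOT NULL,"
        " created REAL NOT NULL)"
    )
    return conn


def add_entry(conn: sqlite3.Connection, run: int, name: str, value: object) -> None:
    """Store one entry tagged by run number and data type name."""
    # Values are stored as text so that single numbers and long strings
    # go through the same call.
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "INSERT INTO conditions (run, name, value, created)"
            " VALUES (?, ?, ?, ?)",
            (run, name, str(value), time.time()),
        )


if __name__ == "__main__":
    db = open_db()
    add_entry(db, 1, "example_number", 3.5)           # a single number
    add_entry(db, 1, "example_config", "x" * 50000)   # a long string
    print(db.execute("SELECT COUNT(*) FROM conditions").fetchone()[0])
```

A call this small would be easy to mirror from C++, Java, or a shell wrapper, which is the main point of the sketch.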
_Retrieval Requirements_
These are largely driven by offline requirements, and are almost
identical to those of the calibration database. It is highly desirable
that the API and access mechanisms for the conditions and calibration
databases be as similar as possible.
An important consideration concerns export of the conditions database to
other sites. Some of the data entered will only be useful to the online
group, although it is never clear in advance whether any particular data
type will be needed by the offline effort. A mechanism needs to be
provided that allows
selective export of the database contents, e.g. by run range and data type.
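A selective export by run range and data type could be sketched as follows. This is only an illustration in Python with SQLite as a stand-in; the conditions table layout and the export_selection function are hypothetical, and a real export would follow whatever schema is finally adopted.

```python
import sqlite3
from typing import Sequence

# Hypothetical minimal layout: one row per (run, data type name, value).
SCHEMA = (
    "CREATE TABLE IF NOT EXISTS conditions ("
    " run INTEGER NOT NULL, name TEXT NOT NULL, value TEXT NOT NULL)"
)


def export_selection(
    src: sqlite3.Connection,
    dst: sqlite3.Connection,
    run_min: int,
    run_max: int,
    names: Sequence[str],
) -> int:
    """Copy rows for runs in [run_min, run_max] and the given data types."""
    if not names:
        return 0
    placeholders = ", ".join("?" for _ in names)
    rows = src.execute(
        "SELECT run, name, value FROM conditions"
        " WHERE run BETWEEN ? AND ?"
        f" AND name IN ({placeholders})",
        (run_min, run_max, *names),
    ).fetchall()
    with dst:  # one transaction for the whole export
        dst.execute(SCHEMA)
        dst.executemany(
            "INSERT INTO conditions (run, name, value) VALUES (?, ?, ?)", rows
        )
    return len(rows)


if __name__ == "__main__":
    src = sqlite3.connect(":memory:")
    src.execute(SCHEMA)
    src.executemany(
        "INSERT INTO conditions (run, name, value) VALUES (?, ?, ?)",
        [
            (100, "solenoid_current", "1"),
            (100, "online_debug", "x"),
            (200, "solenoid_current", "2"),
        ],
    )
    dst = sqlite3.connect(":memory:")
    print(export_selection(src, dst, 50, 150, ["solenoid_current"]))
```

Filtering on both the run range and an explicit list of data types keeps online-only entries out of the exported copy unless someone asks for them.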
--
Sincerely,
Elliott
================================================================================
Those raised in a morally relative or neutral environment will hold
no truths to be self-evident.
Elliott Wolin
Staff Physicist, Jefferson Lab
12000 Jefferson Ave
Suite 8 MS 12A1
Newport News, VA 23606
757-269-7365
================================================================================