[Sbs_software] Hydra online monitoring - plots and labeling

Ole Hansen ole at jlab.org
Fri May 6 12:12:01 EDT 2022


Hi everyone,

our friends in the EPSCI group have more or less finished porting Hall 
D's AI-based online monitoring system, Hydra, to Hall A/SBS. The way it 
works is roughly like this:

- We run the 10k/50k replay as we usually do
- A slightly modified version of Panguin writes the output plots not 
only to PDFs but also to individual image files
- Hydra detects new image files on disk and uses ML inference to 
classify each one
- If there are anomalies, i.e. Hydra thinks a plot is "bad", an alarm sounds
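
For those who like to see things in code, the detect-and-classify steps above could be sketched roughly as follows. This is only an illustration, not Hydra's actual implementation: the function names, the set of image suffixes, and the `classify` callable (a stand-in for the ML inference step) are all assumptions.

```python
from pathlib import Path

# Assumed image formats; the real set of formats is not specified here.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg"}


def find_new_images(directory, seen):
    """Return image files in `directory` that are not yet in `seen`.

    Newly found files are added to `seen`, so a repeated call only
    reports files that appeared since the previous call.
    """
    new_files = sorted(
        p for p in Path(directory).iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_SUFFIXES and p not in seen
    )
    seen.update(new_files)
    return new_files


def check_plots(directory, seen, classify):
    """Classify each new image and return the ones labeled "bad".

    `classify` is a hypothetical placeholder for the inference step:
    any callable that takes a path and returns a label string.
    """
    return [p for p in find_new_images(directory, seen) if classify(p) == "bad"]
```

A monitoring loop would call `check_plots` periodically with the same `seen` set and raise an alarm whenever the returned list is non-empty.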

Unlike our current alarm handler, AI-based image classification can 
detect subtle problems, like hot channels, unusually low yield, 
distorted spectra, etc. Of course, it can also get hopelessly confused. 
We almost certainly will have to fine-tune things when the next run 
period starts.

Now, for Hydra to work, its ML model needs to be trained with existing 
data. For this purpose, we've run Panguin over the ROOT files from the 
GMn experiment, extracted images of the plots (only a subset so far; the 
macro-based Panguin plots aren't there yet), and loaded a few of them into the Hydra 
database for labeling.

Labeling means that a human expert has to manually classify each plot. 
(This only needs to be done once, of course, after which the model makes 
predictions on its own.) The labeling can be done through a web 
interface, which also gives a nice overview of the run-by-run data 
quality for each individual plot.

It would be great if our online monitoring experts (Paul, ...) could 
take a peek at the labeler web page (CUE login required) and give us 
some feedback:

https://hallaweb.jlab.org/hydra/labeler.html

It should be mostly self-explanatory, but instructions are here:

https://docs.google.com/document/d/1h4AC5UL_DB4Ugjw1GlybNdaq23uAYrc9m0R37sTxG1s/edit?usp=sharing

The page will probably tell you that you do not have permission to label 
anything, but you can look around. We need to decide who should be given 
write access.

If you don't see any plots, choose a different plot type from the 
drop-down list (top center) or switch on Editor Mode (top left).

As you will see, the database contains only 5 different types of plots 
at the moment. These are a few simple ones we uploaded for testing. We 
need to decide on the full set of plots we'd like to have monitored by 
Hydra.

The images that Panguin extracted for each run are in these directories 
on the counting house computers:

/chafs2/work1/hydra/<run_number>/

Again, it would be great if our online experts could look at these plots 
and make a list of the ones worth monitoring, along with a list of plots 
that are still missing but should definitely be monitored. As I said, the 
macro-based Panguin plots aren't there yet.
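
To get a quick overview before making such a list, something like the following sketch might help. It counts in how many run directories each image file name appears. It assumes that each image's file name identifies the plot type, which may not match how Panguin actually names its output, and `count_plot_names` is just an illustrative helper.

```python
from collections import Counter
from pathlib import Path


def count_plot_names(base_dir):
    """Count in how many run subdirectories each file name appears.

    Assumption: one subdirectory per run, and the file name of each
    image identifies the plot type.
    """
    counts = Counter()
    for run_dir in Path(base_dir).iterdir():
        if run_dir.is_dir():
            # Use a set so each name is counted at most once per run.
            counts.update({p.name for p in run_dir.iterdir() if p.is_file()})
    return counts


# Example use on the counting house machines (base path from this message):
#   for name, n_runs in sorted(count_plot_names("/chafs2/work1/hydra").items()):
#       print(f"{n_runs:5d}  {name}")
```

Plots that show up in only a few runs would stand out in the output and may be worth a closer look.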

I'll be happy to give a 10-minute demonstration of the labeler and 
answer questions at the software meeting later today.

Ole



