[Halld-offline] can GlueX use idle cores at JLab?
Sandy Philpott
philpott at jlab.org
Thu Apr 2 08:44:01 EDT 2015
Hi again,
Thanks - I see lots of jobs came in... but the memory requirements aren't a good match with the Haswells. Attached is the visual impact of the gxproj2 jobs able to use only 14 of the 24 cores on the farm14 nodes -- these are all long-running jobs but are a bit of a bad fit to the hardware. Can the jobs be adjusted to run mutli-core, or use less memory per single job? The goal is 1400 MB or below on these nodes, as they have 32 GB RAM. Can the jobs be shorted to the default 24 hour walltime to be better farm users? This would allow other jobs to get slots without having to wait the full >2 days for these to finish.
Input / feedback welcome,
Sandy
From: "Mark Ito" <marki at jlab.org>
To: halld-offline at jlab.org
Sent: Tuesday, March 31, 2015 4:41:18 PM
Subject: [Halld-offline] Fwd: can GlueX use idle cores at JLab?
from Sandy Philpott
-------- Forwarded Message --------
Subject: can GlueX use idle cores at JLab?
Date: Tue, 31 Mar 2015 11:52:19 -0400 (EDT)
From: Sandy Philpott <philpott at jlab.org>
To: halld-offline at jlab.org
CC: Heyes Graham <heyes at jlab.org> , Chip Watson <watson at jlab.org> , Mark Ito <marki at jlab.org> , David Lawrence <davidl at jlab.org>
Hello GlueX,
The newest Haswell farm14 nodes of 2400 cores at JLab have been mostly idle since their installation last fall. That's much of 1.7 M Haswell core-hours each month that are largely unused, or almost 5 M core hours of idle time so far.
Could Hall D keep simulation jobs in the queue and running indefinitely, rather than just during times of the data challenges? Are there other jobs to run? Otherwise, many of the available computing cycles in the farm for Experimental Physics are falling on the floor rather than being used.
Feedback and perspective welcome,
Sandy
_______________________________________________
Halld-offline mailing list
Halld-offline at jlab.org
https://mailman.jlab.org/mailman/listinfo/halld-offline
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mailman.jlab.org/pipermail/halld-offline/attachments/20150402/38662450/attachment-0002.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: farm_gluex_mem.png
Type: image/png
Size: 12204 bytes
Desc: not available
URL: <https://mailman.jlab.org/pipermail/halld-offline/attachments/20150402/38662450/attachment-0002.png>
More information about the Halld-offline
mailing list