<html xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<meta name="Generator" content="Microsoft Word 15 (filtered medium)">
<style><!--
/* Font Definitions */
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0in;
margin-bottom:.0001pt;
font-size:11.0pt;
font-family:"Calibri",sans-serif;}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:blue;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{mso-style-priority:99;
color:purple;
text-decoration:underline;}
p.msonormal0, li.msonormal0, div.msonormal0
{mso-style-name:msonormal;
mso-margin-top-alt:auto;
margin-right:0in;
mso-margin-bottom-alt:auto;
margin-left:0in;
font-size:11.0pt;
font-family:"Calibri",sans-serif;}
span.apple-converted-space
{mso-style-name:apple-converted-space;}
span.EmailStyle19
{mso-style-type:personal-reply;
font-family:"Calibri",sans-serif;
color:windowtext;}
.MsoChpDefault
{mso-style-type:export-only;
font-size:10.0pt;}
@page WordSection1
{size:8.5in 11.0in;
margin:1.0in 1.0in 1.0in 1.0in;}
div.WordSection1
{page:WordSection1;}
--></style>
</head>
<body lang="EN-US" link="blue" vlink="purple">
<div class="WordSection1">
<p class="MsoNormal">Hi Nathan,<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">I wanted to know if there were changes in the memory footprint of the reconstruction application and if SLURM is reacting to resident memory and not the virtual memory. I have to say that your previous email answered my questions.<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">-Vardan<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">--------------------------------------------------<br>
Vardan H. Gyurjyan, Ph.D.<br>
Staff Scientist<br>
Thomas Jefferson Accelerator Facility<br>
Newport News, VA, 23606<br>
E-mail: gurjyan@jlab.org<br>
757-269-5879 (JLAB)<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<div style="border:none;border-top:solid #B5C4DF 1.0pt;padding:3.0pt 0in 0in 0in">
<p class="MsoNormal"><b><span style="font-size:12.0pt;color:black">From: </span></b><span style="font-size:12.0pt;color:black">Nathan Baltzell <baltzell@jlab.org><br>
<b>Date: </b>Tuesday, February 7, 2023 at 9:37 PM<br>
<b>To: </b>Vardan Gyurjyan <gurjyan@jlab.org><br>
<b>Cc: </b>clas12 software <clas12_software@jlab.org><br>
<b>Subject: </b>Re: [Clas12_software] jlab batch job memory requests<o:p></o:p></span></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<p class="MsoNormal">And for exclusive jobs, the memory request is moot anyway. What are you really after here?<o:p></o:p></p>
<div>
<p class="MsoNormal"><br>
<br>
<o:p></o:p></p>
<blockquote style="margin-top:5.0pt;margin-bottom:5.0pt">
<div>
<p class="MsoNormal">On Feb 7, 2023, at 11:14 AM, Nathan Baltzell <<a href="mailto:baltzell@jlab.org">baltzell@jlab.org</a>> wrote:<o:p></o:p></p>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<div>
<div>
<div>
<p class="MsoNormal">Hi Vardan,<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal">What is the "reconstruction application"? The number I quoted was for standard simulation jobs. The larger jobs used for real data of course depend on the size of the job, but never more than 1 GB per thread.<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal">-Nathan<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal"><br>
<br>
<o:p></o:p></p>
<blockquote style="margin-top:5.0pt;margin-bottom:5.0pt">
<div>
<p class="MsoNormal">On Feb 7, 2023, at 11:11 AM, Vardan Gyurjyan <<a href="mailto:gurjyan@jlab.org">gurjyan@jlab.org</a>> wrote:<o:p></o:p></p>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<div>
<div>
<p class="MsoNormal">What is the reconstruction application's resident (not the virtual) memory usage for the exclusive usage of a node?<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">--------------------------------------------------<br>
Vardan H. Gyurjyan, Ph.D.<br>
Staff Scientist<br>
Thomas Jefferson Accelerator Facility<br>
Newport News, VA, 23606<br>
E-mail:<span class="apple-converted-space"> </span><a href="mailto:gurjyan@jlab.org"><span style="color:purple">gurjyan@jlab.org</span></a><br>
757-269-5879 (JLAB)<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
<div style="border:none;border-top:solid #B5C4DF 1.0pt;padding:3.0pt 0in 0in 0in">
<div>
<p class="MsoNormal"><b><span style="font-size:12.0pt">From:<span class="apple-converted-space"> </span></span></b><span style="font-size:12.0pt">Clas12_software <<a href="mailto:clas12_software-bounces@jlab.org">clas12_software-bounces@jlab.org</a>> on behalf
of Nathan Baltzell via Clas12_software <<a href="mailto:clas12_software@jlab.org">clas12_software@jlab.org</a>><br>
<b>Reply-To:<span class="apple-converted-space"> </span></b>Nathan Baltzell <<a href="mailto:baltzell@jlab.org">baltzell@jlab.org</a>><br>
<b>Date:<span class="apple-converted-space"> </span></b>Tuesday, February 7, 2023 at 11:05 AM<br>
<b>To:<span class="apple-converted-space"> </span></b>clas12 software <<a href="mailto:clas12_software@jlab.org">clas12_software@jlab.org</a>><br>
<b>Subject:<span class="apple-converted-space"> </span></b>[Clas12_software] jlab batch job memory requests</span><o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
<div>
<p class="MsoNormal">FYI Everyone,<span class="apple-converted-space"> </span><o:p></o:p></p>
</div>
<div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">We've brought up memory efficiency of batch job requests at CLAS12 collaboration and software meetings in previous years. Lots of jobs requesting a lot more memory than they actually use can make the farm unnecessarily idle and significantly
reduce throughput for everyone.<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">*** And now Scicomp has a larger initiative to improve farm efficiency, which includes contacting people running memory-inefficient jobs and potentially throttling their jobs if no action is taken. ***<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">You can check metrics of your batch jobs at:<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal"><a href="https://scicomp.jlab.org/"><span style="color:purple">https://scicomp.jlab.org</span></a> <o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">There's a search feature at 'Slurm Jobs' (left sidebar) -> 'Jobs Query' (top), and 'Recent Jobs' (top), and also 'Memory Efficiency' (top). <o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">Before launching a large number of new types of jobs, you can measure how much memory your jobs use. For example, by submitting a couple jobs and using that website, or by running your job interactively and checking in htop or ps or other
system utilities. And then set your SLURM/SWIF job memory request accordingly.<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">Note, standard CLAS12 simulation jobs (gemc plus recon-util) require less than 1.7 GB of memory.<o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal"> <o:p></o:p></p>
</div>
</div>
<div>
<div>
<p class="MsoNormal">-Nathan<o:p></o:p></p>
</div>
</div>
</div>
</blockquote>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
</div>
</blockquote>
</div>
<p class="MsoNormal"><br>
<br>
<o:p></o:p></p>
</div>
</body>
</html>