[Moller] [Jlab-scicomp-briefs] Scientific Computing Maintencne work completed, farm now running jobs

Bryan Hess bhess at jlab.org
Tue May 26 15:58:56 EDT 2020


The previously announced scientific computing maintenance work has been completed as planned. Thank you for accommodating the pause in job processing while we performed these updates.

If you experience any issues with the farm, storage, or other scientific computing systems please open an incident at https://jlab.servicenowservices.com<https://jlab.servicenowservices.com/nav_to.do?uri=%2Fhome.do%3F>





Last Week's announcement:


On Tuesday, May 26th at 8am the farm will be paused for software maintenance work and the ifarm machines will be rebooted. Once the farm nodes have been updated, jobs will be released and normal operations will resume. No action is required your part, but please see the note below about /farm_out


The following changes will be made:

     *   All farm and ifarm machines will be upgraded to newer Lustre clients
     *   Auger, SWIF, and other systems that rely on Lustre will be shifted to newer Lustre servers to improve performance and remove an operational dependency with the old Lustre system.
     *   The /farm_out default location for farm job standard output and standard error will be moved from Lustre to a new file server better suited to the small IO operations of job logging.

Notes about the new default /farm_out that may affect your batch jobs:

     *   standard output and standard error in /farm_out will be limited to 1GB per user. Jobs that overrun this limit will still run, but their stdout/stderr will be truncated.
     *   files older than 10 days will be automatically removed.
     *   /farm_out files that are large may be compressed or have the middles trimmed to conserve space.

Reminder: End of life for CentOS 7.2/ Please run on CentOS 7.7

     *   Jobs submitted using the "centos7" or "centos72" tags will soon be unsupported. Please migrate your jobs to use "centos77" or "general". The final 7.2 nodes from 2012 are being operated on a best-effort basis and will be taken out of service to make space for new hardware as needed.

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://mailman.jlab.org/pipermail/moller/attachments/20200526/85eba47e/attachment-0001.html>
-------------- next part --------------
_______________________________________________
Jlab-scicomp-briefs mailing list
Jlab-scicomp-briefs at jlab.org
https://mailman.jlab.org/mailman/listinfo/jlab-scicomp-briefs


More information about the Moller mailing list