[Halld-offline] Planned Outage of farm, ifarm, and storage: Monday, August 19th 9:00am to Wednesday, August 21st 9:00am
Alexander Austregesilo
aaustreg at jlab.org
Fri Aug 16 11:41:43 EDT 2024
Dear Colleagues,
Please remember the planned outage of farm, ifarm, and storage from
Monday, August 19th 9:00am to Wednesday, August 21st 9:00am. The farm,
ifarm, SWIF, Globus, tape storage, and Lustre storage (/cache and
/volatile) will be offline during this time window to transition to a
new Lustre parallel filesystem. Farm nodes are reserved starting August
19th at 9:00am. Farm jobs that are scheduled to run during this period
will be queued until the work has been completed. Files stored in /cache
and /volatile will be copied to the filesystem with the following caveats:
* If the final filesystem synchronization runs long, it will be
stopped at 9:00am on August 21. Any uncopied files will be limited to
those written in the days just before the upgrade. /lustre19 will be
maintained as an offline backup for at least one month.
* The /lustre19 path will not be valid after the change. Please use
/cache and /volatile in absolute paths to files.
* As a reminder, an outage for this transition was scheduled because
of a limitation in older hardware that prevents from running both Lustre
filesystems simultaneously.
Steps to help facilitate a speedy Lustre transition on August 19th:
* Consolidate or delete small files on Lustre if possible
* Limit directly writing large file sets to /cache or /volatile the
weekend before the upgrade. This will help to keep the final file system
synchronization from running long.
* Important Note: Using jcache is not a concern because jcache
requests are written to both the old and new Lustre filesystems.
Best regards,
Alex
--
Alexander Austregesilo
Staff Scientist - Experimental Nuclear Physics
Thomas Jefferson National Accelerator Facility
Newport News, VA
aaustreg at jlab.org
(757) 269-6982
More information about the Halld-offline
mailing list