[Hps] [Jlab-scicomp-briefs] Farm Lustre outage resolved - May 25 2023

Bryan Hess via Jlab-scicomp-briefs jlab-scicomp-briefs at jlab.org
Thu May 25 12:17:07 EDT 2023


Overnight, a hardware failure left much of the farm Lustre filesystem (/cache and /volatile) unresponsive. This morning the problem was traced to a failed IO controller in a 60-bay SAS disk shelf, and the component was replaced from shelf spares. Systems are operating normally again. A handful of systems, including ifarm1802, were rebooted to clear problems caused by the outage.


--

This is an announcement-only list for Jefferson Lab Scientific Computing Updates.

Subscription and List Archive: https://mailman.jlab.org/mailman/listinfo/jlab-scicomp-briefs

For help: https://jlab.servicenowservices.com/scicomp

