[Halld-offline] Final Lustre pool not recoverable [Re: CCPR 121020 UPDATE (Lustre/Infiniband issue of 12/31-1/1)]

Mark Ito marki at jlab.org
Thu Jan 19 08:12:12 EST 2017


Bad news...see below.


On 01/19/2017 06:57 AM, ccpr_reply at jlab.org wrote:
> Here is an update to the help request you submitted.
> When you reply to THIS message, Please DO NOT include the original text below.
>
> Mod Date:   2017/01/19
> Mod Time:   06:57:23
> Mod User:   philpott
> Current State:   COMPLETE
> STATE changed from (WORKING) to COMPLETE.
>
>
> --------------------------------------------------------------
> News Jan. 18
> Lustre/ZFS recovery complete; final pool not recoverable
>
> After Intel support was able to recover 4 of the 5 problematic ZFS
> storage targets that became unavailable in Lustre Dec 31, the last
> target is corrupt and cannot be recovered, holding a total of ~8 TB of
> data. Users can remove problematic files on this target from Lustre
> with the "unlink" command.
>
> 									
> --------------------------------------------------------------		
> Here is a copy of your Original Request:
> 	
> Email:     marki at jlab.org
> Name:      Mark Ito
> Username:  marki
> Staff:	   philpott
> Platform:  other,Netscape,20100101 Firefox
> Building:  12_1
> Room:      A104
> Hostname:  129.57.73.156
> Category:  SCIENTIFIC COMPUTING
> Subject:   Lustre/Infiniband issue of 12/31-1/1
> Submitted: 1/3/2017 8:50 AM
> 			
> Request:
> Is there an update on this problem? Should we expect normal operation?
> I am seeing problems on /work/halld on ifarm1401. 'ls' not responding
> from /work/halld/pull_request_test .
>
>
> The following link will bring you to a list of open CCPR's
>
> http://mis.jlab.org/mis/ccpr/ccpr_user/ccprframe_user.html
> 	

-- 
marki at jlab.org, (757)269-5295



More information about the Halld-offline mailing list