[Hallb-engineering] lost communication with torus IOC

Steve Christo christo at jlab.org
Wed Feb 2 07:27:49 EST 2011


The problem has been narrowed down to a loss of communication with the 
accelerator host "eosfs" but MCC has not been able to track down the 
person responsible for it yet. It seems to me that it would be far less 
trouble for us if the boot files resided locally so the IOC didn't have 
to go over the network to find them. There are at least 2 networks to 
negotiate plus at least one other host that is not ours, leaving us more 
vulnerable than we should be. If the reason for keeping the files on the 
accelerator host is that they want to keep track of revisions, then 
isn't possible to have a mirror site for the revisions in the hall that 
the IOC could find without having to go through such a circuitous path?
Steve

Steve Christo wrote:
> Went into to hall at 2:00 because the Torus IOC could not find the boot 
> file on the network. Cycled the power on the torus router but it didn't 
> help this time as it had 2 weeks ago. Am suspecting that a higher level 
> router or accelerator network is at fault. Marco called MCC to try to 
> get help but they called in a software person who, by the way, came to 
> the same conclusion. At the time of this writing, they are trying to 
> raise a network person who can help. Any of you guys know what/where the 
> Torus comms get their boot files from? A router somewhere in the 
> counting house?
> Steve
>
> _______________________________________________
> Hallb-engineering mailing list
> Hallb-engineering at jlab.org
> https://mailman.jlab.org/mailman/listinfo/hallb-engineering
>   





More information about the Hallb-engineering mailing list