[Lerftest-ctrls] RF CPU reboot & iocConsole problems
Sonya Hoobler
sonya at slac.stanford.edu
Fri Sep 21 11:46:28 EDT 2018
Hi Wesley,
I just logged in to look around and things seem improved.
Was something done to address the problems?
Thanks,
Sonya
On Thu, 20 Sep 2018, Sonya Hoobler wrote:
> Hi Wesley,
>
> Thank you for the update and for following up.
>
> I won't do anything until hearing back from you.
>
> Sonya
>
>
>
> On Thu, 20 Sep 2018, Wesley Moore wrote:
>
>> Sonya,
>>
>> Larry rebooted lcls-llrfcpu02. Said it powered back up, but isn't showing
>> any connectivity. Looks the same from my end.
>>
>> lclsfs - can't ssh, but pingable
>> lclsapp1 - seems fine
>> lclsapp2 - can't ssh, but pingable
>> lcls-llrfcpu01 - seems fine
>> lcls-llrfcpu02 - can't ssh, can't ping
>> Control room hosts (lclsl01-03): can't ssh, but pingable
>>
>> Let me follow up with the guy that setup the fileserver and see if we can
>> get that checked out first. We may need to reboot stuff after that's
>> sorted out.
>>
>> Wesley
>>
>> On 9/20/18, 9:39 AM, "Lerftest-ctrls on behalf of Wesley Moore"
>> <lerftest-ctrls-bounces at jlab.org on behalf of wmoore at jlab.org> wrote:
>>
>> Looks like at least lcls-llrfcpu02 needs to be rebooted. Others seem
>> likely as well. The control room hosts aren't connecting either. Have you
>> heard anything from Larry?
>>
>> Wesley
>>
>> On 9/19/18, 7:20 PM, "Lerftest-ctrls on behalf of Sonya Hoobler"
>> <lerftest-ctrls-bounces at jlab.org on behalf of sonya at slac.stanford.edu>
>> wrote:
>>
>> Hi Wesley, all,
>>
>> I just tried a reboot of RF CPU lcls-llrfcpu02 and it never
>> successfully
>> re-booted up.
>>
>> I can't view the boot-up sequence because iocConsole is also no
>> longer
>> working for either CPU:
>>
>> [softegr at lclsapp1 iocCommon]$ iocConsole lcls-llrfcpu01
>> : ssh -x -t -l laci lclsapp2.acc.jlab.org bash -l -c "
>> pyiocscreen.py -t HIOC lcls-llrfcpu01 lclsts1 2001 "
>> Read from socket failed: Connection reset by peer
>> [softegr at lclsapp1 iocCommon]$ iocConsole lcls-llrfcpu02
>> : ssh -x -t -l laci lclsapp2.acc.jlab.org bash -l -c "
>> pyiocscreen.py -t HIOC lcls-llrfcpu02 lclsts1 2002 "
>> Read from socket failed: Connection reset by peer
>>
>> I tried a remote reboot of the terminal server.
>>
>> I also tried ipmitool (and EPICS ipmi) to remotely restart the CPU.
>>
>> But still no signs of life.
>>
>> Perhaps you could take a look at the network and/or locally? We may
>> need a
>> local power-cycle of the CPU and the terminal server. I'm cc'ing
>> Larry
>> Farrish, who may also be able to help with that.
>>
>> This is not super urgent. When you have a chance during your normal
>> working hours, I'd appreciate any help.
>>
>> Thanks,
>> Sonya
>>
>> _______________________________________________
>> Mailing List: Lerftest-ctrls at jlab.org
>> https://mailman.jlab.org/mailman/listinfo/lerftest-ctrls
>> Wiki: https://wiki.jlab.org/lerf/index.php/Network
>>
>>
>>
>> _______________________________________________
>> Mailing List: Lerftest-ctrls at jlab.org
>> https://mailman.jlab.org/mailman/listinfo/lerftest-ctrls
>> Wiki: https://wiki.jlab.org/lerf/index.php/Network
>>
>>
>
More information about the Lerftest-ctrls
mailing list