[Lerftest-ctrls] RF CPU reboot & iocConsole problems
Sonya Hoobler
sonya at slac.stanford.edu
Fri Sep 21 11:54:31 EDT 2018
Hi again,
I see that lcls-llrfcpu02 still does not boot--tftp timeout.
I'll go back to hands-off.
Sonya
On Fri, 21 Sep 2018, Sonya Hoobler wrote:
> Hi Wesley,
>
> I just logged in to look around and things seem improved.
>
> Was something done to address the problems?
>
> Thanks,
> Sonya
>
>
> On Thu, 20 Sep 2018, Sonya Hoobler wrote:
>
>> Hi Wesley,
>>
>> Thank you for the update and for following up.
>>
>> I won't do anything until hearing back from you.
>>
>> Sonya
>>
>>
>>
>> On Thu, 20 Sep 2018, Wesley Moore wrote:
>>
>>> Sonya,
>>>
>>> Larry rebooted lcls-llrfcpu02. Said it powered back up, but isn't showing
>>> any connectivity. Looks the same from my end.
>>>
>>> lclsfs - can't ssh, but pingable
>>> lclsapp1 - seems fine
>>> lclsapp2 - can't ssh, but pingable
>>> lcls-llrfcpu01 - seems fine
>>> lcls-llrfcpu02 - can't ssh, can't ping
>>> Control room hosts (lclsl01-03): can't ssh, but pingable
>>>
>>> Let me follow up with the guy that setup the fileserver and see if we can
>>> get that checked out first. We may need to reboot stuff after that's
>>> sorted out.
>>>
>>> Wesley
>>>
>>> On 9/20/18, 9:39 AM, "Lerftest-ctrls on behalf of Wesley Moore"
>>> <lerftest-ctrls-bounces at jlab.org on behalf of wmoore at jlab.org> wrote:
>>>
>>> Looks like at least lcls-llrfcpu02 needs to be rebooted. Others seem
>>> likely as well. The control room hosts aren't connecting either. Have
>>> you heard anything from Larry?
>>>
>>> Wesley
>>>
>>> On 9/19/18, 7:20 PM, "Lerftest-ctrls on behalf of Sonya Hoobler"
>>> <lerftest-ctrls-bounces at jlab.org on behalf of sonya at slac.stanford.edu>
>>> wrote:
>>>
>>> Hi Wesley, all,
>>>
>>> I just tried a reboot of RF CPU lcls-llrfcpu02 and it never
>>> successfully
>>> re-booted up.
>>>
>>> I can't view the boot-up sequence because iocConsole is also no
>>> longer
>>> working for either CPU:
>>>
>>> [softegr at lclsapp1 iocCommon]$ iocConsole lcls-llrfcpu01
>>> : ssh -x -t -l laci lclsapp2.acc.jlab.org bash -l -c "
>>> pyiocscreen.py -t HIOC lcls-llrfcpu01 lclsts1 2001 "
>>> Read from socket failed: Connection reset by peer
>>> [softegr at lclsapp1 iocCommon]$ iocConsole lcls-llrfcpu02
>>> : ssh -x -t -l laci lclsapp2.acc.jlab.org bash -l -c "
>>> pyiocscreen.py -t HIOC lcls-llrfcpu02 lclsts1 2002 "
>>> Read from socket failed: Connection reset by peer
>>>
>>> I tried a remote reboot of the terminal server.
>>>
>>> I also tried ipmitool (and EPICS ipmi) to remotely restart the CPU.
>>>
>>> But still no signs of life.
>>>
>>> Perhaps you could take a look at the network and/or locally? We may
>>> need a
>>> local power-cycle of the CPU and the terminal server. I'm cc'ing
>>> Larry
>>> Farrish, who may also be able to help with that.
>>>
>>> This is not super urgent. When you have a chance during your normal
>>> working hours, I'd appreciate any help.
>>>
>>> Thanks,
>>> Sonya
>>>
>>> _______________________________________________
>>> Mailing List: Lerftest-ctrls at jlab.org
>>> https://mailman.jlab.org/mailman/listinfo/lerftest-ctrls
>>> Wiki: https://wiki.jlab.org/lerf/index.php/Network
>>>
>>>
>>>
>>> _______________________________________________
>>> Mailing List: Lerftest-ctrls at jlab.org
>>> https://mailman.jlab.org/mailman/listinfo/lerftest-ctrls
>>> Wiki: https://wiki.jlab.org/lerf/index.php/Network
>>>
>>>
>
More information about the Lerftest-ctrls
mailing list