Bug #4596
closed
IPMI looks OK on that machine. However, judging from the error message, it had a problem getting the 'IPMI console'. I haven't looked at the code, but my guess is that it checks for a getty login prompt or something similar. That machine has crashed (it is stuck at a kbd prompt), so you would not get a getty login prompt when accessing it via SOL, which could be why it hit that error.
Or do you think it was just another networking issue cropping up?
- Status changed from New to Closed
- Status changed from Closed to In Progress
Actually.. hmm. IIRC 6233 also errored out with the same message. After the first error, it should have nuked the node (and powercycled it). Can you see if there are other clues in that job's output?
Also, alex said on ceph-qa:
>> 6430: (1147s) collection:cephfs clusters:fixed-3.yaml fs:btrfs.yaml tasks:kclient_workunit_suites_pjd.yaml
>> [Errno 9] Bad file descriptor
(plana 11, 48, 64)
This too. I also couldn't reach the plana 48 console.
- Status changed from In Progress to Resolved
- % Done changed from 0 to 100
Something went wrong when the inktank user got set up on this machine, probably some dropped IPMI commands. I fixed it up. Before the fix, the user list looked like this:
root@sigoto: 12:03 PM :~# ipmitool -I lanplus -H plana48.ipmi.sepia.ceph.com -U root -P XXXXXXXXX user list
ID  Name       Callin  Link Auth  IPMI Msg  Channel Priv Limit
2   root       true    true       true      ADMINISTRATOR
3   planatemp  true    false      true      ADMINISTRATOR
4              true    true       true      ADMINISTRATOR
and now:
root@sigoto: 12:05 PM :~# ipmitool -I lanplus -H plana48.ipmi.sepia.ceph.com -U root -P XXXXXXXXX user list
ID  Name       Callin  Link Auth  IPMI Msg  Channel Priv Limit
2   root       true    true       true      ADMINISTRATOR
3   planatemp  true    false      true      ADMINISTRATOR
4   inktank    true    true       true      ADMINISTRATOR
I also tested SOL using the inktank user.
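For spotting this on other nodes, here is a rough sketch of a check that flags user slots with an empty name in `ipmitool ... user list` output. It assumes the rows look like the ones pasted above (ID, optional name, three true/false flags, privilege limit); the function names `parse_user_list` and `unnamed_user_ids` are made up for this example and are not part of teuthology or ipmitool.

```python
import re

# Assumed row shape, based only on the listings in this ticket:
#   <id> [<name>] <callin> <link auth> <ipmi msg> <priv limit>
# The name is optional, which is exactly the broken case seen on plana48.
# Caveat: a user literally named "true" or "false" would confuse this pattern.
ROW = re.compile(
    r"^(\d+)\s+(.*?)\s*(true|false)\s+(true|false)\s+(true|false)\s+(.+)$"
)


def parse_user_list(text):
    """Return one dict per user row; header and other lines are skipped."""
    users = []
    for line in text.splitlines():
        match = ROW.match(line.strip())
        if match is None:
            continue  # header line or anything that is not a user row
        users.append(
            {
                "id": int(match.group(1)),
                "name": match.group(2),
                "priv": match.group(6),
            }
        )
    return users


def unnamed_user_ids(text):
    """Return the IDs of user slots whose name field is empty."""
    return [user["id"] for user in parse_user_list(text) if not user["name"]]


if __name__ == "__main__":
    sample = """ID Name Callin Link Auth IPMI Msg Channel Priv Limit
2 root true true true ADMINISTRATOR
3 planatemp true false true ADMINISTRATOR
4 true true true ADMINISTRATOR"""
    print(unnamed_user_ids(sample))
```

Feeding it the output captured from the `ipmitool` command above (for example via a pipe into a small wrapper script) would report slot 4 for the first listing and nothing for the second.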