Project

General

Profile

Bug #50370

confusa10 down, unresponsive to ipmi/console

Added by Dan Mick 2 months ago. Updated 29 days ago.

Status:
Resolved
Priority:
Normal
Assignee:
Category:
Infrastructure Hardware
Target version:
-
% Done:

0%

Source:
Development
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Crash signature (v1):
Crash signature (v2):

Description

confusa10 was hanging on an ansible task to restart dockerd; investigating, systemctl stop docker hung, as did start; no useful status from systemctl status docker.

looking at the rest of the machine, I saw several jenkins build jobs that had been started on April 12 (this was afternoon PDT on the 14th). Attempting to strace those jobs hung strace unkillably. Nothing useful in dmesg or tailing syslog, so I decided to reboot, and it didn't come back up. IPMI won't accept a connection.

History

#1 Updated by adam kraitman 2 months ago

  • Status changed from New to In Progress
  • Assignee set to adam kraitman

#2 Updated by adam kraitman 2 months ago

  • Priority changed from High to Normal
  • Severity changed from 1 - critical to 3 - minor

Hey Dan I rebooted the machine now it's running

#3 Updated by adam kraitman 29 days ago

  • Status changed from In Progress to Resolved

Also available in: Atom PDF