Bug #50370

closed

confusa10 down, unresponsive to ipmi/console

Added by Dan Mick about 3 years ago. Updated almost 3 years ago.

Status: Resolved
Priority: Normal
Assignee:
Category: Infrastructure Hardware
Target version: -
% Done: 0%
Source: Development
Tags:
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Crash signature (v1):
Crash signature (v2):

Description

confusa10 was hanging on an ansible task to restart dockerd. Investigating, I found that systemctl stop docker hung, as did systemctl start docker, and systemctl status docker showed nothing useful.

Looking at the rest of the machine, I saw several Jenkins build jobs that had been started on April 12 (this was the afternoon of the 14th, PDT). Attempting to strace those jobs hung strace unkillably. There was nothing useful in dmesg or in the tail of syslog, so I decided to reboot, and the machine didn't come back up. IPMI won't accept a connection.
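For anyone triaging a similar hang: processes that cannot be killed and that hang strace are often in uninterruptible sleep (state D), although that is an assumption here and was not confirmed on confusa10. A minimal sketch of how one might list such processes on a Linux host by reading /proc/<pid>/stat follows; the function names are illustrative and not part of any existing tooling.

```python
import os


def parse_stat(stat_text):
    """Return (pid, comm, state) from the contents of /proc/<pid>/stat."""
    # comm is wrapped in parentheses and may itself contain spaces or ')',
    # so split on the first '(' and the last ')'.
    left = stat_text.index("(")
    right = stat_text.rindex(")")
    pid = int(stat_text[:left].strip())
    comm = stat_text[left + 1:right]
    state = stat_text[right + 1:].split()[0]
    return pid, comm, state


def uninterruptible_processes(proc_root="/proc"):
    """List (pid, comm) for processes currently in state 'D'."""
    found = []
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue  # skip non-process entries such as 'meminfo'
        try:
            with open(os.path.join(proc_root, entry, "stat")) as f:
                pid, comm, state = parse_stat(f.read())
        except (OSError, ValueError):
            continue  # process exited meanwhile, or the file was unreadable
        if state == "D":
            found.append((pid, comm))
    return found


if __name__ == "__main__":
    for pid, comm in uninterruptible_processes():
        print(pid, comm)
```

Reading /proc directly avoids attaching to the stuck processes, which matters if attaching a tracer is itself what hangs.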

Actions #1

Updated by adam kraitman about 3 years ago

  • Status changed from New to In Progress
  • Assignee set to adam kraitman

Actions #2

Updated by adam kraitman about 3 years ago

  • Priority changed from High to Normal
  • Severity changed from 1 - critical to 3 - minor

Hey Dan, I rebooted the machine and it's running now.

Actions #3

Updated by adam kraitman almost 3 years ago

  • Status changed from In Progress to Resolved