Project

General

Profile

Bug #24222

Manager daemon y is unresponsive during teuthology cluster teardown

Added by Sage Weil 8 months ago. Updated 7 months ago.

Status:
Resolved
Priority:
High
Assignee:
-
Category:
-
Target version:
-
Start date:
05/21/2018
Due date:
% Done:

0%

Source:
Tags:
Backport:
mimic,luminous
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:

Description

2018-05-22 00:21:18.397796 mon.b mon.1 172.21.15.70:6789/0 121 : cluster [WRN] Manager daemon y is unresponsive.  No standby daemons available.
2018-05-22 00:21:18.451864 mon.b mon.1 172.21.15.70:6789/0 122 : cluster [DBG] mgrmap e4: no daemons active

at very end of ceph.log. meanwhile, teuthology.log,
2018-05-22T00:19:38.372 INFO:teuthology.misc:Shutting down mds daemons...
2018-05-22T00:19:38.373 INFO:teuthology.misc:Shutting down osd daemons...
...
2018-05-22T00:20:33.291 INFO:teuthology.misc:Shutting down mgr daemons...
...
2018-05-22T00:20:33.465 INFO:teuthology.misc:Shutting down mon daemons...
...
2018-05-22T00:21:52.789 INFO:tasks.ceph:Checking cluster log for badness...

hmm, just before shutdown, we have

2018-05-22T00:19:36.598 INFO:teuthology.orchestra.run.smithi050:Running: "sudo ceph --cluster ceph tell 'mon.*' injectargs -- --no-mon-health-to-clog" 

maybe this needs to include mgr and other daemon health?

/a/sage-2018-05-21_18:33:19-rados-wip-sage3-testing-2018-05-21-1130-distro-basic-smithi/2563080


Related issues

Copied to RADOS - Backport #24245: luminous: Manager daemon y is unresponsive during teuthology cluster teardown Resolved
Copied to RADOS - Backport #24246: mimic: Manager daemon y is unresponsive during teuthology cluster teardown Resolved

History

#1 Updated by Sage Weil 8 months ago

  • Status changed from Verified to Need Review

#2 Updated by Sage Weil 8 months ago

  • Status changed from Need Review to Pending Backport

#3 Updated by Nathan Cutler 8 months ago

  • Copied to Backport #24245: luminous: Manager daemon y is unresponsive during teuthology cluster teardown added

#4 Updated by Nathan Cutler 8 months ago

  • Copied to Backport #24246: mimic: Manager daemon y is unresponsive during teuthology cluster teardown added

#5 Updated by Nathan Cutler 7 months ago

  • Status changed from Pending Backport to Resolved

Also available in: Atom PDF