Actions
Bug #49591
openno active mgr (MGR_DOWN)" in cluster log
Status:
New
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:
0%
Source:
Tags:
Backport:
pacific, octopus, nautilus
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):
Description
seen in nautilus
description: rados/verify/{ceph clusters/{fixed-2 openstack} d-thrash/none msgr-failures/few msgr/random objectstore/bluestore-comp-zlib rados tasks/rados_api_tests validater/valgrind} duration: 2455.64199924469 failure_reason: '"2021-03-02 14:24:52.608800 mon.a (mon.0) 331 : cluster [WRN] Health check failed: no active mgr (MGR_DOWN)" in cluster log'
examining mgr logs:
{ "PG_DEGRADED": { "severity": "HEALTH_WARN", "summary": { "message": "Degraded data redundancy: 1/2 objects degraded (50.000%), 1 pg degraded" }, "detail": [ { "message": "pg 23.1 is active+undersized+degraded, acting [7]" } ] },
2021-03-02 14:43:58.900 7fba2ce69700 0 log_channel(cluster) log [DBG] : pgmap v1257: 40 pgs: 1 active+undersized+degraded, 17 active+undersized, 20 stale+active+undersized, 2 active+clean; 5 B data, 35 MiB used, 712 GiB / 720 GiB avail; 1/2 objects degraded (50.000%)
observed slow op as well
/ceph/teuthology-archive/yuriw-2021-03-01_23:47:21-rados-wip-yuri2-testing-2021-03-01-1417-nautilus-distro-basic-smithi/5925885/teuthology.log
Actions