Bug #49693

open

Manager daemon is unresponsive, replacing it with standby daemon

Added by Gunther Heinrich about 3 years ago. Updated about 2 months ago.

Status: New
Priority: Normal
Assignee: -
Category: ceph-mgr
Target version:
% Done: 0%
Source: Community (user)
Tags:
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

I noticed that the active mgr daemon on the cluster gets marked as unresponsive and a standby mgr takes over. This currently happens daily. journalctl doesn't show anything suspicious. Restarting the daemon with systemctl seems to help for the moment.
Here are the journalctl entries:

Mar 08 19:47:15 iz-ceph-01-mon-03 bash[1567]: cluster 2021-03-08T18:47:15.474781+0000 mon.iz-ceph-01-mon-01 (mon.0) 472871 : cluster [INF] Manager daemon iz-ceph-01-mon-03.gjmkfc is unresponsive, replacing it with standby daemon iz-ceph-01-mon-02.gfiexf
Mar 09 17:25:59 iz-ceph-01-mon-02 bash[1592]: cluster 2021-03-09T16:25:59.541147+0000 mon.iz-ceph-01-mon-01 (mon.0) 538293 : cluster [INF] Manager daemon iz-ceph-01-mon-02.gfiexf is unresponsive, replacing it with standby daemon iz-ceph-01-mon-05.exotes

The cluster runs Ceph 15.2.7 on Ubuntu 20.04.2.
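
To check how regular the failovers are, the journalctl entries above can be parsed into (timestamp, failed daemon, standby daemon) tuples. The following is only a minimal sketch: the regular expression is derived from the two log lines quoted in this report, and the names `FAILOVER_RE`, `parse_failovers` and `SAMPLE_LINES` are hypothetical helpers, not part of Ceph. Other Ceph versions may format the message differently.

```python
import re
from datetime import datetime

# Assumption: the message format matches the two entries quoted in this report.
FAILOVER_RE = re.compile(
    r"cluster (?P<ts>\S+) \S+ \(mon\.\d+\) \d+ : cluster \[INF\] "
    r"Manager daemon (?P<failed>\S+) is unresponsive, "
    r"replacing it with standby daemon (?P<standby>\S+)"
)

# The two journalctl entries from the description, copied verbatim.
SAMPLE_LINES = [
    "Mar 08 19:47:15 iz-ceph-01-mon-03 bash[1567]: cluster 2021-03-08T18:47:15.474781+0000 "
    "mon.iz-ceph-01-mon-01 (mon.0) 472871 : cluster [INF] Manager daemon "
    "iz-ceph-01-mon-03.gjmkfc is unresponsive, replacing it with standby daemon "
    "iz-ceph-01-mon-02.gfiexf",
    "Mar 09 17:25:59 iz-ceph-01-mon-02 bash[1592]: cluster 2021-03-09T16:25:59.541147+0000 "
    "mon.iz-ceph-01-mon-01 (mon.0) 538293 : cluster [INF] Manager daemon "
    "iz-ceph-01-mon-02.gfiexf is unresponsive, replacing it with standby daemon "
    "iz-ceph-01-mon-05.exotes",
]


def parse_failovers(lines):
    """Return a list of (timestamp, failed_mgr, standby_mgr) for each failover line."""
    events = []
    for line in lines:
        match = FAILOVER_RE.search(line)
        if match is None:
            continue  # not a mgr failover message
        # The cluster-log timestamp carries a numeric UTC offset such as +0000.
        ts = datetime.strptime(match.group("ts"), "%Y-%m-%dT%H:%M:%S.%f%z")
        events.append((ts, match.group("failed"), match.group("standby")))
    return events


events = parse_failovers(SAMPLE_LINES)
for ts, failed, standby in events:
    print(f"{ts.isoformat()}  {failed} -> {standby}")

# Time between consecutive failovers, to see whether they follow a pattern.
for previous, current in zip(events, events[1:]):
    print("gap:", current[0] - previous[0])
```

Feeding the full journal output through `parse_failovers` instead of `SAMPLE_LINES` would show whether the failovers recur at a fixed interval or at a particular time of day.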
