Bug #50033
mgr/stats: be resilient to offline MDS rank-0
Status: Resolved
Priority: Normal
Assignee:
Category: Administration/Usability
Target version:
% Done: 0%
Source: Community (dev)
Tags:
Backport: pacific,quincy
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite: fs, rados
Component(FS): cephfs-top
Labels (FS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):
Description
mgr/stats can keep reporting stale perf stats after MDS rank-0 goes offline. The reported metrics stay stale even after a standby daemon takes over as the active rank-0.
The current workaround is to wait for the query to time out: https://github.com/ceph/ceph/blob/master/src/pybind/mgr/stats/fs/perf_stats.py#L40
To fix this, re-register user queries when a new MDS rank-0 is seen.
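A minimal sketch of the proposed fix, not the actual mgr/stats code. The class QueryRegistry, the on_fs_map hook, and the register/unregister callables are all hypothetical names chosen for illustration, and identifying rank-0 by a daemon GID is an assumption. The idea is to remember which daemon last held rank-0 and to re-register every stored user query when a different one shows up.

```python
from typing import Callable, Dict, Optional


class QueryRegistry:
    """Hypothetical helper: re-registers user queries when rank-0 changes."""

    def __init__(self, register: Callable[[dict], int],
                 unregister: Callable[[int], None]) -> None:
        # register/unregister are stand-ins for whatever the module uses
        # to add and remove an MDS perf query (assumption).
        self._register = register
        self._unregister = unregister
        self._queries: Dict[str, dict] = {}    # user key -> query spec
        self._query_ids: Dict[str, int] = {}   # user key -> live query id
        self._rank0_gid: Optional[int] = None  # last seen rank-0 daemon

    def add_query(self, key: str, spec: dict) -> None:
        """Store the spec so it can be registered again later."""
        self._queries[key] = spec
        self._query_ids[key] = self._register(spec)

    def on_fs_map(self, rank0_gid: Optional[int]) -> bool:
        """Call on each map update. Returns True if queries were re-registered."""
        if rank0_gid is None or rank0_gid == self._rank0_gid:
            # rank-0 is offline or unchanged: nothing to do yet.
            return False
        first_seen = self._rank0_gid is None
        self._rank0_gid = rank0_gid
        if first_seen:
            # The first rank-0 we observe is only a baseline.
            return False
        # A different daemon now holds rank-0: drop the old registrations
        # and register every stored query again.
        for key, spec in self._queries.items():
            self._unregister(self._query_ids[key])
            self._query_ids[key] = self._register(spec)
        return True
```

With this shape, a standby taking over rank-0 triggers re-registration on the next map update instead of leaving stale results in place until the query times out.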