Bug #50033

closed

mgr/stats: be resilient to offline MDS rank-0

Added by Venky Shankar about 3 years ago. Updated almost 2 years ago.

Status: Resolved
Priority: Normal
Assignee:
Category: Administration/Usability
Target version:
% Done: 0%
Source: Community (dev)
Tags:
Backport: pacific,quincy
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite: fs, rados
Component(FS): cephfs-top
Labels (FS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

mgr/stats can keep reporting stale perf stats after MDS rank-0 goes offline. The reported metrics remain stale even after a standby daemon transitions to active rank-0.

Right now, the workaround is to wait for the query to time out: https://github.com/ceph/ceph/blob/master/src/pybind/mgr/stats/fs/perf_stats.py#L40

To fix this, reregister user queries when a new MDS rank-0 is seen.
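
The proposed fix could be sketched roughly as follows. This is only an illustration under stated assumptions, not the code in perf_stats.py: the class `QueryReregistrar`, its `register` callback, and the `on_fs_map` hook are hypothetical names, and the GID of the daemon holding rank-0 is assumed to be passed in by the caller whenever the FS map changes.

```python
from typing import Callable, Dict, Optional


class QueryReregistrar:
    """Hypothetical helper: re-register user queries when MDS rank-0 changes."""

    def __init__(self, register: Callable[[dict], int]) -> None:
        # `register` stands in for whatever call installs a perf query and
        # returns its ID. It is an assumption, not the real mgr module API.
        self._register = register
        self._rank0_gid: Optional[int] = None
        self.user_queries: Dict[str, dict] = {}  # query key -> query spec
        self.query_ids: Dict[str, int] = {}      # query key -> registered ID

    def add_query(self, key: str, spec: dict) -> None:
        """Remember the user query so it can be registered again later."""
        self.user_queries[key] = spec
        self.query_ids[key] = self._register(spec)

    def on_fs_map(self, rank0_gid: Optional[int]) -> bool:
        """Handle an FS map update.

        `rank0_gid` is the GID of the daemon currently holding rank-0, or
        None while rank-0 is offline. Returns True if queries were
        re-registered.
        """
        if rank0_gid is None:
            # rank-0 is offline: keep the last known GID and wait.
            return False
        if self._rank0_gid is None:
            # First time rank-0 is seen: record it, nothing to redo.
            self._rank0_gid = rank0_gid
            return False
        if rank0_gid == self._rank0_gid:
            # Same daemon still holds rank-0.
            return False
        # A different daemon now holds rank-0: register every query again
        # instead of waiting for the old registrations to time out.
        self._rank0_gid = rank0_gid
        for key, spec in self.user_queries.items():
            self.query_ids[key] = self._register(spec)
        return True
```

The sketch compares GIDs rather than daemon names on the assumption that a standby taking over rank-0 shows up with a different GID than the daemon it replaces.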


Related issues 2 (0 open, 2 closed)

Copied to CephFS - Backport #54479: pacific: mgr/stats: be resilient to offline MDS rank-0 (Resolved, Jos Collin)
Copied to CephFS - Backport #54480: quincy: mgr/stats: be resilient to offline MDS rank-0 (Resolved, Jos Collin)
#1

Updated by Venky Shankar about 3 years ago

  • Target version set to v17.0.0
#2

Updated by Patrick Donnelly about 3 years ago

  • Subject changed from mge/stats: be resilient to offline MDS rank-0 to mgr/stats: be resilient to offline MDS rank-0
  • Status changed from New to Triaged
  • Assignee set to Jos Collin
#3

Updated by Jos Collin almost 3 years ago

  • Status changed from Triaged to Fix Under Review
  • Pull request ID set to 42098
#4

Updated by Jos Collin over 2 years ago

  • ceph-qa-suite fs, rados added
#5

Updated by Venky Shankar about 2 years ago

  • Status changed from Fix Under Review to Pending Backport
  • Backport changed from pacific to pacific,quincy
#6

Updated by Backport Bot about 2 years ago

  • Copied to Backport #54479: pacific: mgr/stats: be resilient to offline MDS rank-0 added
#7

Updated by Backport Bot about 2 years ago

  • Copied to Backport #54480: quincy: mgr/stats: be resilient to offline MDS rank-0 added
#8

Updated by Jos Collin almost 2 years ago

  • Status changed from Pending Backport to Resolved