Project

General

Profile

Actions

Bug #54611

closed

prometheus metrics shows incorrect ceph version for upgraded ceph daemon

Added by Prashant D about 2 years ago. Updated over 1 year ago.

Status:
Resolved
Priority:
Normal
Assignee:
Category:
Administration/Usability
Target version:
-
% Done:

100%

Source:
Tags:
Backport:
pacific,quincy
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Manager (RADOS bits)
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

If cluster is partially upgraded and on a particular host :

In baremetal environment, all daemons on host shows same version even though some daemons are restart and some are yet to be restarted.
In containerized environment, all daemons on host inherits last service daemon ceph version as host version.

Assume if ceph mon/mgr container is upgraded on host which is co-hosting osd, for all daemons the ceph version in prometheus metrics is reportedly same whereas ideally it should be depends on ceph version on which daemon is running.

For example (in containerized env), if we have upgraded mgr on testnode046 from ceph version 16.2.7-82.el8cp to ceph version 16.2.7-88.el8cp
the prometheus metrics reports below metadata information for mon and mgr

# TYPE ceph_mon_metadata untyped
...
ceph_mon_metadata{ceph_daemon="mon.testnode046",hostname="testnode046",public_addr="10.0.0.46",rank="1",ceph_version="ceph version 16.2.7-82.el8cp (f62bcd07da5ce2cb1e10c300819e14ccf8f2104b) pacific (stable)"} 1.0
..
# HELP ceph_mgr_metadata MGR metadata
# TYPE ceph_mgr_metadata gauge
ceph_mgr_metadata{ceph_daemon="mgr.testnode046",hostname="testnode046",ceph_version="ceph version 16.2.7-82.el8cp (f62bcd07da5ce2cb1e10c300819e14ccf8f2104b) pacific (stable)"} 1.0

which should be :

# TYPE ceph_mon_metadata untyped
...
ceph_mon_metadata{ceph_daemon="mon.testnode046",hostname="testnode046",public_addr="10.0.0.46",rank="1",ceph_version="ceph version 16.2.7-82.el8cp (f62bcd07da5ce2cb1e10c300819e14ccf8f2104b) pacific (stable)"} 1.0
..
# HELP ceph_mgr_metadata MGR metadata
# TYPE ceph_mgr_metadata gauge
ceph_mgr_metadata{ceph_daemon="mgr.testnode046",hostname="testnode046",ceph_version="ceph version 16.2.7-88.el8cp (40253637381fbc86b3bffdaa2dddbedda58aed44) pacific (stable)"} 1.0

This applies to OSD daemons as well.


Related issues 2 (0 open2 closed)

Copied to RADOS - Backport #55306: quincy: prometheus metrics shows incorrect ceph version for upgraded ceph daemonResolvedAdam KingActions
Copied to RADOS - Backport #55309: pacific: prometheus metrics shows incorrect ceph version for upgraded ceph daemonResolvedPrashant DActions
Actions #1

Updated by Prashant D about 2 years ago

  • Description updated (diff)
Actions #2

Updated by Prashant D about 2 years ago

  • Status changed from New to Fix Under Review
  • Pull request ID set to 45505
Actions #4

Updated by Prashant D about 2 years ago

  • Status changed from Fix Under Review to Pending Backport
Actions #5

Updated by Prashant D about 2 years ago

  • Backport set to quincy
Actions #6

Updated by Backport Bot about 2 years ago

  • Copied to Backport #55306: quincy: prometheus metrics shows incorrect ceph version for upgraded ceph daemon added
Actions #7

Updated by Prashant D about 2 years ago

  • Backport changed from quincy to pacific,quincy
Actions #8

Updated by Backport Bot about 2 years ago

  • Copied to Backport #55309: pacific: prometheus metrics shows incorrect ceph version for upgraded ceph daemon added
Actions #9

Updated by Backport Bot over 1 year ago

  • Tags set to backport_processed
Actions #10

Updated by Konstantin Shalygin over 1 year ago

  • Status changed from Pending Backport to Resolved
  • % Done changed from 0 to 100
  • Tags deleted (backport_processed)
Actions

Also available in: Atom PDF