Bug #44590
prometheus metrics wrongly reports scrubbing pgs
Status:
Resolved
Priority:
Normal
Assignee:
-
Category:
prometheus module
Target version:
-
% Done:
0%
Source:
Tags:
Backport:
mimic,nautilus,octopus
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):
Description
After the change to the ceph_pg_* metrics in Ceph 14.2.8, ceph_pg_active and ceph_pg_clean no longer count placement groups that are scrubbing.
So a cluster where all PGs are active and clean, but 4 of them are being scrubbed, will report:
ceph_pg_total{pool_id="26"} 2048.0
ceph_pg_active{pool_id="26"} 2044.0
ceph_pg_clean{pool_id="26"} 2044.0
ceph_pg_scrubbing{pool_id="26"} 0.0
while:
ceph pg dump | grep "^26\." | awk '{ print $12; }' | sort | uniq -c
2044 active+clean
2 active+clean+scrubbing
2 active+clean+scrubbing+deep
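To make the mismatch concrete, here is a minimal Python sketch of what the per-state metrics would be expected to show, given the `ceph pg dump` tallies above. It assumes that a PG in a compound state such as `active+clean+scrubbing` should count toward every component state; the function name `expected_state_counts` is hypothetical and is not part of the prometheus module.

```python
from collections import Counter

# Compound-state tallies for pool 26, copied from the `uniq -c` output above.
compound_counts = {
    "active+clean": 2044,
    "active+clean+scrubbing": 2,
    "active+clean+scrubbing+deep": 2,
}


def expected_state_counts(compound):
    """Split each compound state on '+' and credit every component state.

    Assumption: a PG reported as 'active+clean+scrubbing' is counted once
    under each of 'active', 'clean' and 'scrubbing'.
    """
    totals = Counter()
    for state, num_pgs in compound.items():
        for part in state.split("+"):
            totals[part] += num_pgs
    return dict(totals)


expected = expected_state_counts(compound_counts)
for name in sorted(expected):
    print(f"{name}: {expected[name]}")
```

Under that assumption, `active` and `clean` should both equal the pool total of 2048 and `scrubbing` should be 4, rather than the 2044 and 0 reported by the exporter.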
Updated by Jacek S. about 4 years ago
While debugging, I found that for each pool, each kind of state overwrites the previous one. I'm preparing an MR.
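The overwrite pattern described here can be illustrated with a small Python sketch. This is not the module's actual code: the function `count_states`, its `accumulate` flag, and the input layout are hypothetical, and the buggy numbers it produces are not meant to reproduce the exact values in the description. It only shows how assigning a per-state value instead of adding to it loses PGs when several compound states share a component.

```python
def count_states(pg_summary, accumulate):
    """Tally PGs per (state, pool_id).

    pg_summary maps pool id -> {compound state -> number of PGs}.
    With accumulate=False each compound state overwrites the value written
    by the previous one (the kind of bug described above); with
    accumulate=True the values are summed.
    """
    metrics = {}
    for pool_id, states in pg_summary.items():
        for compound, num_pgs in states.items():
            for part in compound.split("+"):
                key = (part, pool_id)
                if accumulate:
                    metrics[key] = metrics.get(key, 0) + num_pgs
                else:
                    metrics[key] = num_pgs  # overwrites the earlier value
    return metrics


# Hypothetical input shaped after the pool 26 figures in the description.
summary = {
    "26": {
        "active+clean": 2044,
        "active+clean+scrubbing": 2,
        "active+clean+scrubbing+deep": 2,
    }
}

buggy = count_states(summary, accumulate=False)
fixed = count_states(summary, accumulate=True)
print("overwriting :", buggy[("active", "26")], buggy[("scrubbing", "26")])
print("accumulating:", fixed[("active", "26")], fixed[("scrubbing", "26")])
```

With accumulation, every PG is counted under each of its component states, so the `active` total matches the pool's PG count regardless of the order in which the compound states are processed.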
Updated by Jacek S. about 4 years ago
Related MR: https://github.com/ceph/ceph/pull/33967
Updated by Neha Ojha about 4 years ago
- Status changed from New to Fix Under Review
- Pull request ID set to 33967
Updated by Jonas Jelten about 4 years ago
This probably also causes the bug I'm seeing: PGs that are both degraded and active are not reported as active in Prometheus.
Updated by Kefu Chai about 4 years ago
- Backport changed from mimic,nautilus to mimic,nautilus,octopus
Updated by Sage Weil about 4 years ago
- Status changed from Fix Under Review to Pending Backport
Updated by Konstantin Shalygin about 4 years ago
- Copied to Backport #44735: nautilus: prometheus metrics wrongly reports scrubbing pgs added
Updated by Konstantin Shalygin about 4 years ago
- Copied to Backport #44736: mimic: prometheus metrics wrongly reports scrubbing pgs added
Updated by Konstantin Shalygin about 4 years ago
- Copied to Backport #44737: octopus: prometheus metrics wrongly reports scrubbing pgs added
Updated by Nathan Cutler almost 4 years ago
- Status changed from Pending Backport to Resolved