Actions
Bug #62639
openmgr/cephadm is not raising alerts when non-ceph daemons are down.
Status:
New
Priority:
Normal
Assignee:
-
Category:
cephadm
Target version:
-
% Done:
0%
Source:
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):
Description
The ceph cluster deploys ceph and non-ceph daemons.
When a ceph daemon fails, ceph itself raises a healthcheck which is propagated through the prometheus alert rules to alert manager to the 'outside world'. However, when a non-ceph daemon fails (haproxy, keepalived, ganesha etc) it can go unnoticed at both the ceph CLI and prometheus/alertmanager.
Ideally the failure of any daemon controlled by cephadm should be flagged to the CLI as a healthcheck and therefore Prometheus/AlertManager
No data to display
Actions