Bug #62639

open

mgr/cephadm is not raising alerts when non-ceph daemons are down.

Added by Paul Cuzner 8 months ago.

Status: New
Priority: Normal
Assignee: -
Category: cephadm
Target version: -
% Done: 0%
Source:
Tags:
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

A Ceph cluster managed by cephadm runs both Ceph and non-Ceph daemons.

When a Ceph daemon fails, Ceph itself raises a healthcheck, which is propagated through the Prometheus alert rules to Alertmanager and on to the 'outside world'. However, when a non-Ceph daemon fails (haproxy, keepalived, ganesha, etc.), the failure can go unnoticed at both the Ceph CLI and Prometheus/Alertmanager.

Ideally, the failure of any daemon controlled by cephadm should be flagged to the CLI as a healthcheck and therefore also reach Prometheus/Alertmanager.
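
To make the request concrete, here is a rough sketch of the kind of check being asked for. It is not the cephadm implementation. It assumes daemon records shaped like the JSON from `ceph orch ps --format json`, with `daemon_type`, `daemon_id`, `hostname` and `status_desc` fields. The set of non-Ceph daemon types, the mapping of ganesha to the `nfs` type, and the healthcheck name `CEPHADM_NON_CEPH_DAEMON_DOWN` are all made up for illustration.

```python
from typing import Dict, List

# Assumption: illustrative set of non-Ceph daemon types. "nfs" is assumed
# to be the type under which ganesha daemons are reported.
NON_CEPH_TYPES = {"haproxy", "keepalived", "nfs"}

# Hypothetical healthcheck name, not an existing Ceph health code.
CHECK_NAME = "CEPHADM_NON_CEPH_DAEMON_DOWN"


def find_down_non_ceph_daemons(daemons: List[Dict]) -> List[str]:
    """Return descriptions of non-Ceph daemons not reported as running."""
    down = []
    for d in daemons:
        if d.get("daemon_type") not in NON_CEPH_TYPES:
            continue  # Ceph daemons are already covered by Ceph's own checks
        if d.get("status_desc") != "running":
            down.append(
                f'{d.get("daemon_type")}.{d.get("daemon_id")} on {d.get("hostname")}'
            )
    return down


def build_health_check(down: List[str]) -> Dict:
    """Build a healthcheck-style payload; empty dict when nothing is down."""
    if not down:
        return {}
    return {
        CHECK_NAME: {
            "severity": "warning",
            "summary": f"{len(down)} non-Ceph daemon(s) down",
            "count": len(down),
            "detail": [f"daemon {name} is not running" for name in down],
        }
    }


# Made-up sample records, for illustration only.
sample = [
    {"daemon_type": "mon", "daemon_id": "a", "hostname": "host1", "status_desc": "running"},
    {"daemon_type": "haproxy", "daemon_id": "rgw.a", "hostname": "host1", "status_desc": "stopped"},
    {"daemon_type": "keepalived", "daemon_id": "rgw.b", "hostname": "host2", "status_desc": "running"},
]

down = find_down_non_ceph_daemons(sample)
check = build_health_check(down)
print(check)
```

A payload of this shape (check name mapped to severity, summary, count and detail) is roughly what a mgr module hands to its healthcheck interface. Once raised as a healthcheck, it would presumably follow the same path to Prometheus/Alertmanager as the existing Ceph daemon checks.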
