Feature #52558

mgr/dashboard: display cephadm config checks

Added by Ernesto Puerta over 2 years ago. Updated over 2 years ago.

Status: Need More Info
Priority: Normal
Category: Component - Cluster
Target version:
% Done: 0%
Source:
Tags:
Backport:
Reviewed:
Affected Versions:
Pull request ID:

Description

cephadm now provides a way to detect Ceph configuration issues:

# ceph cephadm config-check ls
NAME             HEALTHCHECK                      STATUS    DESCRIPTION
kernel_security  CEPHADM_CHECK_KERNEL_LSM         enabled   checks SELINUX/Apparmor profiles are consistent across cluster hosts
os_subscription  CEPHADM_CHECK_SUBSCRIPTION       disabled  checks subscription states are consistent for all cluster hosts
public_network   CEPHADM_CHECK_PUBLIC_MEMBERSHIP  enabled   check that all hosts have a NIC on the Ceph public_netork
osd_mtu_size     CEPHADM_CHECK_MTU                enabled   check that OSD hosts share a common MTU setting
osd_linkspeed    CEPHADM_CHECK_LINKSPEED          enabled   check that OSD hosts share a common linkspeed
network_missing  CEPHADM_CHECK_NETWORK_MISSING    enabled   checks that the cluster/public networks defined exist on the Ceph hosts
ceph_release     CEPHADM_CHECK_CEPH_RELEASE       enabled   check for Ceph version consistency - ceph daemons should be on the same
                                                            release (unless upgrade is active)
kernel_version   CEPHADM_CHECK_KERNEL_VERSION     disabled  checks that the MAJ.MIN of the kernel on Ceph hosts is consistent
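
For a UI to display these checks, the listing has to be turned into structured data first. The following is a minimal sketch that parses the plain-text table shown above. The function name `parse_config_checks` and the dictionary keys are assumptions made for illustration, not part of cephadm or the dashboard, and a real implementation would more likely read the data from the orchestrator module than scrape CLI output.

```python
from typing import Dict, List

# Shortened copy of the listing above, including the wrapped description row.
SAMPLE = """\
NAME             HEALTHCHECK                      STATUS    DESCRIPTION
kernel_security  CEPHADM_CHECK_KERNEL_LSM         enabled   checks SELINUX/Apparmor profiles are consistent across cluster hosts
os_subscription  CEPHADM_CHECK_SUBSCRIPTION       disabled  checks subscription states are consistent for all cluster hosts
ceph_release     CEPHADM_CHECK_CEPH_RELEASE       enabled   check for Ceph version consistency - ceph daemons should be on the same
                                                            release (unless upgrade is active)
"""


def parse_config_checks(text: str) -> List[Dict[str, str]]:
    """Parse the plain-text config-check table into a list of dicts.

    Hypothetical helper: assumes four whitespace-separated columns and that
    wrapped description text continues on an indented line.
    """
    checks: List[Dict[str, str]] = []
    for line in text.splitlines():
        # Skip blank lines, the shell prompt line, and the column header.
        if not line.strip() or line.startswith("#") or line.startswith("NAME"):
            continue
        if line[0].isspace():
            # Indented line: continuation of the previous row's description.
            if checks:
                checks[-1]["description"] += " " + line.strip()
            continue
        parts = line.split(None, 3)
        if len(parts) < 4:
            continue  # not a complete table row
        name, healthcheck, status, description = parts
        checks.append({
            "name": name,
            "healthcheck": healthcheck,
            "status": status,
            "description": description.strip(),
        })
    return checks


if __name__ == "__main__":
    for check in parse_config_checks(SAMPLE):
        print(check["name"], check["healthcheck"], check["status"])
```

Each resulting entry carries the check name, the health check code it raises, and whether it is enabled, which is the information a dashboard table would need.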

When enabled, these checks surface as a HEALTH_WARN, but it might be useful to manage and display them from the UI.

The checks also appear to cover the MTU setting, so we may want to simply remove the existing MTU alert. Alternatively, could the Ceph health checks be exposed as alerts (is that possible)?

Target persona

Sys admin/operator.

Context

Usually during install or post-install, but also as a Day-2 misconfiguration issue.

Design

- To check: can the config checks be managed from Cluster -> Mgr modules -> Cephadm?

- Also expose them as Alertmanager alerts (which means exporting them as Prometheus metrics first)?
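
To make the second design point more concrete, here is a rough sketch of how the enabled/disabled state of each check could be rendered in the Prometheus text exposition format. The metric name `cephadm_config_check_enabled` and its labels are hypothetical and do not exist in the Ceph prometheus module; the sketch also only covers whether a check is enabled, not whether it is currently failing.

```python
from typing import Dict, List

# Hypothetical metric name, chosen for illustration only.
METRIC = "cephadm_config_check_enabled"


def to_prometheus(checks: List[Dict[str, str]]) -> str:
    """Render config checks as a gauge: 1 if enabled, 0 otherwise."""
    lines = [
        f"# HELP {METRIC} Whether a cephadm config check is enabled",
        f"# TYPE {METRIC} gauge",
    ]
    for check in checks:
        name = check["name"]
        code = check["healthcheck"]
        value = 1 if check["status"] == "enabled" else 0
        lines.append(f'{METRIC}{{name="{name}",healthcheck="{code}"}} {value}')
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Two rows taken from the listing in the description.
    example = [
        {"name": "osd_mtu_size", "healthcheck": "CEPHADM_CHECK_MTU", "status": "enabled"},
        {"name": "kernel_version", "healthcheck": "CEPHADM_CHECK_KERNEL_VERSION", "status": "disabled"},
    ]
    print(to_prometheus(example), end="")
```

An Alertmanager rule would still need a signal for a failing check, so a metric derived from the raised health check codes would be required in addition to this one.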

History

#1 Updated by Ernesto Puerta over 2 years ago

  • Status changed from New to Need More Info
  • Assignee set to Aashish Sharma
