Feature #52558
mgr/dashboard: display cephadmin config checks
Status:
Need More Info
Priority:
Normal
Assignee:
Category:
Component - Cluster
Target version:
% Done:
0%
Source:
Tags:
Backport:
Reviewed:
Affected Versions:
Pull request ID:
Description
Description¶
cephadm now provides a way to detect Ceph configuration issues:
# ceph cephadm config-check ls NAME HEALTHCHECK STATUS DESCRIPTION kernel_security CEPHADM_CHECK_KERNEL_LSM enabled checks SELINUX/Apparmor profiles are consistent across cluster hosts os_subscription CEPHADM_CHECK_SUBSCRIPTION disabled checks subscription states are consistent for all cluster hosts public_network CEPHADM_CHECK_PUBLIC_MEMBERSHIP enabled check that all hosts have a NIC on the Ceph public_netork osd_mtu_size CEPHADM_CHECK_MTU enabled check that OSD hosts share a common MTU setting osd_linkspeed CEPHADM_CHECK_LINKSPEED enabled check that OSD hosts share a common linkspeed network_missing CEPHADM_CHECK_NETWORK_MISSING enabled checks that the cluster/public networks defined exist on the Ceph hosts ceph_release CEPHADM_CHECK_CEPH_RELEASE enabled check for Ceph version consistency - ceph daemons should be on the same release (unless upgrade is active) kernel_version CEPHADM_CHECK_KERNEL_VERSION disabled checks that the MAJ.MIN of the kernel on Ceph hosts is consistent
While, when enabled, these will be visiable as a HEALTH_WARN, it might be interesting to manage/display them form the UI.
It looks like it also checks for MTU setting, so we may want to simply remove the MTU alert? Or expose the Ceph health checks as alerts (is it possible)?
Target persona¶
Sys admin/operator.
Context¶
Usually during install or post-install, but also as a Day-2 misconfiuration issue.
Design¶
- To check that the alerts can be managed from cluster -> Mgr modules -> Cephadm?
- Expose them also as Alertmanager alerts (so previously as Prometheus metrics)?
History
#1 Updated by Ernesto Puerta over 2 years ago
- Status changed from New to Need More Info
- Assignee set to Aashish Sharma