Project

General

Profile

Feature #18851

Ability to add comments in certain views of Ceph daemons or status

Added by Brian Andrus about 3 years ago.

Status:
New
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
Reviewed:
Affected Versions:
Pull request ID:

Description

It would be nice to maintain a record of why an OSD is down, or why a flag has been set within the Ceph Cluster so that any operator can differentiate at any time what might need immediate attention vs. longer-term maintenance items such as why an OSD is down.

For example, a "ceph comment" command that can be easily searched by daemon/status

Use-case #1:
Cluster has 10 OSDs down over a period of many months. As a cluster operator, it would be nice for me or the datacenter technicians to be able to know why a particular OSD is down. If the OSD has not been commented on, it should be researched as to why it is down.

Use-case #2: If I request my datacenter technicians to replace bad hard drives, it would be nice to search for specific terms or states (replaceme) while doing a monthly hardware sweep.

Use-case #3:
Cluster is in no-out, and I would like to know why and/or who to blame.

This would be good information to have in dashboards as well as accessible by anyone who has necessary keys. The comment author could automatically be populated by ceph key, or could be manually entered if shared keys are used.

Also available in: Atom PDF