Feature #40633
mds: dump recent log events for extraordinary events (closed)
% Done: 0%
Source: Development
Tags:
Backport:
Reviewed:
Description
When a major event such as a client eviction happens, we often want to understand what went wrong, but production clusters usually run with low debug levels (1/5), so it is difficult to diagnose what happened.
Add a config-enabled feature that dumps recent log events (see also MDSDaemon::respawn) when such events occur. Some interesting cases to start with are when the MDS:
- evicts a client
- is in a recovery state for more than 60 seconds
- misses beacon ACKs from the monitors
- misses internal heartbeats
The config option should be a float giving the minimum time, in seconds, between dumps of the in-memory logs triggered by these major events. The default should probably be 60 seconds to limit log file bloat in a production environment. A value <= 0.0 disables the feature.
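The rate limiting described above can be sketched as follows. This is only an illustration of the requested behaviour, not the MDS implementation: the names `ExtraordinaryEvent`, `RecentLogDumper`, `min_interval`, and `on_event` are all hypothetical, and the sketch is in Python rather than the C++ the MDS is written in. It assumes the interval is measured from the last dump that actually happened.

```python
import time
from enum import Enum, auto


class ExtraordinaryEvent(Enum):
    """Trigger events from the description (names are illustrative only)."""
    CLIENT_EVICTED = auto()
    LONG_RECOVERY = auto()        # in a recovery state for more than 60 seconds
    MISSED_BEACON_ACK = auto()
    MISSED_HEARTBEAT = auto()


class RecentLogDumper:
    """Dump recent in-memory log events at most once per min_interval seconds."""

    def __init__(self, min_interval=60.0, clock=time.monotonic, dump=None):
        # min_interval stands in for the proposed float config option;
        # a value <= 0.0 disables the feature.
        self.min_interval = min_interval
        self._clock = clock
        # dump is a hypothetical callback standing in for the real log dump.
        self._dump = dump if dump is not None else (lambda event: None)
        self._last_dump = None  # time of the most recent dump, if any

    def on_event(self, event):
        """Return True if a dump was performed for this event."""
        if self.min_interval <= 0.0:
            return False  # feature disabled
        now = self._clock()
        if self._last_dump is not None and now - self._last_dump < self.min_interval:
            return False  # too soon after the previous dump; skip to limit log bloat
        self._last_dump = now
        self._dump(event)
        return True
```

With the default of 60 seconds, an eviction at t=0 triggers a dump, a missed heartbeat at t=30 is skipped, and a missed beacon ACK at t=60 dumps again. Injecting the clock keeps the sketch testable without sleeping.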
Updated by Jos Collin over 2 years ago
- Status changed from In Progress to Fix Under Review
- Pull request ID set to 44710
Updated by Venky Shankar over 1 year ago
- Status changed from Fix Under Review to Resolved
Updated by Jos Collin over 1 year ago
- Backport deleted (nautilus,mimic,luminous)