Feature #40633

closed

mds: dump recent log events for extraordinary events

Added by Patrick Donnelly almost 5 years ago. Updated over 1 year ago.

Status: Resolved
Priority: Normal
Assignee:
Category: Administration/Usability
Target version: -
% Done: 0%
Source: Development
Tags:
Backport:
Reviewed:
Affected Versions:
Component(FS): MDS
Labels (FS):
Pull request ID:

Description

When a major event such as a client eviction happens, we often want to know what went wrong, but production clusters usually run with low debug levels (1/5), which makes the event difficult to diagnose.

Add a config-enabled feature that dumps recent in-memory log events (see also MDSDaemon::respawn) when such an event occurs. Some interesting cases to start with are when the MDS:

- evicts a client
- is in a recovery state for more than 60 seconds
- misses beacon ACKs from the monitors
- misses internal heartbeats

The config option should be a float giving the minimum time, in seconds, between dumps of the in-memory logs triggered by these major events. The default should probably be 60 seconds to limit log file bloat in production environments. A value <= 0.0 disables the feature.
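To illustrate the intended behaviour, here is a minimal sketch of the rate limiting described above. It is written in Python for brevity rather than in the MDS's C++, and it is not the Ceph implementation: the class name `RecentEventDumper`, its methods, the ring-buffer capacity, and the injectable clock are all hypothetical names chosen for this example.

```python
import time
from collections import deque


class RecentEventDumper:
    """Hypothetical helper: rate-limited dump of an in-memory log buffer."""

    def __init__(self, min_interval=60.0, capacity=10000, clock=time.monotonic):
        # min_interval mirrors the proposed float option; <= 0.0 disables dumps.
        self.min_interval = min_interval
        # Bounded buffer standing in for the daemon's recent in-memory log entries.
        self.recent = deque(maxlen=capacity)
        # The clock is injectable so the behaviour can be tested without sleeping.
        self.clock = clock
        self.last_dump = None

    def log(self, entry):
        """Record an entry; the oldest entry is dropped once the buffer is full."""
        self.recent.append(entry)

    def on_extraordinary_event(self, reason):
        """Return the lines to write to the log, or [] if the dump is suppressed."""
        if self.min_interval <= 0.0:
            return []  # feature disabled
        now = self.clock()
        if self.last_dump is not None and now - self.last_dump < self.min_interval:
            return []  # too soon after the previous dump
        self.last_dump = now
        return [f"dumping recent events: {reason}"] + list(self.recent)
```

With the assumed default of 60 seconds, a burst of evictions and missed heartbeats within the same minute produces a single dump, which is the log-bloat limit the description asks for.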

#1

Updated by Patrick Donnelly over 4 years ago

  • Target version deleted (v15.0.0)
#2

Updated by Jos Collin over 2 years ago

  • Status changed from New to In Progress
#3

Updated by Jos Collin over 2 years ago

  • Status changed from In Progress to Fix Under Review
  • Pull request ID set to 44710
#4

Updated by Venky Shankar over 1 year ago

  • Status changed from Fix Under Review to Resolved
#5

Updated by Jos Collin over 1 year ago

  • Backport deleted (nautilus,mimic,luminous)