Feature #24854

open

mds: if MDS fails internal heartbeat, then debugging should be increased to diagnose what it's stuck doing

Added by Patrick Donnelly almost 6 years ago. Updated about 5 years ago.

Status: New
Priority: Urgent
Assignee: -
Category: Introspection/Control
Target version: -
% Done: 0%
Source: Development
Tags:
Backport: mimic,luminous
Reviewed:
Affected Versions:
Component(FS): MDS
Labels (FS):
Pull request ID:

Description

The MDS debug level should increase incrementally, reaching 20 by the time the internal heartbeat timeout reaches mds_beacon_grace.
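
One possible reading of this, as a rough sketch only: ramp the debug level linearly from its configured value up to 20 as the time since the last healthy heartbeat approaches mds_beacon_grace. The function name, the linear ramp, and the base level of 1 are assumptions made for illustration; this is not the MDS implementation, and 15.0 is used below only as an example grace value.

```python
def scaled_debug_level(elapsed: float, grace: float,
                       base_level: int = 1, max_level: int = 20) -> int:
    """Hypothetical helper: map time since the last healthy heartbeat
    to a debug level between base_level and max_level.

    elapsed    -- seconds since the last healthy internal heartbeat
    grace      -- the grace period in seconds (e.g. the value of mds_beacon_grace)
    base_level -- debug level in effect while the heartbeat is healthy (assumed)
    max_level  -- level reached once elapsed >= grace (20 in this ticket)
    """
    if grace <= 0:
        raise ValueError("grace must be positive")
    if not 0 <= base_level <= max_level:
        raise ValueError("base_level must be between 0 and max_level")

    # Clamp progress to [0, 1] so the level never leaves the intended range.
    fraction = min(max(elapsed / grace, 0.0), 1.0)

    # Linear ramp (an assumption); int() truncates, so max_level is
    # reached only when the full grace period has elapsed.
    return base_level + int(fraction * (max_level - base_level))


if __name__ == "__main__":
    for seconds in (0, 5, 10, 15):
        print(seconds, scaled_debug_level(seconds, grace=15.0))
```

A linear ramp keeps early log volume low and saves the most verbose output for the moments just before the grace period expires; a stepped or exponential schedule would fit the description equally well.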

Actions #1

Updated by Patrick Donnelly almost 6 years ago

  • Status changed from New to In Progress
Actions #2

Updated by Patrick Donnelly over 5 years ago

  • Tracker changed from Bug to Feature
  • Status changed from In Progress to New
  • Assignee deleted (Patrick Donnelly)
Actions #3

Updated by Stefan Kooman over 5 years ago

We had "debug_mds=20" set when the MDS suddenly started logging "heartbeat_map is_healthy 'MDSRank' had timed out after 15" and "mds.beacon.mds2 _send skipping beacon, heartbeat map not healthy". So I'm not sure that just increasing the debug level would be enough to catch the actual cause here. See: https://www.spinics.net/lists/ceph-users/msg48403.html

Actions #4

Updated by Patrick Donnelly about 5 years ago

  • Target version changed from v14.0.0 to v15.0.0
Actions #5

Updated by Patrick Donnelly about 5 years ago

  • Target version deleted (v15.0.0)