mds: if MDS fails internal heartbeat, then debugging should be increased to diagnose what it's stuck doing
This should incrementally increase to 20 as the timeout reaches mds_beacon_grace.
#3 Updated by Stefan Kooman 2 months ago
We had "debug_mds=20" when the MDS suddenly started logging "heartbeat_map is_healthy 'MDSRank' had timed out after 15", "mds.beacon.mds2 _send skipping beacon, heartbeat map not healthy". So I'm not sure if just increasing debug level would help enough to catch the actual cause here. See: https://www.spinics.net/lists/ceph-users/msg48403.html