https://tracker.ceph.com/https://tracker.ceph.com/favicon.ico2018-07-17T01:47:40ZCeph CephFS - Feature #24854: mds: if MDS fails internal heartbeat, then debugging should be increased to diagnose what it's stuck doinghttps://tracker.ceph.com/issues/24854?journal_id=1170602018-07-17T01:47:40ZPatrick Donnellypdonnell@redhat.com
<ul><li><strong>Status</strong> changed from <i>New</i> to <i>In Progress</i></li></ul> CephFS - Feature #24854: mds: if MDS fails internal heartbeat, then debugging should be increased to diagnose what it's stuck doinghttps://tracker.ceph.com/issues/24854?journal_id=1192242018-08-21T21:06:12ZPatrick Donnellypdonnell@redhat.com
<ul><li><strong>Tracker</strong> changed from <i>Bug</i> to <i>Feature</i></li><li><strong>Status</strong> changed from <i>In Progress</i> to <i>New</i></li><li><strong>Assignee</strong> deleted (<del><i>Patrick Donnelly</i></del>)</li></ul> CephFS - Feature #24854: mds: if MDS fails internal heartbeat, then debugging should be increased to diagnose what it's stuck doinghttps://tracker.ceph.com/issues/24854?journal_id=1222762018-10-08T19:55:07ZStefan Koomanceph@kooman.org
<ul></ul><p>We had "debug_mds=20" when the MDS suddenly started logging "heartbeat_map is_healthy 'MDSRank' had timed out after 15", "mds.beacon.mds2 _send skipping beacon, heartbeat map not healthy". So I'm not sure if just increasing debug level would help enough to catch the actual cause here. See: <a class="external" href="https://www.spinics.net/lists/ceph-users/msg48403.html">https://www.spinics.net/lists/ceph-users/msg48403.html</a></p> CephFS - Feature #24854: mds: if MDS fails internal heartbeat, then debugging should be increased to diagnose what it's stuck doinghttps://tracker.ceph.com/issues/24854?journal_id=1309932019-03-07T23:21:47ZPatrick Donnellypdonnell@redhat.com
<ul><li><strong>Target version</strong> changed from <i>v14.0.0</i> to <i>v15.0.0</i></li></ul> CephFS - Feature #24854: mds: if MDS fails internal heartbeat, then debugging should be increased to diagnose what it's stuck doinghttps://tracker.ceph.com/issues/24854?journal_id=1310822019-03-07T23:32:27ZPatrick Donnellypdonnell@redhat.com
<ul><li><strong>Target version</strong> deleted (<del><i>v15.0.0</i></del>)</li></ul>