Bug #48439
Updated by Patrick Donnelly over 3 years ago
<pre>
2020-12-02T12:04:39.361+0000 7f965bac6700 7 mds.0.server reconnect timed out, 1 clients have not reconnected in time
2020-12-02T12:04:39.361+0000 7f965bac6700 1 mds.0.server reconnect gives up on client.4564 v1:172.21.15.47:0/603539598
2020-12-02T12:04:39.361+0000 7f965bac6700 0 log_channel(cluster) log [WRN] : evicting unresponsive client smithi047: (4564), after waiting 46.0999 seconds during MDS startup
</pre>

From: /ceph/teuthology-archive/pdonnell-2020-12-02_07:09:18-fs-wip-pdonnell-testing-20201202.050726-distro-basic-smithi/5674936/remote/smithi083/log/ceph-mds.b.log.gz (and others from that run. stock RHEL 8.3 -and testing- and testing kernels.)

relevant lines from kernel log:

<pre>
2020-12-02T12:03:53.267177+00:00 smithi047 kernel: ceph: mds0 reconnect start
2020-12-02T12:03:53.293238+00:00 smithi047 kernel: libceph: mds0 (1)172.21.15.83:6835 socket error on write
2020-12-02T12:04:42.388134+00:00 smithi047 kernel: ceph: mds0 recovery completed
</pre>

From: /ceph/teuthology-archive/pdonnell-2020-12-02_07:09:18-fs-wip-pdonnell-testing-20201202.050726-distro-basic-smithi/5674936/remote/smithi047/syslog/kern.log.gz
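
To line up the two logs, the timestamps quoted above can be compared directly. The following is only a rough sketch: it assumes the clocks on smithi047 and smithi083 are synchronized, and the helper and variable names are made up for illustration.

<pre>
```python
from datetime import datetime

# Format shared by both logs; %z accepts "+0000" as well as "+00:00".
TS_FORMAT = "%Y-%m-%dT%H:%M:%S.%f%z"

def parse_ts(ts: str) -> datetime:
    """Parse a timestamp as it appears in the MDS log or the kernel log."""
    return datetime.strptime(ts, TS_FORMAT)

# Timestamps copied from the log excerpts in this report.
reconnect_start = parse_ts("2020-12-02T12:03:53.267177+00:00")     # kernel: mds0 reconnect start
socket_error = parse_ts("2020-12-02T12:03:53.293238+00:00")        # kernel: socket error on write
mds_gives_up = parse_ts("2020-12-02T12:04:39.361+0000")            # MDS: reconnect gives up / evicts
recovery_completed = parse_ts("2020-12-02T12:04:42.388134+00:00")  # kernel: mds0 recovery completed

# Intervals in seconds between the events.
start_to_error = (socket_error - reconnect_start).total_seconds()
start_to_eviction = (mds_gives_up - reconnect_start).total_seconds()
eviction_to_recovery = (recovery_completed - mds_gives_up).total_seconds()

print(f"reconnect start -> socket error:  {start_to_error:.6f} s")
print(f"reconnect start -> MDS eviction:  {start_to_eviction:.6f} s")
print(f"MDS eviction -> recovery done:    {eviction_to_recovery:.6f} s")
```
</pre>

Under that assumption, the socket error follows the kernel's reconnect start by about 0.026 s, the MDS gives up about 46.09 s after the reconnect start (close to the 46.0999 seconds in the eviction warning), and the kernel reports recovery completed about 3.03 s after the eviction.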