Bug #48439

closed

fsstress failure with mds thrashing: "mds.0.6 Evicting (and blocklisting) client session 4564 (v1:172.21.15.47:0/603539598)"

Added by Patrick Donnelly over 3 years ago. Updated over 2 years ago.

Status: Resolved
Priority: Normal
Assignee:
Category: -
Target version:
% Done: 0%
Source: Q/A
Tags:
Backport: pacific,octopus,nautilus
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS): kceph
Labels (FS): qa-failure
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

2020-12-02T12:04:39.361+0000 7f965bac6700  7 mds.0.server reconnect timed out, 1 clients have not reconnected in time
2020-12-02T12:04:39.361+0000 7f965bac6700  1 mds.0.server reconnect gives up on client.4564 v1:172.21.15.47:0/603539598
2020-12-02T12:04:39.361+0000 7f965bac6700  0 log_channel(cluster) log [WRN] : evicting unresponsive client smithi047: (4564), after waiting 46.0999 seconds during MDS startup

From: /ceph/teuthology-archive/pdonnell-2020-12-02_07:09:18-fs-wip-pdonnell-testing-20201202.050726-distro-basic-smithi/5674936/remote/smithi083/log/ceph-mds.b.log.gz

(Other jobs from that run fail the same way, on both stock RHEL 8.3 and testing kernels.)

Relevant lines from the kernel log:

2020-12-02T12:03:53.267177+00:00 smithi047 kernel: ceph: mds0 reconnect start
2020-12-02T12:03:53.293238+00:00 smithi047 kernel: libceph: mds0 (1)172.21.15.83:6835 socket error on write
2020-12-02T12:04:42.388134+00:00 smithi047 kernel: ceph: mds0 recovery completed

From: /ceph/teuthology-archive/pdonnell-2020-12-02_07:09:18-fs-wip-pdonnell-testing-20201202.050726-distro-basic-smithi/5674936/remote/smithi047/syslog/kern.log.gz
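As a rough cross-check of the two excerpts above, the sketch below lines up the kernel client and MDS timestamps and computes the gaps between them. The timestamps are copied from the log lines quoted in this report; the helper names (`parse_ts`, `seconds_between`) and constant names are illustrative only. The two logs come from different hosts (smithi047 and smithi083), so the differences are approximate and assume the clocks are reasonably in sync.

```python
from datetime import datetime

# Both log formats carry a UTC offset; %z accepts "+0000" and "+00:00".
FMT = "%Y-%m-%dT%H:%M:%S.%f%z"

def parse_ts(ts: str) -> datetime:
    """Parse a timestamp as it appears in the MDS or kernel log."""
    return datetime.strptime(ts, FMT)

def seconds_between(start: str, end: str) -> float:
    """Elapsed seconds from start to end (may be negative)."""
    return (parse_ts(end) - parse_ts(start)).total_seconds()

# Timestamps copied from the excerpts in this report.
KERNEL_RECONNECT_START = "2020-12-02T12:03:53.267177+00:00"
KERNEL_SOCKET_ERROR = "2020-12-02T12:03:53.293238+00:00"
MDS_GIVES_UP = "2020-12-02T12:04:39.361+0000"
KERNEL_RECOVERY_DONE = "2020-12-02T12:04:42.388134+00:00"

start_to_error = seconds_between(KERNEL_RECONNECT_START, KERNEL_SOCKET_ERROR)
start_to_eviction = seconds_between(KERNEL_RECONNECT_START, MDS_GIVES_UP)
eviction_to_recovery = seconds_between(MDS_GIVES_UP, KERNEL_RECOVERY_DONE)

print(f"reconnect start -> socket error:      {start_to_error:.3f} s")
print(f"reconnect start -> MDS gives up:      {start_to_eviction:.3f} s")
print(f"MDS gives up    -> recovery complete: {eviction_to_recovery:.3f} s")
```

Read this way, the socket write error follows the client's reconnect start almost immediately, the MDS gives up on the client roughly 46 seconds later (in line with the 46.0999 seconds it reports waiting), and the kernel only logs recovery as completed a few seconds after the eviction.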


Related issues 1 (0 open, 1 closed)

Related to CephFS - Bug #47563: qa: kernel client closes session improperly causing eviction due to timeout (Resolved, Jeff Layton)
