Project

General

Profile

Actions

Bug #3663

closed

ceph kernel client is getting stuck on xstat* operations

Added by Roman Hlynovskiy over 11 years ago. Updated almost 8 years ago.

Status:
Rejected
Priority:
Normal
Assignee:
-
Category:
-
Target version:
% Done:

0%

Source:
Development
Tags:
Backport:
Regression:
Severity:
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS):
MDS
Labels (FS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

there are 2 kernel clients happily working with ceph. as soon as I try mounting ceph from the third client, it's getting stuck on stat* operations (observed from strace).
2 working clients are also getting affected by this broken client until it's completely killed. sometimes those 2 working clients should be completely rebooted

log from mds is collected with the following debug config:

[mds]
debug mds = 20
debug ms = 1

client kernel is 3.2.0-0.bpo.3-686-pae
ceph release is 0.55 from debian-testing repo

I might be wrong but according to the log before hitting the bug, mds is complaining that there are some laggy sessions exist for the client which will stuck, so maybe it's trying to assign new session from the client to the old laggy ones from the same client which creates kind of race condition?

log in attach. ready to collect more logs and evidences.


Files

mds-a.debug.log.bz2 (525 KB) mds-a.debug.log.bz2 Roman Hlynovskiy, 12/20/2012 08:48 PM
ceph_20121221_01_logs.tar.gz (27.2 KB) ceph_20121221_01_logs.tar.gz Roman Hlynovskiy, 12/20/2012 10:19 PM
Actions

Also available in: Atom PDF