Project

General

Profile

Actions

Bug #18757

closed

Jewel ceph-fuse does not recover after lost connection to MDS

Added by Henrik Korkuc about 7 years ago. Updated almost 7 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
jewel, kraken
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS):
ceph-fuse
Labels (FS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

After ceph-fuse loses connection to MDS for few minutes, it does not recover - accessing mountpoint hangs processes.
Replicated using 10.2.3 and 10.2.5 clients. Didn't try on Kraken.

Example setup:
periodically poke mount:
while [ 1 ]; do date; ls -l cephfs/test ; sleep 10; done

then firewall connection to MDS:
iptables -A INPUT -p tcp --dport 6816 -j DROP

after pokes stop wait few minutes (5?). If we wait for too short time, it recovers, if long enough it hangs. Attaching client logs with enabled debug options:
debug_mds = 20/20
debug_mds_balancer = 20/20
debug_mds_log = 20/20
debug_objecter = 20/20
debug_rados = 20/20
debug_client = 20/20
debug_ms = 20/20
debug_fuse = 20/20

and a thread backtrace.


Files

gdb.txt.gz (2.96 KB) gdb.txt.gz Henrik Korkuc, 02/01/2017 12:23 PM
ceph-client.cephfs.log.gz (27.6 KB) ceph-client.cephfs.log.gz Henrik Korkuc, 02/01/2017 12:24 PM
ceph-client.cephfs.log (586 KB) ceph-client.cephfs.log Henrik Korkuc, 02/02/2017 08:24 AM
ceph-mds.henrik-eu2.log.gz (97 KB) ceph-mds.henrik-eu2.log.gz Henrik Korkuc, 02/02/2017 08:25 AM

Related issues 2 (0 open2 closed)

Copied to CephFS - Backport #19677: jewel: Jewel ceph-fuse does not recover after lost connection to MDSResolvedNathan CutlerActions
Copied to CephFS - Backport #19678: kraken: Jewel ceph-fuse does not recover after lost connection to MDSResolvedNathan CutlerActions
Actions

Also available in: Atom PDF