Bug #8291

closed

0.80: fuse client hangs after wake-up from suspend until restart of active MDS

Added by Dmitry Smirnov almost 10 years ago. Updated over 7 years ago.

Status: Resolved
Priority: High
Assignee: -
Category: -
Target version: -
% Done: 0%
Source: Community (user)
Tags:
Backport:
Regression:
Severity: 2 - major
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS):
Labels (FS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

When a computer with CephFS mounted via ceph-fuse wakes up from suspend, the /mnt/ceph mount stops responding (e.g. `df -h /mnt/ceph` never returns).
To remedy the situation I restarted the active MDS; connectivity was restored as soon as another MDS took over.
Here is what I found in the MDS log:

2014-05-06 16:56:09.301733 7f2602a3d700  0 -- 192.168.0.204:6800/4504 >> 192.168.0.7:6801/11147 pipe(0x7f268320a780 sd=30 :0 s=1 pgs=0 cs=0 l=1 c=0x7f261088fde0).fault
2014-05-06 17:01:15.820747 7f2603e43700  0 log [INF] : closing stale session client.2311588 192.168.0.7:0/9222 after 300.686328
2014-05-06 17:53:42.524279 7f2602c3f700  0 auth: could not find secret_id=2792
2014-05-06 17:53:42.524287 7f2602c3f700  0 cephx: verify_authorizer could not get service secret for service mds secret_id=2792
2014-05-06 17:53:42.524292 7f2602c3f700  0 -- 192.168.0.204:6800/4504 >> 192.168.0.7:0/9222 pipe(0x7f2680d7cc80 sd=28 :6800 s=0 pgs=0 cs=0 l=0 c=0x7f260ed53600).accept: got bad authorizer
2014-05-06 17:53:42.739958 7f2602c3f700  0 -- 192.168.0.204:6800/4504 >> 192.168.0.7:0/9222 pipe(0x7f268299fe00 sd=28 :6800 s=0 pgs=0 cs=0 l=0 c=0x7f268588b1e0).accept we reset (peer sent cseq 2), sending RESETSESSION
2014-05-06 18:16:05.409976 7f2603642700 -1 mds.0.166 *** got signal Terminated ***
2014-05-06 18:16:05.410014 7f2603642700  1 mds.0.166 suicide.  wanted down:dne, now up:active
2014-05-06 18:16:06.838411 7f044fadf7c0  0 ceph version 0.80-rc1 (6769f4dc88425396921f94e1a37a1c90758aa3ea), process ceph-mds, pid 28665
2014-05-06 18:16:07.315467 7f044a72d700  1 mds.-1.0 handle_mds_map standby
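
For anyone checking their own MDS log for the same pattern, here is a minimal sketch that pairs a "closing stale session" entry with a later "got bad authorizer" entry from the same client address. It assumes the log line format matches the excerpt above; the names `stale_then_bad_auth`, `STALE` and `BAD_AUTH` are made up for this illustration and are not part of Ceph. It only lines up the two events and does not by itself establish the cause of the hang.

```python
import re
from datetime import datetime

# Two lines copied from the MDS log excerpt in this report.
SAMPLE = """\
2014-05-06 17:01:15.820747 7f2603e43700  0 log [INF] : closing stale session client.2311588 192.168.0.7:0/9222 after 300.686328
2014-05-06 17:53:42.524292 7f2602c3f700  0 -- 192.168.0.204:6800/4504 >> 192.168.0.7:0/9222 pipe(0x7f2680d7cc80 sd=28 :6800 s=0 pgs=0 cs=0 l=0 c=0x7f260ed53600).accept: got bad authorizer
"""

# Timestamp at the start of each log line (format assumed from the excerpt).
TS = r"(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d+)"
STALE = re.compile(
    TS + r".*closing stale session (?P<client>client\.\d+) (?P<addr>\S+) after"
)
BAD_AUTH = re.compile(
    TS + r".*>> (?P<addr>\S+) pipe\(.*\)\.accept: got bad authorizer"
)


def parse_ts(text):
    """Parse a timestamp such as '2014-05-06 17:01:15.820747'."""
    return datetime.strptime(text, "%Y-%m-%d %H:%M:%S.%f")


def stale_then_bad_auth(lines):
    """Return (client, addr, gap_seconds) for each bad-authorizer entry
    whose peer address had a stale session closed earlier in the log."""
    stale = {}  # peer address -> (client id, time the session was closed)
    hits = []
    for line in lines:
        m = STALE.search(line)
        if m:
            stale[m["addr"]] = (m["client"], parse_ts(m["ts"]))
            continue
        m = BAD_AUTH.search(line)
        if m and m["addr"] in stale:
            client, closed_at = stale[m["addr"]]
            gap = (parse_ts(m["ts"]) - closed_at).total_seconds()
            hits.append((client, m["addr"], gap))
    return hits


if __name__ == "__main__":
    for client, addr, gap in stale_then_bad_auth(SAMPLE.splitlines()):
        print(f"{client} at {addr}: bad authorizer {gap:.0f}s after stale close")
```

On the two sample lines it reports client.2311588 at 192.168.0.7:0/9222, with roughly 52 minutes between the stale-session close and the rejected reconnect.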

Files

sleep-recover.patch (445 Bytes), Zheng Yan, 05/12/2014 07:42 PM