Project

General

Profile

Actions

Bug #4038

closed

ceph-fuse: various hangs

Added by Sage Weil about 11 years ago. Updated over 7 years ago.

Status:
Resolved
Priority:
Immediate
Assignee:
Category:
-
Target version:
-
% Done:

0%

Source:
Q/A
Tags:
Backport:
Regression:
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS):
Labels (FS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

#0  pthread_cond_wait@@GLIBC_2.3.2 () at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_wait.S:162
#1  0x0000000000740d56 in Cond::Wait (this=0x7fff6450d9e0, mutex=...) at ./common/Cond.h:55
#2  0x000000000072c561 in Client::_fsync (this=0x19c8800, f=0x19b31e0, syncdataonly=false) at client/Client.cc:5911
---Type <return> to continue, or q <return> to quit---
#3  0x000000000073c1f0 in Client::ll_fsync (this=0x19c8800, fh=0x19b31e0, syncdataonly=false) at client/Client.cc:7404
#4  0x00000000006ee612 in fuse_ll_fsync (req=0x19bf000, ino=1099511627788, datasync=0, fi=0x7fff6450dc20) at client/fuse_ll.cc:384
#5  0x00007fef8549bbc5 in ?? () from /lib/libfuse.so.2
#6  0x00007fef85499d65 in fuse_session_loop () from /lib/libfuse.so.2
#7  0x00000000006ef74c in CephFuse::Handle::loop (this=0x19d3000) at client/fuse_ll.cc:680
#8  0x00000000006efb2b in CephFuse::loop (this=0x7fff6450dda0) at client/fuse_ll.cc:733
#9  0x00000000006ec1ab in main (argc=7, argv=0x7fff6450e4c8, envp=0x19b1000) at ceph_fuse.cc:163
ubuntu@teuthology:/a/sage-2013-02-06_10:45:22-regression-master-testing-basic/528$ 
Thread 1 (Thread 0x7f9fe6032780 (LWP 24564)):
#0  pthread_cond_wait@@GLIBC_2.3.2 () at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_wait.S:162
#1  0x0000000000740d56 in Cond::Wait (this=0x7fffb6dc6bd0, mutex=...) at ./common/Cond.h:55
#2  0x00000000006fff10 in Client::make_request (this=0x16fa800, request=0x1bc7000, uid=1000, gid=1000, ptarget=0x0, use_mds=-1, pdirbl=0x0) at client/Client.cc:1228
#3  0x000000000071e82a in Client::_setattr (this=0x16fa800, in=0x1777900, attr=0x7fffb6dc7150, mask=32, uid=1000, gid=1000) at client/Client.cc:4294
#4  0x0000000000730e4c in Client::ll_setattr (this=0x16fa800, vino=..., attr=0x7fffb6dc7150, mask=32, uid=1000, gid=1000) at client/Client.cc:6374
#5  0x00000000006ed0dd in fuse_ll_setattr (req=0x19f1500, ino=1099511627780, attr=0x7fffb6dc7150, to_set=8, fi=0x7fffb6dc71e0) at client/fuse_ll.cc:143
#6  0x00007f9fe5c01074 in ?? () from /lib/libfuse.so.2
#7  0x00007f9fe5bfed65 in fuse_session_loop () from /lib/libfuse.so.2
#8  0x00000000006ef74c in CephFuse::Handle::loop (this=0x1705000) at client/fuse_ll.cc:680
#9  0x00000000006efb2b in CephFuse::loop (this=0x7fffb6dc7360) at client/fuse_ll.cc:733
#10 0x00000000006ec1ab in main (argc=7, argv=0x7fffb6dc7a88, envp=0x16e3000) at ceph_fuse.cc:163
(gdb) q
A debugging session is active.

        Inferior 1 [process 24564] will be detached.

Quit anyway? (y or n) y
Detaching from program: /tmp/cephtest/binary/usr/local/bin/ceph-fuse, process 24564
ubuntu@plana45:/tmp/cephtest$ logout
Connection to plana45 closed.
ubuntu@teuthology:/a/sage-2013-02-06_10:45:22-regression-master-testing-basic/529$ cd ..
Thread 1 (Thread 0x7fb7623eb780 (LWP 31147)):
#0  pthread_cond_wait@@GLIBC_2.3.2 () at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_wait.S:162
#1  0x0000000000740d56 in Cond::Wait (this=0x7fff2fc38560, mutex=...) at ./common/Cond.h:55
#2  0x0000000000729b64 in Client::_read_async (this=0x2445800, f=0x2430550, off=1061158912, len=131072, bl=0x7fff2fc38960) at client/Client.cc:5569
#3  0x0000000000728aef in Client::_read (this=0x2445800, f=0x2430550, offset=1061158912, size=131072, bl=0x7fff2fc38960) at client/Client.cc:5455
#4  0x000000000073b77f in Client::ll_read (this=0x2445800, fh=0x2430550, off=1061158912, len=131072, bl=0x7fff2fc38960) at client/Client.cc:7370
#5  0x00000000006ee41b in fuse_ll_read (req=0x243c000, ino=1099511627791, size=131072, off=1061158912, fi=0x7fff2fc389f0) at client/fuse_ll.cc:346
---Type <return> to continue, or q <return> to quit---
#6  0x00007fb761fb9d5e in ?? () from /lib/libfuse.so.2
#7  0x00007fb761fb7d65 in fuse_session_loop () from /lib/libfuse.so.2
#8  0x00000000006ef74c in CephFuse::Handle::loop (this=0x2450000) at client/fuse_ll.cc:680
#9  0x00000000006efb2b in CephFuse::loop (this=0x7fff2fc38b70) at client/fuse_ll.cc:733
#10 0x00000000006ec1ab in main (argc=7, argv=0x7fff2fc39298, envp=0x242e000) at ceph_fuse.cc:163
(gdb) qy
Undefined command: "qy".  Try "help".
(gdb) q
A debugging session is active.

        Inferior 1 [process 31147] will be detached.

Quit anyway? (y or n) y
Detaching from program: /tmp/cephtest/binary/usr/local/bin/ceph-fuse, process 31147
ubuntu@plana53:~$ logout
Connection to plana53 closed.
ubuntu@teuthology:/a/sage-2013-02-06_10:45:22-regression-master-testing-basic/531$ cd ..
Actions #1

Updated by Greg Farnum about 11 years ago

  • Status changed from New to 7
  • Assignee set to Sage Weil

There's a shortcut return in CInode::_flush() that wasn't setting the new completion to done (when called from _fsync()). Sage is testing the one-liner fix.

Actions #2

Updated by Greg Farnum about 11 years ago

  • Status changed from 7 to Resolved

He says it fixed the problem, and it's in master now. (commit: 46d7dbd3472f26926c6d048bfc3c150074bfd283)

Actions #3

Updated by John Spray over 7 years ago

  • Project changed from Ceph to CephFS
  • Category deleted (11)

Bulk updating project=ceph category=ceph-fuse issues to move to fs project so that we can remove the ceph-fuse category from the ceph project

Actions

Also available in: Atom PDF