Bug #57219

closed

Bug #46609: mds: CDir.cc: 956: FAILED ceph_assert(auth_pins == 0)

qa: mds crashed while running workunit test fs/misc/dirfrag.sh

Added by Kotresh Hiremath Ravishankar over 1 year ago. Updated over 1 year ago.

Status: Duplicate
Priority: Normal
Assignee: -
Category: -
Target version: -
% Done: 0%
Source:
Tags:
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS): MDS
Labels (FS): qa, qa-failure
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Seen in https://pulpito.ceph.com/yuriw-2022-08-19_21:01:11-fs-wip-yuri10-testing-2022-08-18-1400-pacific-distro-default-smithi/6981268

MDS Traceback:

2022-08-19T22:36:51.108 INFO:tasks.cephfs.filesystem:scrub status for tag:81cf046d-236c-4c24-9979-02658b99c282 - {'path': '/', 'tag': '81cf046d-236c-4c24-9979-02658b99c282', 'options': 'recursive,force'}
2022-08-19T22:37:09.226 INFO:tasks.ceph.mds.g.smithi071.stderr:/home/jenkins-build/build/workspace/ceph-dev-new-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/16.2.10-687-g5cd353a5/rpm/el8/BUILD/ceph-16.2.10-687-g5cd353a5/src/mds/CDir.cc: In function 'void CDir::finish_old_fragment(MDSContext::vec&, bool)' thread 7f8a1f9f6700 time 2022-08-19T22:37:09.222972+0000
2022-08-19T22:37:09.226 INFO:tasks.ceph.mds.g.smithi071.stderr:/home/jenkins-build/build/workspace/ceph-dev-new-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/16.2.10-687-g5cd353a5/rpm/el8/BUILD/ceph-16.2.10-687-g5cd353a5/src/mds/CDir.cc: 965: FAILED ceph_assert(auth_pins == 0)
2022-08-19T22:37:09.247 INFO:tasks.ceph.mds.g.smithi071.stderr: ceph version 16.2.10-687-g5cd353a5 (5cd353a5faa5cc051c0002bf5cca2445d5285a85) pacific (stable)
2022-08-19T22:37:09.248 INFO:tasks.ceph.mds.g.smithi071.stderr: 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x158) [0x7f8a2aa77c68]
2022-08-19T22:37:09.248 INFO:tasks.ceph.mds.g.smithi071.stderr: 2: /usr/lib64/ceph/libceph-common.so.2(+0x277e82) [0x7f8a2aa77e82]
2022-08-19T22:37:09.248 INFO:tasks.ceph.mds.g.smithi071.stderr: 3: (CDir::finish_old_fragment(std::vector<MDSContext*, std::allocator<MDSContext*> >&, bool)+0x1da) [0x55ac9e028f0a]
2022-08-19T22:37:09.249 INFO:tasks.ceph.mds.g.smithi071.stderr: 4: (CDir::split(int, std::vector<CDir*, std::allocator<CDir*> >*, std::vector<MDSContext*, std::allocator<MDSContext*> >&, bool)+0x1d37) [0x55ac9e02ac67]
2022-08-19T22:37:09.249 INFO:tasks.ceph.mds.g.smithi071.stderr: 5: (MDCache::adjust_dir_fragments(CInode*, std::vector<CDir*, std::allocator<CDir*> > const&, frag_t, int, std::vector<CDir*, std::allocator<CDir*> >*, std::vector<MDSContext*, std::allocator<MDSContext*> >&, bool)+0x278) [0x55ac9df066a8]
2022-08-19T22:37:09.250 INFO:tasks.ceph.mds.g.smithi071.stderr: 6: (MDCache::dispatch_fragment_dir(boost::intrusive_ptr<MDRequestImpl>&)+0x8df) [0x55ac9df0d67f]
2022-08-19T22:37:09.250 INFO:tasks.ceph.mds.g.smithi071.stderr: 7: (MDCache::dispatch_request(boost::intrusive_ptr<MDRequestImpl>&)+0xa5) [0x55ac9df1b0f5]
2022-08-19T22:37:09.251 INFO:tasks.ceph.mds.g.smithi071.stderr: 8: (MDSContext::complete(int)+0x203) [0x55ac9e0d7643]
2022-08-19T22:37:09.251 INFO:tasks.ceph.mds.g.smithi071.stderr: 9: (MDSCacheObject::finish_waiting(unsigned long, int)+0xce) [0x55ac9e0f949e]
2022-08-19T22:37:09.251 INFO:tasks.ceph.mds.g.smithi071.stderr: 10: (Locker::eval_gather(SimpleLock*, bool, bool*, std::vector<MDSContext*, std::allocator<MDSContext*> >*)+0x13d6) [0x55ac9dfac176]
2022-08-19T22:37:09.251 INFO:tasks.ceph.mds.g.smithi071.stderr: 11: (Locker::handle_file_lock(ScatterLock*, boost::intrusive_ptr<MLock const> const&)+0xed1) [0x55ac9dfba211]
2022-08-19T22:37:09.252 INFO:tasks.ceph.mds.g.smithi071.stderr: 12: (Locker::handle_lock(boost::intrusive_ptr<MLock const> const&)+0x1b3) [0x55ac9dfbad83]
2022-08-19T22:37:09.252 INFO:tasks.ceph.mds.g.smithi071.stderr: 13: (Locker::dispatch(boost::intrusive_ptr<Message const> const&)+0xb4) [0x55ac9dfbefb4]
2022-08-19T22:37:09.253 INFO:tasks.ceph.mds.g.smithi071.stderr: 14: (MDSRank::handle_message(boost::intrusive_ptr<Message const> const&)+0xbcc) [0x55ac9ddcddac]
2022-08-19T22:37:09.253 INFO:tasks.ceph.mds.g.smithi071.stderr: 15: (MDSRank::_dispatch(boost::intrusive_ptr<Message const> const&, bool)+0x7bb) [0x55ac9ddd074b]
2022-08-19T22:37:09.253 INFO:tasks.ceph.mds.g.smithi071.stderr: 16: (MDSRankDispatcher::ms_dispatch(boost::intrusive_ptr<Message const> const&)+0x55) [0x55ac9ddd0d45]
2022-08-19T22:37:09.254 INFO:tasks.ceph.mds.g.smithi071.stderr: 17: (MDSDaemon::ms_dispatch2(boost::intrusive_ptr<Message> const&)+0x108) [0x55ac9ddc0938]
2022-08-19T22:37:09.254 INFO:tasks.ceph.mds.g.smithi071.stderr: 18: (DispatchQueue::entry()+0x126a) [0x7f8a2acbdf6a]
2022-08-19T22:37:09.255 INFO:tasks.ceph.mds.g.smithi071.stderr: 19: (DispatchQueue::DispatchThread::entry()+0x11) [0x7f8a2ad70581]
2022-08-19T22:37:09.255 INFO:tasks.ceph.mds.g.smithi071.stderr: 20: /lib64/libpthread.so.0(+0x81ca) [0x7f8a292081ca]
2022-08-19T22:37:09.255 INFO:tasks.ceph.mds.g.smithi071.stderr: 21: clone()
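
When comparing this traceback against the one in #46609, it can help to reduce the log to the failed assertion and the symbolized frames. The following is a minimal sketch, not part of teuthology or Ceph: the function name `summarize` and the three regular expressions are assumptions made for illustration, written only against the line shapes visible in the traceback above. `SAMPLE` holds three lines copied from that traceback.

```python
import re

# Assumed shape of the teuthology prefix: "<timestamp> INFO:<logger>:" plus one optional space.
PREFIX = re.compile(r"^\S+ INFO:[\w.]+:\s?")
# Assumed shape of the assert line: "<file>: <line>: FAILED ceph_assert(<expr>)".
ASSERT = re.compile(r"(?P<file>[^/\s]+): (?P<line>\d+): FAILED ceph_assert\((?P<expr>.*)\)$")
# Numbered frames that carry a demangled symbol, e.g. "3: (CDir::finish_old_fragment(".
FRAME = re.compile(r"^\d+: \((?P<sym>[\w:~]+)")


def summarize(log_text):
    """Return ((file, line, expr) or None, [frame symbols]) for a pasted MDS log."""
    failed = None
    frames = []
    for raw in log_text.splitlines():
        line = PREFIX.sub("", raw.strip())
        m = ASSERT.search(line)
        if m:
            failed = (m["file"], int(m["line"]), m["expr"])
        f = FRAME.match(line)
        if f:
            # Frames without a symbol (raw library offsets, "clone()") are skipped.
            frames.append(f["sym"])
    return failed, frames


SAMPLE = """\
2022-08-19T22:37:09.226 INFO:tasks.ceph.mds.g.smithi071.stderr:/home/jenkins-build/build/workspace/ceph-dev-new-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/16.2.10-687-g5cd353a5/rpm/el8/BUILD/ceph-16.2.10-687-g5cd353a5/src/mds/CDir.cc: 965: FAILED ceph_assert(auth_pins == 0)
2022-08-19T22:37:09.248 INFO:tasks.ceph.mds.g.smithi071.stderr: 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x158) [0x7f8a2aa77c68]
2022-08-19T22:37:09.248 INFO:tasks.ceph.mds.g.smithi071.stderr: 3: (CDir::finish_old_fragment(std::vector<MDSContext*, std::allocator<MDSContext*> >&, bool)+0x1da) [0x55ac9e028f0a]
"""

failed, frames = summarize(SAMPLE)
print(failed)
print(frames)
```

Running the same function over the traceback in the parent issue and comparing the assertion expression and the leading frames is one way to check that the two crashes share a call path; the source line numbers may differ between builds.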

PRs included in this batch:

https://github.com/ceph/ceph/pull/46949 - pacific: ceph-fuse: add dedicated snap stag map for each directory
https://github.com/ceph/ceph/pull/47056 - pacific: mds: notify the xattr_version to replica MDSes
https://github.com/ceph/ceph/pull/47059 - pacific: mds: switch to use projected inode instead
https://github.com/ceph/ceph/pull/47180 - pacific: mds: fix crash when exporting unlinked dir
https://github.com/ceph/ceph/pull/47433 - pacific: mds: skip fetching the dirfrags if not a directory

#1

Updated by Xiubo Li over 1 year ago

  • Status changed from New to Duplicate
  • Parent task set to #46609

This should be a known issue; please see https://tracker.ceph.com/issues/46609.
