Bug #37853

closed

remove snapped dir success but core dump by bad backtrace in _purge_stray_purged

Added by Min Chen over 5 years ago. Updated about 5 years ago.

Status: Rejected
Priority: Normal
Assignee:
Category: -
Target version: -
% Done: 0%
Source: Community (dev)
Tags:
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS): Client, Ganesha FSAL
Labels (FS): crash, snapshots
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

2019-01-08 10:04:39.651547 7fc3d5e9a700 -1 log_channel(cluster) log [ERR] : bad backtrace on directory inode 0x100000065db
2019-01-08 10:04:40.647794 7fc3d6e9c700 -1 mds.0.cache.strays Rogue reference after purge to [dentry #0x100/stray4/100000065db [660,head] auth (dversion lock) v=4463 inode=0x56460d5af800 state=1879048229|new | request=0 lock=0 inodepin=1 purging=1 dirty=1 authpin=0 0x56460ad8ef00]
2019-01-08 10:04:40.649984 7fc3d6e9c700 -1 /root/rpmbuild/BUILD/ceph-12.2.7-515-g5c00ca1/src/mds/StrayManager.cc: In function 'void StrayManager::_purge_stray_purged(CDentry*, bool)' thread 7fc3d6e9c700 time 2019-01-08 10:04:40.647818
/root/rpmbuild/BUILD/ceph-12.2.7-515-g5c00ca1/src/mds/StrayManager.cc: 210: FAILED assert(0 == "rogue reference to purging inode")

ceph version 12.2.7-515-g5c00ca1 (5c00ca1ba23265d64e28a0c48cab8079e20ef436) luminous (stable)
1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x110) [0x5645fdb78c30]
2: (StrayManager::_purge_stray_purged(CDentry*, bool)+0x5f1) [0x5645fd9a7f91]
3: (MDSIOContextBase::complete(int)+0x39b) [0x5645fdac1dab]
4: (Finisher::finisher_thread_entry()+0x3e8) [0x5645fdb77e48]
5: (()+0x7e25) [0x7fc3e4600e25]
6: (clone()+0x6d) [0x7fc3e36e134d]
NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
Actions #1

Updated by Min Chen over 5 years ago

Reproduced a similar result:

2019-01-10 14:27:21.092670 7fa7bc8cc700 -1 log_channel(cluster) log [ERR] : bad backtrace on directory inode 0x100000029bf
2019-01-10 14:27:22.660108 7fa7bd8ce700 -1 mds.0.cache.strays Rogue reference after purge to [dentry #0x100/stray0/100000029bf [10a8,head] auth (dversion lock) v=1589 inode=0x55cca8080a00 state=1342177316 | request=0 lock=0 inodepin=1 purging=1 dirty=0 authpin=0 0x55cca59c4000]
2019-01-10 14:27:22.661729 7fa7bd8ce700 -1 /root/rpmbuild/BUILD/ceph-12.2.7-527-g1bdb235/src/mds/StrayManager.cc: In function 'void StrayManager::_purge_stray_purged(CDentry*, bool)' thread 7fa7bd8ce700 time 2019-01-10 14:27:22.660126
/root/rpmbuild/BUILD/ceph-12.2.7-527-g1bdb235/src/mds/StrayManager.cc: 210: FAILED assert(0 == "rogue reference to purging inode")

ceph version 12.2.7-527-g1bdb235 (1bdb235c3777e65269aeb7e6b2ab9df6bf5460b5) luminous (stable)
1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x110) [0x55cc893c6c30]
2: (StrayManager::_purge_stray_purged(CDentry*, bool)+0x5f1) [0x55cc891f5f91]
3: (MDSIOContextBase::complete(int)+0x39b) [0x55cc8930fdab]
4: (Finisher::finisher_thread_entry()+0x3e8) [0x55cc893c5e48]
5: (()+0x7e25) [0x7fa7cb035e25]
6: (clone()+0x6d) [0x7fa7ca11634d]
NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

Dump of the backtrace:

rados -p cephfs_metadata getxattr 100000029bf.00000000 parent > backtrace
ceph-dencoder type inode_backtrace_t import backtrace decode dump_json

{
    "ino": 1099511638463,
    "ancestors": [
        {
            "dirino": 1536,
            "dname": "100000029bf",
            "version": 1589
        },
        {
            "dirino": 256,
            "dname": "stray0",
            "version": 1215
        }
    ],
    "pool": 1,
    "old_pools": []
}
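
The dump prints inode numbers in decimal, while the MDS log prints them in hexadecimal, so the two are easier to compare after conversion. The following is a minimal Python sketch, assuming the JSON from the dump is pasted in as a string. The helper name backtrace_to_path is hypothetical and not part of any Ceph tool. It also assumes the "ancestors" list is ordered from the inode's immediate parent upward, which is how this dump reads.

```python
import json

# JSON produced by `ceph-dencoder ... dump_json`, copied from this report.
BACKTRACE_JSON = """
{
    "ino": 1099511638463,
    "ancestors": [
        {"dirino": 1536, "dname": "100000029bf", "version": 1589},
        {"dirino": 256, "dname": "stray0", "version": 1215}
    ],
    "pool": 1,
    "old_pools": []
}
"""


def backtrace_to_path(bt):
    """Render a decoded backtrace in the '#0x<dirino>/<name>/...' style of the MDS log.

    Hypothetical helper. Assumes bt["ancestors"] lists the immediate
    parent first and the top-most ancestor last.
    """
    ancestors = bt["ancestors"]
    if not ancestors:
        raise ValueError("backtrace has no ancestors")
    top_dirino = ancestors[-1]["dirino"]
    # Reverse so that names run from the top-most directory down to the inode.
    names = [a["dname"] for a in reversed(ancestors)]
    return "#0x%x/%s" % (top_dirino, "/".join(names))


bt = json.loads(BACKTRACE_JSON)
print("ino:", hex(bt["ino"]))            # 0x100000029bf
print("path:", backtrace_to_path(bt))    # #0x100/stray0/100000029bf
```

For this dump the result is #0x100/stray0/100000029bf, the same path that appears in the "Rogue reference after purge" log line, so the stored backtrace points into the stray directory.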

Actions #2

Updated by Patrick Donnelly over 5 years ago

  • Subject changed from Lunminous: remove snapped dir success but core dump by bad backtrace in _purge_stray_purged to remove snapped dir success but core dump by bad backtrace in _purge_stray_purged
  • Target version deleted (v12.2.11)
  • Start date deleted (01/10/2019)
  • Tags deleted (v12.2.7)
  • Affected Versions v12.2.11 added
  • Component(FS) Client added
  • Component(FS) deleted (libcephfs)

Snapshots are not stable in Luminous. Why are you using them?

Actions #3

Updated by Patrick Donnelly over 5 years ago

  • Assignee set to Zheng Yan
Actions #4

Updated by Patrick Donnelly over 5 years ago

  • Status changed from New to Rejected
Actions #5

Updated by Patrick Donnelly about 5 years ago

  • Category deleted (109)
  • Component(FS) deleted (MDS)