Bug #16768

closed

multimds: check_rstat assertion failure

Added by Patrick Donnelly almost 8 years ago. Updated about 5 years ago.

Status: Resolved
Priority: High
Assignee: -
Category: -
Target version:
% Done: 0%
Source: other
Tags:
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS): MDS
Labels (FS): multimds
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

2016-07-19T11:43:54.392 INFO:tasks.ceph.mds.e.mira027.stderr:/srv/autobuild-ceph/gitbuilder.git/build/out~/ceph-11.0.0-709-g12c0683/src/mds/CDir.cc: In function 'bool CDir::check_rstats(bool)' thread 7fe0b191e700 time 2016-07-19 18:43:55.424696
2016-07-19T11:43:54.392 INFO:tasks.ceph.mds.e.mira027.stderr:/srv/autobuild-ceph/gitbuilder.git/build/out~/ceph-11.0.0-709-g12c0683/src/mds/CDir.cc: 289: FAILED assert(nest_info.rbytes == fnode.rstat.rbytes)
2016-07-19T11:43:54.392 INFO:tasks.ceph.mds.e.mira027.stderr: ceph version v11.0.0-709-g12c0683 (12c068365c43a140fe1fe23bf68318342710e84d)
2016-07-19T11:43:54.393 INFO:tasks.ceph.mds.e.mira027.stderr: 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x95) [0x1878453]
2016-07-19T11:43:54.393 INFO:tasks.ceph.mds.e.mira027.stderr: 2: (CDir::check_rstats(bool)+0x16e8) [0x164daf8]
2016-07-19T11:43:54.393 INFO:tasks.ceph.mds.e.mira027.stderr: 3: (MDCache::predirty_journal_parents(std::shared_ptr<MutationImpl>, EMetaBlob*, CInode*, CDir*, int, int, snapid_t)+0x2640) [0x14bed80]
2016-07-19T11:43:54.393 INFO:tasks.ceph.mds.e.mira027.stderr: 4: (Server::_rename_prepare(std::shared_ptr<MDRequestImpl>&, EMetaBlob*, ceph::buffer::list*, CDentry*, CDentry*, CDentry*)+0x1a86) [0x143199e]
2016-07-19T11:43:54.394 INFO:tasks.ceph.mds.e.mira027.stderr: 5: (Server::handle_slave_rename_prep(std::shared_ptr<MDRequestImpl>&)+0x1fc2) [0x1436548]
2016-07-19T11:43:54.394 INFO:tasks.ceph.mds.e.mira027.stderr: 6: (Server::dispatch_slave_request(std::shared_ptr<MDRequestImpl>&)+0xc33) [0x14034db]
2016-07-19T11:43:54.394 INFO:tasks.ceph.mds.e.mira027.stderr: 7: (Server::_slave_rename_sessions_flushed(std::shared_ptr<MDRequestImpl>&)+0x22f) [0x143cb19]
2016-07-19T11:43:54.395 INFO:tasks.ceph.mds.e.mira027.stderr: 8: (C_MDS_SlaveRenameSessionsFlushed::finish(int)+0x2a) [0x1451d3e]
2016-07-19T11:43:54.395 INFO:tasks.ceph.mds.e.mira027.stderr: 9: (Context::complete(int)+0x27) [0x137daf5]
2016-07-19T11:43:54.395 INFO:tasks.ceph.mds.e.mira027.stderr: 10: (MDSInternalContextBase::complete(int)+0x1c6) [0x1705596]
2016-07-19T11:43:54.395 INFO:tasks.ceph.mds.e.mira027.stderr: 11: (C_GatherBase<MDSInternalContextBase, MDSInternalContextGather>::delete_me()+0x41) [0x13d48a3]
2016-07-19T11:43:54.396 INFO:tasks.ceph.mds.e.mira027.stderr: 12: (C_GatherBase<MDSInternalContextBase, MDSInternalContextGather>::sub_finish(MDSInternalContextBase*, int)+0x2a8) [0x13e0d5a]
2016-07-19T11:43:54.396 INFO:tasks.ceph.mds.e.mira027.stderr: 13: (C_GatherBase<MDSInternalContextBase, MDSInternalContextGather>::C_GatherSub::finish(int)+0x29) [0x13e08ab]
2016-07-19T11:43:54.396 INFO:tasks.ceph.mds.e.mira027.stderr: 14: (Context::complete(int)+0x27) [0x137daf5]
2016-07-19T11:43:54.397 INFO:tasks.ceph.mds.e.mira027.stderr: 15: (C_GatherBase<MDSInternalContextBase, MDSInternalContextGather>::C_GatherSub::complete(int)+0x20) [0x13e096a]
2016-07-19T11:43:54.397 INFO:tasks.ceph.mds.e.mira027.stderr: 16: (MDSRank::_advance_queues()+0x4c3) [0x13a9047]
2016-07-19T11:43:54.398 INFO:tasks.ceph.mds.e.mira027.stderr: 17: (MDSRank::_dispatch(Message*, bool)+0x55d) [0x13a6463]
2016-07-19T11:43:54.398 INFO:tasks.ceph.mds.e.mira027.stderr: 18: (MDSRankDispatcher::ms_dispatch(Message*)+0x34) [0x13a5ef0]
2016-07-19T11:43:54.398 INFO:tasks.ceph.mds.e.mira027.stderr: 19: (MDSDaemon::ms_dispatch(Message*)+0x21d) [0x13782bf]
2016-07-19T11:43:54.401 INFO:tasks.ceph.mds.e.mira027.stderr: 20: (Messenger::ms_deliver_dispatch(Message*)+0x98) [0x1af71bc]
2016-07-19T11:43:54.402 INFO:tasks.ceph.mds.e.mira027.stderr: 21: (DispatchQueue::entry()+0x5dd) [0x1af62d9]
2016-07-19T11:43:54.402 INFO:tasks.ceph.mds.e.mira027.stderr: 22: (DispatchQueue::DispatchThread::entry()+0x1c) [0x1902046]
2016-07-19T11:43:54.403 INFO:tasks.ceph.mds.e.mira027.stderr: 23: (Thread::entry_wrapper()+0xc1) [0x19ef733]
2016-07-19T11:43:54.404 INFO:tasks.ceph.mds.e.mira027.stderr: 24: (Thread::_entry_func(void*)+0x18) [0x19ef668]
2016-07-19T11:43:54.404 INFO:tasks.ceph.mds.e.mira027.stderr: 25: (()+0x8182) [0x7fe0b6730182]
2016-07-19T11:43:54.406 INFO:tasks.ceph.mds.e.mira027.stderr: 26: (clone()+0x6d) [0x7fe0b583147d]
2016-07-19T11:43:54.407 INFO:tasks.ceph.mds.e.mira027.stderr: NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
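
For triage, the numbered frames in the backtrace above can be pulled out of the teuthology-prefixed log lines with a small script. This is only a sketch: the regular expression is inferred from the line format shown in this report, and `parse_frames` is a hypothetical helper name, not part of Ceph or teuthology.

```python
import re

# Pattern inferred from the log lines in this report (an assumption, not an
# official format spec). It matches frames such as:
#   "...stderr: 2: (CDir::check_rstats(bool)+0x16e8) [0x164daf8]"
FRAME_RE = re.compile(
    r"stderr:\s*(?P<index>\d+): "
    r"\((?P<symbol>.*)\+(?P<offset>0x[0-9a-f]+)\) "
    r"\[(?P<addr>0x[0-9a-f]+)\]\s*$"
)


def parse_frames(log_text):
    """Return (index, symbol, offset, address) tuples for each backtrace frame.

    Lines that are not frames (assert message, version banner, NOTE) are skipped.
    """
    frames = []
    for line in log_text.splitlines():
        match = FRAME_RE.search(line)
        if match:
            frames.append((
                int(match.group("index")),
                match.group("symbol"),
                int(match.group("offset"), 16),
                int(match.group("addr"), 16),
            ))
    return frames


if __name__ == "__main__":
    sample = (
        "2016-07-19T11:43:54.393 INFO:tasks.ceph.mds.e.mira027.stderr: "
        "2: (CDir::check_rstats(bool)+0x16e8) [0x164daf8]\n"
        "2016-07-19T11:43:54.404 INFO:tasks.ceph.mds.e.mira027.stderr: "
        "25: (()+0x8182) [0x7fe0b6730182]\n"
    )
    for index, symbol, offset, addr in parse_frames(sample):
        print(index, symbol, hex(offset), hex(addr))
```

The extracted addresses could then be resolved against a matching build of the MDS binary, as the NOTE line in the log suggests.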

This test was marked dead due to timeout (I think?):

2016-07-19T11:43:55.737 INFO:tasks.workunit.client.0.mira034.stdout:5/839: dread d12/d16/d24/d32/de7/df5/ff7 [0,4194304] 0
2016-07-19T11:43:55.744 INFO:tasks.workunit.client.0.mira034.stdout:5/840: mkdir d12/d6d/d9b/ddd/d12a 0
2016-07-19T11:43:55.757 INFO:tasks.workunit.client.0.mira034.stdout:5/841: dwrite d12/f11b [0,4194304] 0
2016-07-19T11:43:55.767 INFO:tasks.workunit.client.0.mira034.stdout:5/842: unlink d12/d16/d24/d32/d5b/fb5 0
2016-07-19T11:43:55.777 INFO:tasks.workunit.client.0.mira034.stdout:5/843: dread d12/d16/d24/fa9 [0,4194304] 0
2016-07-19T14:42:53.022 INFO:tasks.workunit.client.0.mira034.stderr:/home/ubuntu/cephtest/workunit.client.0/suites/fsstress.sh: line 1: 13555 Terminated              $command

The fsstress workunit was killed after roughly 3 hours (11:43 to 14:42 in the log timestamps above).

From: http://pulpito.ceph.com/pdonnell-2016-07-18_20:02:54-multimds-master---basic-mira/321809/


Related issues 2 (0 open, 2 closed)

Related to CephFS - Bug #16807: Crash in handle_slave_rename_prep (Resolved, 07/25/2016)
Has duplicate CephFS - Bug #8090: multimds: mds crash in check_rstats (Duplicate, 04/13/2014)
