Bug #38724

closed

_txc_add_transaction error (39) Directory not empty not handled on operation 21 (op 1, counting from 0)

Added by Sage Weil about 5 years ago. Updated over 3 years ago.

Status: Resolved
Priority: High
Assignee: -
Category: -
Target version: -
% Done: 100%
Source:
Tags:
Backport: nautilus, mimic, luminous
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

   -22> 2019-03-13 11:16:04.949 7f6815aa5700  5 osd.48 pg_epoch: 19907 pg[7.1( v 18204'27406 (17926'24297,18204'27406] lb MIN (bitwise) local-lis/les=18205/18206 n=20 ec=18092/14826 lis/c 18205/18205 les/c/f 18206/18206/0 19906/19906/19658) [25,14] r=-1 lpr=19906 pi=[18205,19906)/2 crt=18204'27406 lcod 0'0 unknown NOTIFY mbc={}] exit Started/Stray 1.090630 5 0.000104
   -21> 2019-03-13 11:16:04.949 7f6815aa5700  5 osd.48 pg_epoch: 19907 pg[7.1( v 18204'27406 (17926'24297,18204'27406] lb MIN (bitwise) local-lis/les=18205/18206 n=20 ec=18092/14826 lis/c 18205/18205 les/c/f 18206/18206/0 19906/19906/19658) [25,14] r=-1 lpr=19906 pi=[18205,19906)/2 crt=18204'27406 lcod 0'0 unknown NOTIFY mbc={}] enter Started/ToDelete
   -20> 2019-03-13 11:16:04.949 7f6815aa5700  5 osd.48 pg_epoch: 19907 pg[7.1( v 18204'27406 (17926'24297,18204'27406] lb MIN (bitwise) local-lis/les=18205/18206 n=20 ec=18092/14826 lis/c 18205/18205 les/c/f 18206/18206/0 19906/19906/19658) [25,14] r=-1 lpr=19906 pi=[18205,19906)/2 crt=18204'27406 lcod 0'0 unknown NOTIFY mbc={}] enter Started/ToDelete/WaitDeleteReseved
   -17> 2019-03-13 11:16:04.953 7f6815aa5700 -1 bluestore(/var/lib/ceph/osd/ceph-48) _txc_add_transaction error (39) Directory not empty not handled on operation 21 (op 1, counting from 0)
   -16> 2019-03-13 11:16:04.953 7f6815aa5700  0 bluestore(/var/lib/ceph/osd/ceph-48) _dump_transaction transaction dump:
{
    "ops": [
        {
            "op_num": 0,
            "op_name": "remove",
            "collection": "7.1_head",
            "oid": "#7:80000000::::head#" 
        },
        {
            "op_num": 1,
            "op_name": "rmcoll",
            "collection": "7.1_head" 
        }
    ]
}
    -9> 2019-03-13 11:16:04.957 7f6815aa5700 -1 /build/ceph-14.1.0-597-g342814f/src/os/bluestore/BlueStore.cc: In function 'void BlueStore::_txc_add_transaction(BlueStore::TransContext*, ObjectStore::Transaction*)' thread 7f6815aa5700 time 2019-03-13 11:16:04.955344
/build/ceph-14.1.0-597-g342814f/src/os/bluestore/BlueStore.cc: 11069: abort()

 ceph version 14.1.0-597-g342814f (342814f60900a7eaf5bc3573fabb3a5acd940716) nautilus (rc)
 1: (ceph::__ceph_abort(char const*, int, char const*, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&)+0xda) [0x55b1156c4e45]
 2: (BlueStore::_txc_add_transaction(BlueStore::TransContext*, ObjectStore::Transaction*)+0x296a) [0x55b115cb971a]
 3: (BlueStore::queue_transactions(boost::intrusive_ptr<ObjectStore::CollectionImpl>&, std::vector<ObjectStore::Transaction, std::allocator<ObjectStore::Transaction> >&, boost::intrusive_ptr<TrackedOp>, ThreadPool::TPHandle*)+0x5e6) [0x55b115cbdc86]
 4: (ObjectStore::queue_transaction(boost::intrusive_ptr<ObjectStore::CollectionImpl>&, ObjectStore::Transaction&&, boost::intrusive_ptr<TrackedOp>, ThreadPool::TPHandle*)+0x7f) [0x55b1158751ef]
 5: (PG::_delete_some(ObjectStore::Transaction*)+0x710) [0x55b1158d8d70]
 6: (PG::RecoveryState::Deleting::react(PG::DeleteSome const&)+0x71) [0x55b1158d9b31]
 7: (boost::statechart::simple_state<PG::RecoveryState::Deleting, PG::RecoveryState::ToDelete, boost::mpl::list<mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na>, (boost::statechart::history_mode)0>::react_impl(boost::statechart::event_base const&, void const*)+0x131) [0x55b115921ed1]
 8: (boost::statechart::state_machine<PG::RecoveryState::RecoveryMachine, PG::RecoveryState::Initial, std::allocator<void>, boost::statechart::null_exception_translator>::process_event(boost::statechart::event_base const&)+0x6b) [0x55b1158f631b]
 9: (PG::do_peering_event(std::shared_ptr<PGPeeringEvent>, PG::RecoveryCtx*)+0x122) [0x55b1158e5be2]
 10: (OSD::dequeue_peering_evt(OSDShard*, PG*, std::shared_ptr<PGPeeringEvent>, ThreadPool::TPHandle&)+0x1b4) [0x55b115820f94]
 11: (OSD::dequeue_delete(OSDShard*, PG*, unsigned int, ThreadPool::TPHandle&)+0xd2) [0x55b115821272]
 12: (OSD::ShardedOpWQ::_process(unsigned int, ceph::heartbeat_handle_d*)+0xbf5) [0x55b115814f25]
 13: (ShardedThreadPool::shardedthreadpool_worker(unsigned int)+0x4ac) [0x55b115e3577c]

Hit this on my personal cluster. Log attached.
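For readers unfamiliar with the failure mode, here is a minimal toy model of what the transaction dump above shows: op 0 removes one object, and op 1 (rmcoll) then fails with ENOTEMPTY (errno 39 on Linux) because the collection still holds something. This is only a sketch. `ToyStore` and its methods are hypothetical names, not BlueStore's API, and the extra `leftover-object` entry is an assumption made for illustration; the report does not say what was still in the collection.

```python
import errno


class ToyStore:
    """Hypothetical in-memory stand-in for an object store (not BlueStore)."""

    def __init__(self):
        self.collections = {}  # collection name -> set of object ids

    def remove(self, coll, oid):
        self.collections[coll].discard(oid)
        return 0

    def rmcoll(self, coll):
        # Refuse to drop a collection that still holds objects.
        if self.collections[coll]:
            return -errno.ENOTEMPTY
        del self.collections[coll]
        return 0

    def apply(self, ops):
        """Apply ops in order; return (failing op index, negative errno) or None."""
        for i, op in enumerate(ops):
            if op["op_name"] == "remove":
                r = self.remove(op["collection"], op["oid"])
            elif op["op_name"] == "rmcoll":
                r = self.rmcoll(op["collection"])
            else:
                raise ValueError(f"unsupported op: {op['op_name']}")
            if r < 0:
                return i, r
        return None


store = ToyStore()
# Assumption for illustration: one object other than the removed one remains.
store.collections["7.1_head"] = {"#7:80000000::::head#", "leftover-object"}

# Same two ops as in the transaction dump.
ops = [
    {"op_num": 0, "op_name": "remove", "collection": "7.1_head",
     "oid": "#7:80000000::::head#"},
    {"op_num": 1, "op_name": "rmcoll", "collection": "7.1_head"},
]
result = store.apply(ops)
```

In this toy model `result` identifies op index 1 with a negative ENOTEMPTY code, mirroring the "op 1, counting from 0" wording in the log line. The real OSD aborts instead of returning, since the error is reported as "not handled".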

Files

ceph-osd.5.clippedtoupload.log (77.4 KB): OSD 5 crashed. Edward Kalk, 07/10/2019 02:46 PM
ceph-osd.21.clippedtoupload.log (17.7 KB): OSD 21 crashed. Edward Kalk, 07/10/2019 02:49 PM
osd 10 will not start 07112019.txt (222 KB). Edward Kalk, 07/11/2019 04:39 PM

Related issues 5 (0 open, 5 closed)

Related to RADOS - Bug #39152: nautilus osd crash: Caught signal (Aborted) tp_osd_tp (Duplicate, 04/09/2019)
Has duplicate RADOS - Bug #40642: Bluestore crash due to mass activation on another pool (Duplicate, 07/03/2019)
Copied to RADOS - Backport #39692: mimic: _txc_add_transaction error (39) Directory not empty not handled on operation 21 (op 1, counting from 0) (Resolved, Nathan Cutler)
Copied to RADOS - Backport #39693: nautilus: _txc_add_transaction error (39) Directory not empty not handled on operation 21 (op 1, counting from 0) (Resolved, Sage Weil)
Copied to RADOS - Backport #39694: luminous: _txc_add_transaction error (39) Directory not empty not handled on operation 21 (op 1, counting from 0) (Rejected)
