Bug #56785
crash: void OSDShard::register_and_wake_split_child(PG*): assert(!slot->waiting_for_split.empty())
Status:
New
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:
0%
Source:
Telemetry
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):
31fc5832fbf3246cb5c7587f291200327e499827c6b6e17cde16a575f8a0277e
Crash signature (v2):
Tags:
Description
Assert condition: !slot->waiting_for_split.empty()
Assert function: void OSDShard::register_and_wake_split_child(PG*)
Sanitized backtrace:
OSDShard::register_and_wake_split_child(PG*) OSD::_finish_splits(std::set<boost::intrusive_ptr<PG>, std::less<boost::intrusive_ptr<PG> >, std::allocator<boost::intrusive_ptr<PG> > >&) Context::complete(int) OSD::ShardedOpWQ::handle_oncommits(std::list<Context*, std::allocator<Context*> >&) OSD::ShardedOpWQ::_process(unsigned int, ceph::heartbeat_handle_d*) ShardedThreadPool::shardedthreadpool_worker(unsigned int) ShardedThreadPool::WorkThreadSharded::entry()
Crash dump sample:
{ "archived": "2022-07-10 15:25:34.315953", "assert_condition": "!slot->waiting_for_split.empty()", "assert_file": "osd/OSD.cc", "assert_func": "void OSDShard::register_and_wake_split_child(PG*)", "assert_line": 10690, "assert_msg": "osd/OSD.cc: In function 'void OSDShard::register_and_wake_split_child(PG*)' thread 7faf6fbf1700 time 2022-07-10T15:58:48.931782+0100\nosd/OSD.cc: 10690: FAILED ceph_assert(!slot->waiting_for_split.empty())", "assert_thread_name": "tp_osd_tp", "backtrace": [ "/lib/x86_64-linux-gnu/libpthread.so.0(+0x14140) [0x7faf8e57c140]", "gsignal()", "abort()", "(ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x17e) [0x55a42a04eabc]", "/usr/bin/ceph-osd(+0xc21c00) [0x55a42a04ec00]", "(OSDShard::register_and_wake_split_child(PG*)+0x99a) [0x55a42a11478a]", "(OSD::_finish_splits(std::set<boost::intrusive_ptr<PG>, std::less<boost::intrusive_ptr<PG> >, std::allocator<boost::intrusive_ptr<PG> > >&)+0x14e) [0x55a42a118c1e]", "(Context::complete(int)+0x9) [0x55a42a14bf89]", "(OSD::ShardedOpWQ::handle_oncommits(std::__cxx11::list<Context*, std::allocator<Context*> >&)+0x24) [0x55a42a15d424]", "(OSD::ShardedOpWQ::_process(unsigned int, ceph::heartbeat_handle_d*)+0x73f) [0x55a42a138e5f]", "(ShardedThreadPool::shardedthreadpool_worker(unsigned int)+0x41a) [0x55a42a8181aa]", "(ShardedThreadPool::WorkThreadSharded::entry()+0x10) [0x55a42a81a780]", "/lib/x86_64-linux-gnu/libpthread.so.0(+0x8ea7) [0x7faf8e570ea7]", "clone()" ], "ceph_version": "17.2.1", "crash_id": "2022-07-10T14:58:49.177092Z_2efa0371-0584-405a-8b6c-c32a7d6f9b25", "entity_name": "osd.80d14b2c0bf175fbe1c342ee221d7eca80db0e5f", "os_id": "11", "os_name": "Debian GNU/Linux 11 (bullseye)", "os_version": "11 (bullseye)", "os_version_id": "11", "process_name": "ceph-osd", "stack_sig": "31fc5832fbf3246cb5c7587f291200327e499827c6b6e17cde16a575f8a0277e", "timestamp": "2022-07-10T14:58:49.177092Z", "utsname_machine": "x86_64", "utsname_release": "5.15.39-1-pve", "utsname_sysname": "Linux", "utsname_version": "#1 SMP PVE 5.15.39-1 (Wed, 22 Jun 2022 17:22:00 +0200)" }
History
#1 Updated by Telemetry Bot over 1 year ago
#2 Updated by Laura Flores 12 months ago
- Tags set to test-failure
/a/yuriw-2022-12-10_00:03:28-rados-wip-yuri7-testing-2022-12-09-1107-quincy-distro-default-smithi/7111159
#3 Updated by Laura Flores 12 months ago
Only 4 occurrences of this crash in the wild, but let's keep an eye on this since now we have a test that reproduced it.