Bug #56785

open

crash: void OSDShard::register_and_wake_split_child(PG*): assert(!slot->waiting_for_split.empty())

Added by Telemetry Bot over 1 year ago. Updated over 1 year ago.

Status: New
Priority: Normal
Assignee: -
Category: -
Target version: -
% Done: 0%
Source: Telemetry
Tags:
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1): 31fc5832fbf3246cb5c7587f291200327e499827c6b6e17cde16a575f8a0277e


Description

http://telemetry.front.sepia.ceph.com:4000/d/jByk5HaMz/crash-spec-x-ray?orgId=1&var-sig_v2=d44ea277d2ae53e186d6b488012d648f07f6c1cbf5c5b0c671d3f941962cdb03

Assert condition: !slot->waiting_for_split.empty()
Assert function: void OSDShard::register_and_wake_split_child(PG*)

Sanitized backtrace:

    OSDShard::register_and_wake_split_child(PG*)
    OSD::_finish_splits(std::set<boost::intrusive_ptr<PG>, std::less<boost::intrusive_ptr<PG> >, std::allocator<boost::intrusive_ptr<PG> > >&)
    Context::complete(int)
    OSD::ShardedOpWQ::handle_oncommits(std::list<Context*, std::allocator<Context*> >&)
    OSD::ShardedOpWQ::_process(unsigned int, ceph::heartbeat_handle_d*)
    ShardedThreadPool::shardedthreadpool_worker(unsigned int)
    ShardedThreadPool::WorkThreadSharded::entry()

Crash dump sample:
{
    "archived": "2022-07-10 15:25:34.315953",
    "assert_condition": "!slot->waiting_for_split.empty()",
    "assert_file": "osd/OSD.cc",
    "assert_func": "void OSDShard::register_and_wake_split_child(PG*)",
    "assert_line": 10690,
    "assert_msg": "osd/OSD.cc: In function 'void OSDShard::register_and_wake_split_child(PG*)' thread 7faf6fbf1700 time 2022-07-10T15:58:48.931782+0100\nosd/OSD.cc: 10690: FAILED ceph_assert(!slot->waiting_for_split.empty())",
    "assert_thread_name": "tp_osd_tp",
    "backtrace": [
        "/lib/x86_64-linux-gnu/libpthread.so.0(+0x14140) [0x7faf8e57c140]",
        "gsignal()",
        "abort()",
        "(ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x17e) [0x55a42a04eabc]",
        "/usr/bin/ceph-osd(+0xc21c00) [0x55a42a04ec00]",
        "(OSDShard::register_and_wake_split_child(PG*)+0x99a) [0x55a42a11478a]",
        "(OSD::_finish_splits(std::set<boost::intrusive_ptr<PG>, std::less<boost::intrusive_ptr<PG> >, std::allocator<boost::intrusive_ptr<PG> > >&)+0x14e) [0x55a42a118c1e]",
        "(Context::complete(int)+0x9) [0x55a42a14bf89]",
        "(OSD::ShardedOpWQ::handle_oncommits(std::__cxx11::list<Context*, std::allocator<Context*> >&)+0x24) [0x55a42a15d424]",
        "(OSD::ShardedOpWQ::_process(unsigned int, ceph::heartbeat_handle_d*)+0x73f) [0x55a42a138e5f]",
        "(ShardedThreadPool::shardedthreadpool_worker(unsigned int)+0x41a) [0x55a42a8181aa]",
        "(ShardedThreadPool::WorkThreadSharded::entry()+0x10) [0x55a42a81a780]",
        "/lib/x86_64-linux-gnu/libpthread.so.0(+0x8ea7) [0x7faf8e570ea7]",
        "clone()" 
    ],
    "ceph_version": "17.2.1",
    "crash_id": "2022-07-10T14:58:49.177092Z_2efa0371-0584-405a-8b6c-c32a7d6f9b25",
    "entity_name": "osd.80d14b2c0bf175fbe1c342ee221d7eca80db0e5f",
    "os_id": "11",
    "os_name": "Debian GNU/Linux 11 (bullseye)",
    "os_version": "11 (bullseye)",
    "os_version_id": "11",
    "process_name": "ceph-osd",
    "stack_sig": "31fc5832fbf3246cb5c7587f291200327e499827c6b6e17cde16a575f8a0277e",
    "timestamp": "2022-07-10T14:58:49.177092Z",
    "utsname_machine": "x86_64",
    "utsname_release": "5.15.39-1-pve",
    "utsname_sysname": "Linux",
    "utsname_version": "#1 SMP PVE 5.15.39-1 (Wed, 22 Jun 2022 17:22:00 +0200)" 
}
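
When triaging a dump like the sample above, it can help to cross-check the structured assert fields against the free-form assert_msg. The following is a minimal Python sketch, assuming the message keeps the "file: line: FAILED ceph_assert(condition)" layout seen in this sample. The names SAMPLE, ASSERT_RE and summarize_assert are hypothetical and are not part of Ceph or its telemetry tooling.

```python
import json
import re

# Trimmed copy of the crash dump sample from this report; only the
# fields used below are kept. The raw string keeps "\n" as a JSON escape.
SAMPLE = r'''
{
    "assert_condition": "!slot->waiting_for_split.empty()",
    "assert_file": "osd/OSD.cc",
    "assert_func": "void OSDShard::register_and_wake_split_child(PG*)",
    "assert_line": 10690,
    "assert_msg": "osd/OSD.cc: In function 'void OSDShard::register_and_wake_split_child(PG*)' thread 7faf6fbf1700 time 2022-07-10T15:58:48.931782+0100\nosd/OSD.cc: 10690: FAILED ceph_assert(!slot->waiting_for_split.empty())",
    "ceph_version": "17.2.1"
}
'''

# Assumed layout of the failing line: "<file>: <line>: FAILED ceph_assert(<condition>)".
ASSERT_RE = re.compile(
    r"^(?P<file>\S+): (?P<line>\d+): FAILED ceph_assert\((?P<cond>.*)\)$",
    re.MULTILINE,
)


def summarize_assert(dump: dict) -> dict:
    """Extract file, line and condition from the assert_msg of a crash dump."""
    match = ASSERT_RE.search(dump["assert_msg"])
    if match is None:
        raise ValueError("assert_msg does not match the assumed layout")
    return {
        "file": match["file"],
        "line": int(match["line"]),
        "condition": match["cond"],
    }


dump = json.loads(SAMPLE)
summary = summarize_assert(dump)

# The parsed message should agree with the structured fields of the dump.
consistent = (
    summary["file"] == dump["assert_file"]
    and summary["line"] == dump["assert_line"]
    and summary["condition"] == dump["assert_condition"]
)
print(summary, "consistent:", consistent)
```

If the message and the structured fields disagree, that suggests either a different message layout or a damaged dump, and the sketch would need adjusting.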

Actions #1

Updated by Telemetry Bot over 1 year ago

  • Crash signature (v1) updated (diff)
  • Crash signature (v2) updated (diff)
  • Affected Versions v16.2.5, v16.2.6, v17.2.1 added

Actions #2

Updated by Laura Flores over 1 year ago

  • Tags set to test-failure

/a/yuriw-2022-12-10_00:03:28-rados-wip-yuri7-testing-2022-12-09-1107-quincy-distro-default-smithi/7111159

Actions #3

Updated by Laura Flores over 1 year ago

Only 4 occurrences of this crash in the wild, but let's keep an eye on it, since we now have a test that reproduced it.
