Project

General

Profile

Actions

Bug #59777

closed

crash: void PeeringState::check_past_interval_bounds() const: abort

Added by Telemetry Bot 11 months ago. Updated 11 months ago.

Status:
Duplicate
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
Telemetry
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):

6dbfb99ee8593fd3ad1a42e59c60451a52474c179773a279b865bd4945a82df7
74474804e5b55a5f180628b3fd289ab54956096f082c1add5841fb1bbcacecfb
998f1f9f21208f741c4c7bcf49c56d9e38df5ce76691b67faadccb434eb618eb


Description

New crash events were reported via Telemetry with newer versions (['17.2.5']) than encountered in Tracker (17.2.0).

http://telemetry.front.sepia.ceph.com:4000/d/jByk5HaMz/crash-spec-x-ray?orgId=1&var-sig_v2=739786197cebeeca2801b9e3d192ba720d5fe9c32347532b1968e9c7108a27fd

Assert condition: abort
Assert function: void PeeringState::check_past_interval_bounds() const

Sanitized backtrace:

    PeeringState::check_past_interval_bounds() const
    PeeringState::GetInfo::GetInfo(boost::statechart::state<PeeringState::GetInfo, PeeringState::Peering, boost::mpl::list<mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na>, (boost::statechart::history_mode)>::my_context)
    PeeringState::Down::react(MNotifyRec const&)
    boost::statechart::simple_state<PeeringState::Down, PeeringState::Peering, boost::mpl::list<mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na>, (boost::statechart::history_mode)>::react_impl(boost::statechart::event_base const&, void const*)
    boost::statechart::state_machine<PeeringState::PeeringMachine, PeeringState::Initial, std::allocator<boost::statechart::none>, boost::statechart::null_exception_translator>::process_event(boost::statechart::event_base const&)
    PG::do_peering_event(std::shared_ptr<PGPeeringEvent>, PeeringCtx&)
    OSD::dequeue_peering_evt(OSDShard*, PG*, std::shared_ptr<PGPeeringEvent>, ThreadPool::TPHandle&)
    ceph::osd::scheduler::PGPeeringItem::run(OSD*, OSDShard*, boost::intrusive_ptr<PG>&, ThreadPool::TPHandle&)
    OSD::ShardedOpWQ::_process(unsigned int, ceph::heartbeat_handle_d*)
    ShardedThreadPool::shardedthreadpool_worker(unsigned int)
    ShardedThreadPool::WorkThreadSharded::entry()

Crash dump sample:
{
    "assert_condition": "abort",
    "assert_file": "osd/PeeringState.cc",
    "assert_func": "void PeeringState::check_past_interval_bounds() const",
    "assert_line": 991,
    "assert_msg": "osd/PeeringState.cc: In function 'void PeeringState::check_past_interval_bounds() const' thread 7fde8aabe700 time 2022-12-17T22:14:24.464347+0000\nosd/PeeringState.cc: 991: ceph_abort_msg(\"past_interval start interval mismatch\")",
    "assert_thread_name": "tp_osd_tp",
    "backtrace": [
        "/lib/x86_64-linux-gnu/libpthread.so.0(+0x13140) [0x7fdea6c54140]",
        "gsignal()",
        "abort()",
        "(ceph::__ceph_abort(char const*, int, char const*, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&)+0x18a) [0x55c0d19404de]",
        "(PeeringState::check_past_interval_bounds() const+0x67c) [0x55c0d1ca6acc]",
        "(PeeringState::GetInfo::GetInfo(boost::statechart::state<PeeringState::GetInfo, PeeringState::Peering, boost::mpl::list<mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na>, (boost::statechart::history_mode)0>::my_context)+0x173) [0x55c0d1ccf883]",
        "(PeeringState::Down::react(MNotifyRec const&)+0x28b) [0x55c0d1cd003b]",
        "(boost::statechart::simple_state<PeeringState::Down, PeeringState::Peering, boost::mpl::list<mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na>, (boost::statechart::history_mode)0>::react_impl(boost::statechart::event_base const&, void const*)+0x154) [0x55c0d1cfd694]",
        "(boost::statechart::state_machine<PeeringState::PeeringMachine, PeeringState::Initial, std::allocator<boost::statechart::none>, boost::statechart::null_exception_translator>::process_event(boost::statechart::event_base const&)+0x84) [0x55c0d1ad6a54]",
        "(PG::do_peering_event(std::shared_ptr<PGPeeringEvent>, PeeringCtx&)+0xe8) [0x55c0d1abc3a8]",
        "(OSD::dequeue_peering_evt(OSDShard*, PG*, std::shared_ptr<PGPeeringEvent>, ThreadPool::TPHandle&)+0x1b4) [0x55c0d1a16584]",
        "(ceph::osd::scheduler::PGPeeringItem::run(OSD*, OSDShard*, boost::intrusive_ptr<PG>&, ThreadPool::TPHandle&)+0x55) [0x55c0d1c7b3e5]",
        "(OSD::ShardedOpWQ::_process(unsigned int, ceph::heartbeat_handle_d*)+0xa27) [0x55c0d1a28367]",
        "(ShardedThreadPool::shardedthreadpool_worker(unsigned int)+0x41a) [0x55c0d20d13da]",
        "(ShardedThreadPool::WorkThreadSharded::entry()+0x10) [0x55c0d20d39b0]",
        "/lib/x86_64-linux-gnu/libpthread.so.0(+0x7ea7) [0x7fdea6c48ea7]",
        "clone()" 
    ],
    "ceph_version": "16.2.10",
    "crash_id": "2022-12-17T22:14:24.498922Z_3d35eeb2-a5bf-4285-a648-5fe52eb2ded7",
    "entity_name": "osd.6793d38ae8e7d320b959b190fc81c54812e29283",
    "os_id": "11",
    "os_name": "Debian GNU/Linux 11 (bullseye)",
    "os_version": "11 (bullseye)",
    "os_version_id": "11",
    "process_name": "ceph-osd",
    "stack_sig": "6dbfb99ee8593fd3ad1a42e59c60451a52474c179773a279b865bd4945a82df7",
    "timestamp": "2022-12-17T22:14:24.498922Z",
    "utsname_machine": "x86_64",
    "utsname_release": "5.15.74-1-pve",
    "utsname_sysname": "Linux",
    "utsname_version": "#1 SMP PVE 5.15.74-1 (Mon, 14 Nov 2022 20:17:15 +0100)" 
}


Related issues 2 (0 open2 closed)

Related to RADOS - Bug #54708: crash: void PeeringState::check_past_interval_bounds() const: abortDuplicate

Actions
Is duplicate of RADOS - Bug #49689: osd/PeeringState.cc: ceph_abort_msg("past_interval start interval mismatch") startResolvedMatan Breizman

Actions
Actions #1

Updated by Telemetry Bot 11 months ago

  • Related to Bug #54708: crash: void PeeringState::check_past_interval_bounds() const: abort added
Actions #2

Updated by Telemetry Bot 11 months ago

  • Related to Bug #49689: osd/PeeringState.cc: ceph_abort_msg("past_interval start interval mismatch") start added
Actions #3

Updated by Telemetry Bot 11 months ago

  • Crash signature (v1) updated (diff)
  • Crash signature (v2) updated (diff)
  • Affected Versions v16.2.10, v16.2.4, v16.2.5, v16.2.7, v17.2.0, v17.2.5 added
Actions #4

Updated by Matan Breizman 11 months ago

  • Related to deleted (Bug #49689: osd/PeeringState.cc: ceph_abort_msg("past_interval start interval mismatch") start)
Actions #5

Updated by Matan Breizman 11 months ago

  • Is duplicate of Bug #49689: osd/PeeringState.cc: ceph_abort_msg("past_interval start interval mismatch") start added
Actions #6

Updated by Matan Breizman 11 months ago

  • Status changed from New to Duplicate
  • Crash signature (v1) updated (diff)
Actions

Also available in: Atom PDF