Bug #52160

closed

crash: void PeeringState::check_past_interval_bounds() const: abort

Added by Telemetry Bot over 2 years ago. Updated about 2 years ago.

Status: Duplicate
Priority: Normal
Assignee: -
Category: -
Target version: -
% Done: 0%
Source: Telemetry
Tags:
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):

06609aa10e3319971c8c6251223bafde6e0052349f3e9a5a03edb74f42e3fa2d
070400b67064b04e820f22c4c5c2d50166103b509fcd8578a829d884ab76d18f
36c20a627e03678780b2fe3dc808af4d2b2567a472a4f61d8178c61d7184a0bb
48ab57a2c81536fcd4d9c8a7ae6b0d297421a5e916602ccf1c53aded5adf1b8a
95d3f1ffec846b1fe432b371d1bb2f07c934d7a56d49a10e7f0d6b989d7e21c7
bea7cac302e857eee2324d2293899a2a015475abf2766cd9ad62e5d30f204468


Description

http://telemetry.front.sepia.ceph.com:4000/d/jByk5HaMz/crash-spec-x-ray?orgId=1&var-sig_v2=95d3f1ffec846b1fe432b371d1bb2f07c934d7a56d49a10e7f0d6b989d7e21c7

Assert condition: abort
Assert function: void PeeringState::check_past_interval_bounds() const

Sanitized backtrace:

    PeeringState::check_past_interval_bounds() const
    PeeringState::Reset::react(PeeringState::AdvMap const&)
    boost::statechart::simple_state<PeeringState::Reset, PeeringState::PeeringMachine, boost::mpl::list<mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na>, (boost::statechart::history_mode)0>::react_impl(boost::statechart::event_base const&, void const*)
    PeeringState::advance_map(std::shared_ptr<OSDMap const>, std::shared_ptr<OSDMap const>, std::vector<int, std::allocator<int> >&, int, std::vector<int, std::allocator<int> >&, int, PeeringCtx&)
    PG::handle_advance_map(std::shared_ptr<OSDMap const>, std::shared_ptr<OSDMap const>, std::vector<int, std::allocator<int> >&, int, std::vector<int, std::allocator<int> >&, int, PeeringCtx&)
    OSD::advance_pg(unsigned int, PG*, ThreadPool::TPHandle&, PeeringCtx&)
    OSD::dequeue_peering_evt(OSDShard*, PG*, std::shared_ptr<PGPeeringEvent>, ThreadPool::TPHandle&)
    ceph::osd::scheduler::PGPeeringItem::run(OSD*, OSDShard*, boost::intrusive_ptr<PG>&, ThreadPool::TPHandle&)
    OSD::ShardedOpWQ::_process(unsigned int, ceph::heartbeat_handle_d*)
    ShardedThreadPool::shardedthreadpool_worker(unsigned int)
    ShardedThreadPool::WorkThreadSharded::entry()
    clone()

Crash dump sample:
{
    "assert_condition": "abort",
    "assert_file": "osd/PeeringState.cc",
    "assert_func": "void PeeringState::check_past_interval_bounds() const",
    "assert_line": 935,
    "assert_msg": "osd/PeeringState.cc: In function 'void PeeringState::check_past_interval_bounds() const' thread 7f4593c65700 time 2021-06-08T16:38:48.818572+0300\nosd/PeeringState.cc: 935: ceph_abort_msg(\"past_interval start interval mismatch\")",
    "assert_thread_name": "tp_osd_tp",
    "backtrace": [
        "(()+0x12980) [0x7f45c31f3980]",
        "(gsignal()+0xc7) [0x7f45c1ea5fb7]",
        "(abort()+0x141) [0x7f45c1ea7921]",
        "(ceph::__ceph_abort(char const*, int, char const*, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&)+0x1b2) [0x5594d4c43ddf]",
        "(PeeringState::check_past_interval_bounds() const+0x88d) [0x5594d4f2bead]",
        "(PeeringState::Reset::react(PeeringState::AdvMap const&)+0x1de) [0x5594d4f445fe]",
        "(boost::statechart::simple_state<PeeringState::Reset, PeeringState::PeeringMachine, boost::mpl::list<mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na>, (boost::statechart::history_mode)0>::react_impl(boost::statechart::event_base const&, void const*)+0x140) [0x5594d4f840b0]",
        "(PeeringState::advance_map(std::shared_ptr<OSDMap const>, std::shared_ptr<OSDMap const>, std::vector<int, std::allocator<int> >&, int, std::vector<int, std::allocator<int> >&, int, PeeringCtx&)+0x20e) [0x5594d4f272de]",
        "(PG::handle_advance_map(std::shared_ptr<OSDMap const>, std::shared_ptr<OSDMap const>, std::vector<int, std::allocator<int> >&, int, std::vector<int, std::allocator<int> >&, int, PeeringCtx&)+0xf2) [0x5594d4d68ad2]",
        "(OSD::advance_pg(unsigned int, PG*, ThreadPool::TPHandle&, PeeringCtx&)+0x351) [0x5594d4cdc351]",
        "(OSD::dequeue_peering_evt(OSDShard*, PG*, std::shared_ptr<PGPeeringEvent>, ThreadPool::TPHandle&)+0x9e) [0x5594d4cde28e]",
        "(ceph::osd::scheduler::PGPeeringItem::run(OSD*, OSDShard*, boost::intrusive_ptr<PG>&, ThreadPool::TPHandle&)+0x50) [0x5594d4f10fa0]",
        "(OSD::ShardedOpWQ::_process(unsigned int, ceph::heartbeat_handle_d*)+0x90c) [0x5594d4cd079c]",
        "(ShardedThreadPool::shardedthreadpool_worker(unsigned int)+0x4ac) [0x5594d5324e5c]",
        "(ShardedThreadPool::WorkThreadSharded::entry()+0x10) [0x5594d53280b0]",
        "(()+0x76db) [0x7f45c31e86db]",
        "(clone()+0x3f) [0x7f45c1f8871f]" 
    ],
    "ceph_version": "15.2.13",
    "crash_id": "2021-06-08T13:38:48.833028Z_210c18d3-6adc-49e9-a73e-7f97431b1a42",
    "entity_name": "osd.995462fa83f76e57e023a31f489914d53d8fd726",
    "os_id": "ubuntu",
    "os_name": "Ubuntu",
    "os_version": "18.04.5 LTS (Bionic Beaver)",
    "os_version_id": "18.04",
    "process_name": "ceph-osd",
    "stack_sig": "bea7cac302e857eee2324d2293899a2a015475abf2766cd9ad62e5d30f204468",
    "timestamp": "2021-06-08T13:38:48.833028Z",
    "utsname_machine": "x86_64",
    "utsname_release": "5.4.0-73-generic",
    "utsname_sysname": "Linux",
    "utsname_version": "#82~18.04.1-Ubuntu SMP Fri Apr 16 15:10:02 UTC 2021" 
}
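
The raw "backtrace" entries in the dump carry a symbol, an offset, and a return address, while the sanitized backtrace above lists symbols only. A minimal Python sketch of that stripping step follows, assuming every entry has the shape "(symbol+0xOFFSET) [0xADDRESS]". The names FRAME_RE, strip_frame and strip_backtrace are hypothetical, and this is not taken from the telemetry code. The sanitized list on this page also omits several frames (the anonymous "()" entries and the gsignal/abort/__ceph_abort frames), which this sketch does not attempt to reproduce.

```python
import re

# Assumed entry shape: "(symbol+0xOFFSET) [0xADDRESS]".
# The greedy ".*" keeps any "+" that is part of the symbol itself and
# only cuts at the final "+0x..." offset before the closing parenthesis.
FRAME_RE = re.compile(
    r"^\((?P<symbol>.*)\+0x[0-9a-fA-F]+\) \[0x[0-9a-fA-F]+\]\s*$"
)


def strip_frame(entry: str) -> str:
    """Return the symbol part of one raw backtrace entry.

    Entries that do not match the assumed shape are returned unchanged
    (apart from surrounding whitespace).
    """
    match = FRAME_RE.match(entry)
    return match.group("symbol") if match else entry.strip()


def strip_backtrace(entries):
    """Apply strip_frame to every entry, preserving order."""
    return [strip_frame(entry) for entry in entries]


if __name__ == "__main__":
    # Three entries copied from the crash dump sample above.
    sample = [
        "(PeeringState::check_past_interval_bounds() const+0x88d) [0x5594d4f2bead]",
        "(OSD::advance_pg(unsigned int, PG*, ThreadPool::TPHandle&, PeeringCtx&)+0x351) [0x5594d4cdc351]",
        "(clone()+0x3f) [0x7f45c1f8871f]",
    ]
    for name in strip_backtrace(sample):
        print(name)
```

Comparing frames by symbol rather than by address is what lets crashes from different builds and hosts be grouped together, since offsets and load addresses vary between them.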


Related issues 1 (0 open, 1 closed)

Is duplicate of RADOS - Bug #49689: osd/PeeringState.cc: ceph_abort_msg("past_interval start interval mismatch") start (Resolved, Matan Breizman)

Actions #1

Updated by Telemetry Bot over 2 years ago

  • Crash signature (v1) updated (diff)
  • Crash signature (v2) updated (diff)
  • Affected Versions v15.2.10, v15.2.13, v15.2.4, v15.2.5, v15.2.6 added
Actions #2

Updated by Neha Ojha over 2 years ago

  • Status changed from New to Duplicate
  • Crash signature (v1) updated (diff)
Actions #3

Updated by Neha Ojha over 2 years ago

  • Is duplicate of Bug #49689: osd/PeeringState.cc: ceph_abort_msg("past_interval start interval mismatch") start added
Actions #4

Updated by Telemetry Bot about 2 years ago

  • Crash signature (v1) updated (diff)
  • Crash signature (v2) updated (diff)