Project

General

Profile

Actions

Bug #56331

open

crash: MOSDPGLog::encode_payload(unsigned long)

Added by Telemetry Bot almost 2 years ago. Updated over 1 year ago.

Status:
New
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
Telemetry
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):

b1fa0d4357cf041ac97a0fbfb3d989a15cd0f1d6c0a552d16212d8dd4887dda6


Description

New crash events were reported via Telemetry with newer versions (['17.2.0']) than encountered in Tracker (0.0.0).

http://telemetry.front.sepia.ceph.com:4000/d/jByk5HaMz/crash-spec-x-ray?orgId=1&var-sig_v2=cb2c58509d1c5eb5b0a787ca4b62a0f2976ffbe27196d8164b6b3cdae041f113

Sanitized backtrace:

    MOSDPGLog::encode_payload(unsigned long)
    Message::encode(unsigned long, int, bool)
    ProtocolV2::prepare_send_message(unsigned long, Message*)
    ProtocolV2::send_message(Message*)
    AsyncConnection::send_message(Message*)
    PG::send_cluster_message(int, boost::intrusive_ptr<Message>, unsigned int, bool)
    PeeringState::fulfill_log(pg_shard_t, pg_query_t const&, unsigned int)
    PeeringState::fulfill_query(MQuery const&, PeeringCtxWrapper&)
    PeeringState::Stray::react(MQuery const&)
    boost::statechart::simple_state<PeeringState::Stray, PeeringState::Started, boost::mpl::list<mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na>, (boost::statechart::history_mode)>::react_impl(boost::statechart::event_base const&, void const*)
    boost::statechart::state_machine<PeeringState::PeeringMachine, PeeringState::Initial, std::allocator<boost::statechart::none>, boost::statechart::null_exception_translator>::process_event(boost::statechart::event_base const&)
    PG::do_peering_event(std::shared_ptr<PGPeeringEvent>, PeeringCtx&)
    OSD::dequeue_peering_evt(OSDShard*, PG*, std::shared_ptr<PGPeeringEvent>, ThreadPool::TPHandle&)
    ceph::osd::scheduler::PGPeeringItem::run(OSD*, OSDShard*, boost::intrusive_ptr<PG>&, ThreadPool::TPHandle&)
    OSD::ShardedOpWQ::_process(unsigned int, ceph::heartbeat_handle_d*)
    ShardedThreadPool::shardedthreadpool_worker(unsigned int)
    ShardedThreadPool::WorkThreadSharded::entry()

Crash dump sample:
{
    "backtrace": [
        "/lib64/libpthread.so.0(+0x12ce0) [0x7f4f16c8cce0]",
        "gsignal()",
        "abort()",
        "/lib64/libc.so.6(+0x21c89) [0x7f4f158d5c89]",
        "/lib64/libc.so.6(+0x473a6) [0x7f4f158fb3a6]",
        "(MOSDPGLog::encode_payload(unsigned long)+0x389) [0x5565471f12f9]",
        "(Message::encode(unsigned long, int, bool)+0x2e) [0x556547a0471e]",
        "(ProtocolV2::prepare_send_message(unsigned long, Message*)+0x44) [0x556547c8a614]",
        "(ProtocolV2::send_message(Message*)+0x3ae) [0x556547c8ac6e]",
        "(AsyncConnection::send_message(Message*)+0x53e) [0x556547c6731e]",
        "(PG::send_cluster_message(int, boost::intrusive_ptr<Message>, unsigned int, bool)+0xa5) [0x5565472523f5]",
        "(PeeringState::fulfill_log(pg_shard_t, pg_query_t const&, unsigned int)+0x2b4) [0x5565474aa9f4]",
        "(PeeringState::fulfill_query(MQuery const&, PeeringCtxWrapper&)+0x15e) [0x5565474ab0fe]",
        "(PeeringState::Stray::react(MQuery const&)+0x37) [0x5565474ab1d7]",
        "(boost::statechart::simple_state<PeeringState::Stray, PeeringState::Started, boost::mpl::list<mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na>, (boost::statechart::history_mode)0>::react_impl(boost::statechart::event_base const&, void const*)+0x231) [0x5565474fe5e1]",
        "(boost::statechart::state_machine<PeeringState::PeeringMachine, PeeringState::Initial, std::allocator<boost::statechart::none>, boost::statechart::null_exception_translator>::process_event(boost::statechart::event_base const&)+0x74) [0x556547274c44]",
        "(PG::do_peering_event(std::shared_ptr<PGPeeringEvent>, PeeringCtx&)+0x2d6) [0x556547268e96]",
        "(OSD::dequeue_peering_evt(OSDShard*, PG*, std::shared_ptr<PGPeeringEvent>, ThreadPool::TPHandle&)+0x175) [0x5565471de3b5]",
        "(ceph::osd::scheduler::PGPeeringItem::run(OSD*, OSDShard*, boost::intrusive_ptr<PG>&, ThreadPool::TPHandle&)+0x56) [0x556547475526]",
        "(OSD::ShardedOpWQ::_process(unsigned int, ceph::heartbeat_handle_d*)+0xaf8) [0x5565471d00d8]",
        "(ShardedThreadPool::shardedthreadpool_worker(unsigned int)+0x5c4) [0x5565478d59c4]",
        "(ShardedThreadPool::WorkThreadSharded::entry()+0x14) [0x5565478d6d64]",
        "/lib64/libpthread.so.0(+0x81cf) [0x7f4f16c821cf]",
        "clone()" 
    ],
    "ceph_version": "17.2.0",
    "crash_id": "2022-06-05T08:12:38.608461Z_150ea983-95eb-4495-b53c-6ec51489e56e",
    "entity_name": "osd.a6f4d9de23744c4f2fbd5ffa8b55427a42cb1d69",
    "os_id": "centos",
    "os_name": "CentOS Stream",
    "os_version": "8",
    "os_version_id": "8",
    "process_name": "ceph-osd",
    "stack_sig": "b1fa0d4357cf041ac97a0fbfb3d989a15cd0f1d6c0a552d16212d8dd4887dda6",
    "timestamp": "2022-06-05T08:12:38.608461Z",
    "utsname_machine": "x86_64",
    "utsname_release": "5.4.0-105-generic",
    "utsname_sysname": "Linux",
    "utsname_version": "#119-Ubuntu SMP Mon Mar 7 18:49:24 UTC 2022" 
}


Related issues 1 (0 open1 closed)

Related to RADOS - Bug #53600: Crash in MOSDPGLog::encode_payloadRejectedBrad Hubbard

Actions
Actions #1

Updated by Telemetry Bot almost 2 years ago

  • Related to Bug #53600: Crash in MOSDPGLog::encode_payload added
Actions #2

Updated by Telemetry Bot almost 2 years ago

  • Crash signature (v1) updated (diff)
  • Crash signature (v2) updated (diff)
  • Affected Versions v17.2.0 added
Actions #3

Updated by Telemetry Bot over 1 year ago

  • Affected Versions v17.2.1 added
Actions

Also available in: Atom PDF