Project

General

Profile

Bug #58155

mon:ceph_assert(m < ranks.size()) `different code path than tracker 50089`

Added by Kamoltat (Junior) Sirivadhna 2 months ago. Updated about 1 month ago.

Status:
Fix Under Review
Priority:
Normal
Category:
-
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
pacific,quincy
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Same problem with https://tracker.ceph.com/issues/50089, but it is a different code path.

We opened a new tracker for cleanliness.

First encountered downstream.

ceph-mon.ceph-pdhiran-efffr4-node1-installer.log:158673:/builddir/build/BUILD/ceph-17.2.5/src/mon/MonMap.h: 404: FAILED ceph_assert(m < ranks.size())
ceph-mon.ceph-pdhiran-efffr4-node1-installer.log:168809:/builddir/build/BUILD/ceph-17.2.5/src/mon/MonMap.h: 404: FAILED ceph_assert(m < ranks.size())

{
    "crash_id": "2022-12-01T03:10:23.198880Z_42911b46-3338-474c-94c1-3c2c1337c216",
    "timestamp": "2022-12-01T03:10:23.198880Z",
    "process_name": "ceph-mon",
    "entity_name": "mon.ceph-pdhiran-efffr4-node1-installer",
    "ceph_version": "17.2.5-14.el9cp",
    "utsname_hostname": "ceph-pdhiran-efffr4-node1-installer",
    "utsname_sysname": "Linux",
    "utsname_release": "5.14.0-70.13.1.el9_0.x86_64",
    "utsname_version": "#1 SMP PREEMPT Thu Apr 14 12:42:38 EDT 2022",
    "utsname_machine": "x86_64",
    "os_name": "Red Hat Enterprise Linux",
    "os_id": "rhel",
    "os_version_id": "9.1",
    "os_version": "9.1 (Plow)",
    "assert_condition": "m < ranks.size()",
    "assert_func": "const entity_addrvec_t& MonMap::get_addrs(unsigned int) const",
    "assert_file": "/builddir/build/BUILD/ceph-17.2.5/src/mon/MonMap.h",
    "assert_line": 404,
    "assert_thread_name": "ms_dispatch",
    "assert_msg": "/builddir/build/BUILD/ceph-17.2.5/src/mon/MonMap.h: In function 'const entity_addrvec_t& MonMap::get_addrs(unsigned int) const' thread 7fe5417eb640 time 2022-12-01T03:10:23.194895+0000\n/builddir/build/BUILD/ceph-17.2.5/src/mon/MonMap.h: 404: FAILED ceph_assert(m < ranks.size())\n",
    "backtrace": [
        "/lib64/libc.so.6(+0x54d90) [0x7fe548fc3d90]",
        "/lib64/libc.so.6(+0xa154c) [0x7fe54901054c]",
        "raise()",
        "abort()",
        "(ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x188) [0x7fe549890ad3]",
        "/usr/lib64/ceph/libceph-common.so.2(+0x141c37) [0x7fe549890c37]",
        "(Elector::send_peer_ping(int, utime_t const*)+0x499) [0x556057109059]",
        "(Elector::begin_peer_ping(int)+0x21c) [0x55605710bf8c]",
        "(Elector::handle_ping(boost::intrusive_ptr<MonOpRequest>)+0x147) [0x556057111017]",
        "(Elector::dispatch(boost::intrusive_ptr<MonOpRequest>)+0xb8) [0x556057111588]",
        "(Monitor::dispatch_op(boost::intrusive_ptr<MonOpRequest>)+0xeaf) [0x5560570903cf]",
        "(Monitor::_ms_dispatch(Message*)+0x7b1) [0x5560570911a1]",
        "/usr/bin/ceph-mon(+0x26599e) [0x55605704b99e]",
        "(DispatchQueue::entry()+0x53a) [0x7fe549a7890a]",
        "/usr/lib64/ceph/libceph-common.so.2(+0x3b4201) [0x7fe549b03201]",
        "/lib64/libc.so.6(+0x9f802) [0x7fe54900e802]",
        "/lib64/libc.so.6(+0x3f450) [0x7fe548fae450]" 
    ]
}


Related issues

Related to RADOS - Bug #50089: mon/MonMap.h: FAILED ceph_assert(m < ranks.size()) when reducing number of monitors in the cluster Resolved

History

#1 Updated by Kamoltat (Junior) Sirivadhna 2 months ago

  • Related to Bug #50089: mon/MonMap.h: FAILED ceph_assert(m < ranks.size()) when reducing number of monitors in the cluster added

#2 Updated by Kamoltat (Junior) Sirivadhna 2 months ago

  • Description updated (diff)

#3 Updated by Kamoltat (Junior) Sirivadhna about 2 months ago

  • Pull request ID set to 49259

#4 Updated by Radoslaw Zarzynski about 1 month ago

  • Status changed from In Progress to Fix Under Review
  • Backport set to pacific,quincy

Also available in: Atom PDF