Project

General

Profile

Actions

Bug #54937

open

crash: void OSDMap::check_health(ceph::common::CephContext*, health_check_map_t*) const: assert(num_down_in_osds <= num_in_osds)

Added by Telemetry Bot about 2 years ago. Updated 11 months ago.

Status:
New
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
Telemetry
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):

e59169be543a0f110e45984d92c08353addc657cbf88822789786df208b0f18d
0cb4f4d107a647de333be54e190473de639ffadd102c736c116a90dbec9b66ca


Description

http://telemetry.front.sepia.ceph.com:4000/d/jByk5HaMz/crash-spec-x-ray?orgId=1&var-sig_v2=32a296fd650ca974d5a78f423adefff7b90dbcd6fe99eddb6560363554b0801b

Assert condition: num_down_in_osds <= num_in_osds
Assert function: void OSDMap::check_health(ceph::common::CephContext*, health_check_map_t*) const

Sanitized backtrace:

    OSDMap::check_health(ceph::common::CephContext*, health_check_map_t*) const
    OSDMonitor::encode_pending(std::shared_ptr<MonitorDBStore::Transaction>)
    PaxosService::propose_pending()
    OSDMonitor::tick()
    Monitor::tick()
    Context::complete(int)
    CommonSafeTimer<std::mutex>::timer_thread()
    CommonSafeTimerThread<std::mutex>::entry()

Crash dump sample:
{
    "archived": "2022-03-13 05:46:09.296912",
    "assert_condition": "num_down_in_osds <= num_in_osds",
    "assert_file": "osd/OSDMap.cc",
    "assert_func": "void OSDMap::check_health(ceph::common::CephContext*, health_check_map_t*) const",
    "assert_line": 5704,
    "assert_msg": "osd/OSDMap.cc: In function 'void OSDMap::check_health(ceph::common::CephContext*, health_check_map_t*) const' thread 7f667d068700 time 2022-03-13T04:39:55.066560+0000\nosd/OSDMap.cc: 5704: FAILED ceph_assert(num_down_in_osds <= num_in_osds)",
    "assert_thread_name": "safe_timer",
    "backtrace": [
        "/lib64/libpthread.so.0(+0x12c20) [0x7f6685ee0c20]",
        "gsignal()",
        "abort()",
        "(ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x1a9) [0x7f66881a4ba3]",
        "/usr/lib64/ceph/libceph-common.so.2(+0x276d6c) [0x7f66881a4d6c]",
        "(OSDMap::check_health(ceph::common::CephContext*, health_check_map_t*) const+0x3e5e) [0x7f66885ed29e]",
        "(OSDMonitor::encode_pending(std::shared_ptr<MonitorDBStore::Transaction>)+0x3c38) [0x558d2748c218]",
        "(PaxosService::propose_pending()+0x21a) [0x558d273fd9ea]",
        "(OSDMonitor::tick()+0x4a8) [0x558d2743f9f8]",
        "(Monitor::tick()+0x150) [0x558d272ca510]",
        "(Context::complete(int)+0xd) [0x558d272ebdbd]",
        "(CommonSafeTimer<std::mutex>::timer_thread()+0x10f) [0x7f668829995f]",
        "(CommonSafeTimerThread<std::mutex>::entry()+0x11) [0x7f668829acf1]",
        "/lib64/libpthread.so.0(+0x817f) [0x7f6685ed617f]",
        "clone()" 
    ],
    "ceph_version": "16.2.7",
    "crash_id": "2022-03-13T04:39:55.070527Z_e80e5212-5e08-4827-bd40-ed151f8f61c1",
    "entity_name": "mon.676ff60aa64518d09da9bc045a405d9067b9815e",
    "os_id": "centos",
    "os_name": "CentOS Stream",
    "os_version": "8",
    "os_version_id": "8",
    "process_name": "ceph-mon",
    "stack_sig": "e59169be543a0f110e45984d92c08353addc657cbf88822789786df208b0f18d",
    "timestamp": "2022-03-13T04:39:55.070527Z",
    "utsname_machine": "x86_64",
    "utsname_release": "5.15.0-0.bpo.3-amd64",
    "utsname_sysname": "Linux",
    "utsname_version": "#1 SMP Debian 5.15.15-2~bpo11+1 (2022-02-03)" 
}

Actions #1

Updated by Telemetry Bot about 2 years ago

  • Crash signature (v1) updated (diff)
  • Crash signature (v2) updated (diff)
  • Affected Versions v16.2.7 added
Actions #2

Updated by Telemetry Bot 11 months ago

  • Crash signature (v1) updated (diff)
  • Affected Versions v17.2.5 added
Actions

Also available in: Atom PDF