Project

General

Profile

Bug #52233

crash: void Infiniband::init(): assert(device)

Added by Telemetry Bot 2 months ago. Updated about 2 months ago.

Status:
New
Priority:
Low
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
Telemetry
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):

72beec4873cd300ac60ac3713f1aa7199ee2a398fed925e0c9154b0a1ee694e7

Crash signature (v2):

184ea175092db1eb5f584b66abb346fd6d953e093be837808b7e5e6d52078f41


Description

http://telemetry.front.sepia.ceph.com:4000/d/jByk5HaMz/crash-spec-x-ray?orgId=1&var-sig_v2=184ea175092db1eb5f584b66abb346fd6d953e093be837808b7e5e6d52078f41

Assert condition: device
Assert function: void Infiniband::init()

Sanitized backtrace:

    Infiniband::init()
    RDMAWorker::listen(entity_addr_t&, unsigned int, SocketOptions const&, ServerSocket*)
    EventCenter::process_events(unsigned int, std::chrono::duration<unsigned long, std::ratio<1l, 1000000000l> >*)
    clone()

Crash dump sample:
{
    "archived": "2021-07-19 01:48:41.268759",
    "assert_condition": "device",
    "assert_file": "msg/async/rdma/Infiniband.cc",
    "assert_func": "void Infiniband::init()",
    "assert_line": 1061,
    "assert_msg": "msg/async/rdma/Infiniband.cc: In function 'void Infiniband::init()' thread 7f059f07d700 time 2021-07-18T18:28:06.119700-0700\nmsg/async/rdma/Infiniband.cc: 1061: FAILED ceph_assert(device)",
    "assert_thread_name": "msgr-worker-0",
    "backtrace": [
        "(()+0x14140) [0x7f05a30ed140]",
        "(gsignal()+0x141) [0x7f05a2bb8ce1]",
        "(abort()+0x123) [0x7f05a2ba2537]",
        "(ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x17b) [0x563a6be060b7]",
        "(()+0x9d01f8) [0x563a6be061f8]",
        "(Infiniband::init()+0xbe1) [0x563a6c8a3291]",
        "(RDMAWorker::listen(entity_addr_t&, unsigned int, SocketOptions const&, ServerSocket*)+0x2c) [0x563a6c6a3dcc]",
        "(()+0x12521de) [0x563a6c6881de]",
        "(EventCenter::process_events(unsigned int, std::chrono::duration<unsigned long, std::ratio<1l, 1000000000l> >*)+0x718) [0x563a6c6983a8]",
        "(()+0x1267c4a) [0x563a6c69dc4a]",
        "(()+0xceed0) [0x7f05a2f70ed0]",
        "(()+0x8ea7) [0x7f05a30e1ea7]",
        "(clone()+0x3f) [0x7f05a2c7adef]" 
    ],
    "ceph_version": "15.2.13",
    "crash_id": "2021-07-19T01:28:06.130756Z_6e03d96d-4f3a-4b74-a524-2201be6a746d",
    "entity_name": "osd.ca04bbd0aaf1e8fe2410a84addc0b64c254de092",
    "os_id": "11",
    "os_name": "Debian GNU/Linux 11 (bullseye)",
    "os_version": "11 (bullseye)",
    "os_version_id": "11",
    "process_name": "ceph-osd",
    "stack_sig": "72beec4873cd300ac60ac3713f1aa7199ee2a398fed925e0c9154b0a1ee694e7",
    "timestamp": "2021-07-19T01:28:06.130756Z",
    "utsname_machine": "x86_64",
    "utsname_release": "5.11.22-2-pve",
    "utsname_sysname": "Linux",
    "utsname_version": "#1 SMP PVE 5.11.22-3 (Sun, 11 Jul 2021 13:45:15 +0200)" 
}

History

#1 Updated by Telemetry Bot 2 months ago

  • Crash signature (v1) updated (diff)
  • Crash signature (v2) updated (diff)
  • Affected Versions v15.2.13 added

#2 Updated by Neha Ojha about 2 months ago

  • Priority changed from Normal to Low

One cluster is reporting all the crashes.

Also available in: Atom PDF