Bug #52166

closed

crash: void Device::binding_port(ceph::common::CephContext*, int): assert(active_port)

Added by Telemetry Bot over 2 years ago. Updated about 2 years ago.

Status:
Won't Fix
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:
0%

Source:
Telemetry
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):

286ebf698e3a074f8bf5a79c1e4483a776c0e5bb6cafc634f0130ceacebabbae
7afc9fb6ba07cb0afc429fb9a15bb413b15fc4bec8b9e00ef3bb2f176a1fef2e
8c306905582a9d790b7524474fbd6d407b2cf1cadbb548be9bf372c9be959f3b
f65d40d90f5ff10d34a176a690ed6163af549066b75a84c50ae6a86f08207bd4


Description

http://telemetry.front.sepia.ceph.com:4000/d/jByk5HaMz/crash-spec-x-ray?orgId=1&var-sig_v2=8c306905582a9d790b7524474fbd6d407b2cf1cadbb548be9bf372c9be959f3b

Assert condition: active_port
Assert function: void Device::binding_port(ceph::common::CephContext*, int)
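For anyone trying to reproduce this on an affected host, the following is a minimal diagnostic sketch, not part of Ceph. It assumes the assertion fires because no port in the ACTIVE state was found on the configured RDMA device, and that the kernel exposes port state under /sys/class/infiniband/<device>/ports/<port>/state in the form "4: ACTIVE". The function name active_rdma_ports and its sysfs_root parameter are hypothetical.

```python
from pathlib import Path


def active_rdma_ports(sysfs_root="/sys/class/infiniband"):
    """Return (device, port) pairs whose sysfs state reads ACTIVE.

    Hypothetical helper for diagnosis only; it does not call into Ceph
    or libibverbs, it just reads the kernel's sysfs view of each port.
    """
    root = Path(sysfs_root)
    active = []
    if not root.is_dir():
        # No RDMA devices are registered with the kernel on this host.
        return active
    for state_file in sorted(root.glob("*/ports/*/state")):
        # The file content looks like "4: ACTIVE" or "1: DOWN".
        state = state_file.read_text().strip()
        if state.endswith("ACTIVE"):
            device = state_file.parts[-4]
            port = int(state_file.parts[-2])
            active.append((device, port))
    return active


if __name__ == "__main__":
    ports = active_rdma_ports()
    if ports:
        for device, port in ports:
            print(f"{device} port {port} is ACTIVE")
    else:
        print("no ACTIVE RDMA port found")
```

An empty result on a host where ceph-osd is configured for RDMA would be consistent with the failed ceph_assert(active_port) above, although it does not prove that this is the cause.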

Sanitized backtrace:

    Device::binding_port(ceph::common::CephContext*, int)
    Infiniband::init()
    RDMAWorker::listen(entity_addr_t&, unsigned int, SocketOptions const&, ServerSocket*)
    EventCenter::process_events(unsigned int, std::chrono::duration<unsigned long, std::ratio<1l, 1000000000l> >*)
    clone()

Crash dump sample:
{
    "assert_condition": "active_port",
    "assert_file": "msg/async/rdma/Infiniband.cc",
    "assert_func": "void Device::binding_port(ceph::common::CephContext*, int)",
    "assert_line": 154,
    "assert_msg": "msg/async/rdma/Infiniband.cc: In function 'void Device::binding_port(ceph::common::CephContext*, int)' thread 7fabe8a94700 time 2021-06-16T06:34:27.408160-0600\nmsg/async/rdma/Infiniband.cc: 154: FAILED ceph_assert(active_port)",
    "assert_thread_name": "msgr-worker-0",
    "backtrace": [
        "(()+0x12730) [0x7fabecd44730]",
        "(gsignal()+0x10b) [0x7fabec8257bb]",
        "(abort()+0x121) [0x7fabec810535]",
        "(ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x1a5) [0x5655349c9b09]",
        "(()+0x9c5c90) [0x5655349c9c90]",
        "(Device::binding_port(ceph::common::CephContext*, int)+0x516) [0x5655353e80f6]",
        "(Infiniband::init()+0x1ed) [0x5655353e8c3d]",
        "(RDMAWorker::listen(entity_addr_t&, unsigned int, SocketOptions const&, ServerSocket*)+0x2c) [0x56553520c9fc]",
        "(()+0x11eea2e) [0x5655351f2a2e]",
        "(EventCenter::process_events(unsigned int, std::chrono::duration<unsigned long, std::ratio<1l, 1000000000l> >*)+0x13df) [0x56553520236f]",
        "(()+0x120326b) [0x56553520726b]",
        "(()+0xbbb2f) [0x7fabecc09b2f]",
        "(()+0x7fa3) [0x7fabecd39fa3]",
        "(clone()+0x3f) [0x7fabec8e74cf]" 
    ],
    "ceph_version": "15.2.13",
    "crash_id": "2021-06-16T12:34:27.436676Z_83ac6ebe-6557-4732-8ae0-be22065a10de",
    "entity_name": "osd.46b2717dfa5762966957aafb65cbc1e38b7e4712",
    "os_id": "10",
    "os_name": "Debian GNU/Linux 10 (buster)",
    "os_version": "10 (buster)",
    "os_version_id": "10",
    "process_name": "ceph-osd",
    "stack_sig": "7afc9fb6ba07cb0afc429fb9a15bb413b15fc4bec8b9e00ef3bb2f176a1fef2e",
    "timestamp": "2021-06-16T12:34:27.436676Z",
    "utsname_machine": "x86_64",
    "utsname_release": "5.4.119-1-pve",
    "utsname_sysname": "Linux",
    "utsname_version": "#1 SMP PVE 5.4.119-1 (Tue, 01 Jun 2021 15:32:00 +0200)" 
}
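The relationship between the raw "backtrace" array and the sanitized backtrace listed earlier can be sketched as follows. This is an illustration only, not the telemetry service's actual algorithm: it assumes that sanitizing means dropping offsets and addresses, unnamed "()" frames, and the frames of the assert/abort machinery. The names FRAME_RE, NOISE_PREFIXES and sanitize_backtrace are hypothetical.

```python
import re

# A raw frame looks like "(symbol+0xOFFSET) [0xADDRESS]".
# Frames without symbol information use "()" as the symbol.
FRAME_RE = re.compile(
    r"^\((?P<symbol>.*)\+0x[0-9a-fA-F]+\) \[0x[0-9a-fA-F]+\]$"
)

# Assumption: frames from signal delivery and assert handling are noise.
NOISE_PREFIXES = ("gsignal()", "abort()", "ceph::__ceph_assert_fail(")


def sanitize_backtrace(frames):
    """Reduce raw crash frames to a list of named function signatures."""
    named = []
    for frame in frames:
        match = FRAME_RE.match(frame.strip())
        if match is None:
            continue  # skip anything that is not a recognizable frame
        symbol = match.group("symbol")
        if symbol == "()" or symbol.startswith(NOISE_PREFIXES):
            continue  # unnamed frame or assert/abort machinery
        named.append(symbol)
    return named
```

Applied to the "backtrace" array of the crash dump sample, this keeps the five frames shown under "Sanitized backtrace", from Device::binding_port down to clone().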

Actions #1

Updated by Telemetry Bot over 2 years ago

  • Crash signature (v1) updated (diff)
  • Crash signature (v2) updated (diff)
  • Affected Versions v15.2.10, v15.2.13, v15.2.6 added
Actions #2

Updated by Neha Ojha over 2 years ago

  • Status changed from New to Won't Fix
  • Crash signature (v1) updated (diff)

RDMA is not being actively worked on, and a single cluster is reporting all of these crashes.

Actions #3

Updated by Telemetry Bot about 2 years ago

  • Crash signature (v1) updated (diff)
  • Crash signature (v2) updated (diff)