Bug #62992

open

Heartbeat crash in reset_timeout and clear_timeout

Added by Laura Flores 8 months ago. Updated 23 days ago.

Status: Pending Backport
Priority: Normal
Category: -
Target version: -
% Done: 0%
Source:
Tags: backport_processed
Backport: reef
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

/a/lflores-2023-09-08_18:08:19-rados-wip-lflores-testing-2023-09-08-1504-reef-distro-default-smithi/7391228

{
    "crash_id": "2023-09-08T19:46:54.998286Z_6b0fe488-8ceb-4ba3-bae3-92b5ba65667c",
    "timestamp": "2023-09-08T19:46:54.998286Z",
    "process_name": "memcheck-amd64-",
    "entity_name": "osd.7",
    "ceph_version": "18.2.0-400-g3a47b8b8",
    "utsname_hostname": "smithi154",
    "utsname_sysname": "Linux",
    "utsname_release": "5.14.0-363.el9.x86_64",
    "utsname_version": "#1 SMP PREEMPT_DYNAMIC Tue Sep 5 18:30:19 UTC 2023",
    "utsname_machine": "x86_64",
    "os_name": "CentOS Stream",
    "os_id": "centos",
    "os_version_id": "9",
    "os_version": "9",
    "backtrace": [
        "/lib64/libc.so.6(+0x54db0) [0x549fdb0]",
        "/lib64/libc.so.6(+0xa154c) [0x54ec54c]",
        "(ceph::HeartbeatMap::_check(ceph::heartbeat_handle_d const*, char const*, std::chrono::time_point<ceph::coarse_mono_clock, std::chrono::duration<unsigned long, std::ratio<1l, 1000000000l> > >)+0x226) [0xb9ac76]",
        "(ceph::HeartbeatMap::reset_timeout(ceph::heartbeat_handle_d*, std::chrono::duration<unsigned long, std::ratio<1l, 1000000000l> >, std::chrono::duration<unsigned long, std::ratio<1l, 1000000000l> >)+0x70) [0xb9ad70]",
        "(ShardedThreadPool::shardedthreadpool_worker(unsigned int)+0x248) [0xbaace8]",
        "ceph-osd(+0xaa3264) [0xbab264]",
        "/lib64/libc.so.6(+0x9f802) [0x54ea802]",
        "clone()"
    ]
}

http://telemetry.front.sepia.ceph.com:4000/d/jByk5HaMz/crash-spec-x-ray?var-sig_v2=e037b11ac9f7a2e20c20fcf65d4439bf3fe294cbd5fae43cf74d9e92710d56e6&orgId=1


Related issues 2 (2 open, 0 closed)

Related to RADOS - Bug #64637: LeakPossiblyLost in BlueStore::_do_write_small() in osd (New)
Copied to RADOS - Backport #63559: reef: Heartbeat crash in osd (In Progress, Matan Breizman)
