Bug #59768

closed

crash: void EMetaBlob::replay(MDSRank*, LogSegment*, MDPeerUpdate*): assert(g_conf()->mds_wipe_sessions)

Added by Telemetry Bot 12 months ago. Updated 8 months ago.

Status:
Duplicate
Priority:
Normal
Category:
-
Target version:
-
% Done:
0%

Source:
Telemetry
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
ceph-qa-suite:
Component(FS):
Labels (FS):
Pull request ID:
Crash signature (v1):

091ccd8cb7bd1522d12ce6b17544a404770ed509286192ec847feaa8de9d6efe
37902c97220e24b205237e7067e6ecd552c18160d67ee10814c97ea8d7c3b3be
419d91a42dfdc5f5068bf2ff5e5945f889992e19d10eccccfd77b8bcdfd8719f
423b3aef909232279de647969b5b9b85791f282dae2ccae040b775e1014f27f1
4a18347d09157f6de24dde3d395d3e75ff284a425a5fe15d4b03a4e7e864713f
50f3960239ab3a73aa0eb541a928275eef5159eebb679247c291581f193aa17e
6d059dfc1e090e5e4743cc7b64eb2212fea19dfeb18c9e2290cf6e0520c544a4
d1ed6561318fa32a99d556ee561d6fe798cd283765ce83010b0d07e377741f16
e7440620618d7f0acd70453f419182f157a0e59ce25f981e5bed7042cbff62ad
f261349ee6277ac7d513d97440cfca36885fc807654f2b3b9f9a2022adf3f233


Description

http://telemetry.front.sepia.ceph.com:4000/d/jByk5HaMz/crash-spec-x-ray?orgId=1&var-sig_v2=1a84e31a4bc3ae6dc69d901c1f7aad8377b9a8188188204e17cef44a700d0565

Assert condition: g_conf()->mds_wipe_sessions
Assert function: void EMetaBlob::replay(MDSRank*, LogSegment*, MDPeerUpdate*)

Sanitized backtrace:

    EMetaBlob::replay(MDSRank*, LogSegment*, MDPeerUpdate*)
    EUpdate::replay(MDSRank*)
    MDLog::_replay_thread()
    MDLog::ReplayThread::entry()

Crash dump sample:
{
    "assert_condition": "g_conf()->mds_wipe_sessions",
    "assert_file": "mds/journal.cc",
    "assert_func": "void EMetaBlob::replay(MDSRank*, LogSegment*, MDPeerUpdate*)",
    "assert_line": 1618,
    "assert_msg": "mds/journal.cc: In function 'void EMetaBlob::replay(MDSRank*, LogSegment*, MDPeerUpdate*)' thread 7f5c3163e700 time 2023-03-24T00:21:12.944648+0000\nmds/journal.cc: 1618: FAILED ceph_assert(g_conf()->mds_wipe_sessions)",
    "assert_thread_name": "md_log_replay",
    "backtrace": [
        "/lib64/libpthread.so.0(+0x12c20) [0x7f5c40651c20]",
        "gsignal()",
        "abort()",
        "(ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x1a9) [0x7f5c41663ba3]",
        "/usr/lib64/ceph/libceph-common.so.2(+0x276d6c) [0x7f5c41663d6c]",
        "(EMetaBlob::replay(MDSRank*, LogSegment*, MDPeerUpdate*)+0x5ae5) [0x55ec215558c5]",
        "(EUpdate::replay(MDSRank*)+0x40) [0x55ec215572e0]",
        "(MDLog::_replay_thread()+0xcd1) [0x55ec214dd8e1]",
        "(MDLog::ReplayThread::entry()+0x11) [0x55ec211df311]",
        "/lib64/libpthread.so.0(+0x817a) [0x7f5c4064717a]",
        "clone()" 
    ],
    "ceph_version": "16.2.7",
    "crash_id": "2023-03-24T00:21:12.947817Z_835df170-a907-4ee6-8e09-63e6d0f3564a",
    "entity_name": "mds.3f5cc9fd68718a579f08de0136ed981232bc2600",
    "os_id": "centos",
    "os_name": "CentOS Linux",
    "os_version": "8",
    "os_version_id": "8",
    "process_name": "ceph-mds",
    "stack_sig": "37902c97220e24b205237e7067e6ecd552c18160d67ee10814c97ea8d7c3b3be",
    "timestamp": "2023-03-24T00:21:12.947817Z",
    "utsname_machine": "x86_64",
    "utsname_release": "5.15.0-67-generic",
    "utsname_sysname": "Linux",
    "utsname_version": "#74~20.04.1-Ubuntu SMP Wed Feb 22 14:52:34 UTC 2023" 
}
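
For readers comparing the two listings above, here is a minimal sketch of how the four-frame sanitized backtrace could be derived from the raw "backtrace" array in the crash dump. It is an illustration under stated assumptions, not the sanitizer that Ceph or the telemetry backend actually uses: the frame pattern, the IGNORED_PREFIXES list, and the function name sanitize_backtrace are all hypothetical choices made to fit this one sample.

```python
import re

# Assumed frame shape, taken from the sample: "(symbol+0xOFFSET) [0xADDRESS]".
# Frames without a symbol (library offsets, "gsignal()", "abort()", "clone()")
# do not match and are dropped.
FRAME_RE = re.compile(r"^\((?P<symbol>.+)\+0x[0-9a-f]+\) \[0x[0-9a-f]+\]$")

# Hypothetical skip list for assertion machinery; not Ceph's own list.
IGNORED_PREFIXES = ("ceph::__ceph_assert_fail",)


def sanitize_backtrace(frames):
    """Return only the symbol names, without offsets or addresses."""
    sanitized = []
    for frame in frames:
        match = FRAME_RE.match(frame.strip())
        if match is None:
            continue
        symbol = match.group("symbol")
        if symbol.startswith(IGNORED_PREFIXES):
            continue
        sanitized.append(symbol)
    return sanitized


# The "backtrace" array copied from the crash dump sample in this report.
raw_backtrace = [
    "/lib64/libpthread.so.0(+0x12c20) [0x7f5c40651c20]",
    "gsignal()",
    "abort()",
    "(ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x1a9) [0x7f5c41663ba3]",
    "/usr/lib64/ceph/libceph-common.so.2(+0x276d6c) [0x7f5c41663d6c]",
    "(EMetaBlob::replay(MDSRank*, LogSegment*, MDPeerUpdate*)+0x5ae5) [0x55ec215558c5]",
    "(EUpdate::replay(MDSRank*)+0x40) [0x55ec215572e0]",
    "(MDLog::_replay_thread()+0xcd1) [0x55ec214dd8e1]",
    "(MDLog::ReplayThread::entry()+0x11) [0x55ec211df311]",
    "/lib64/libpthread.so.0(+0x817a) [0x7f5c4064717a]",
    "clone()",
]

for symbol in sanitize_backtrace(raw_backtrace):
    print(symbol)
```

On this sample the sketch keeps the same four frames listed under "Sanitized backtrace", from EMetaBlob::replay down to MDLog::ReplayThread::entry. Other crash dumps may use frame formats that this pattern does not cover.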


Related issues 1 (0 open, 1 closed)

Related to CephFS - Bug #58489: mds stuck in 'up:replay' and crashed. (Resolved, Xiubo Li)

Actions #1

Updated by Telemetry Bot 12 months ago

  • Crash signature (v1) updated (diff)
  • Crash signature (v2) updated (diff)
  • Affected Versions v16.2.1, v16.2.10, v16.2.11, v16.2.5, v16.2.6, v16.2.7, v17.2.1, v17.2.3, v17.2.5 added
Actions #2

Updated by Milind Changire 10 months ago

  • Assignee set to Neeraj Pratap Singh
  • Crash signature (v1) updated (diff)
Actions #3

Updated by Neeraj Pratap Singh 8 months ago

While debugging this issue, I found that it no longer seems to occur.
I also found this PR, which fixed the crash: https://github.com/ceph/ceph/pull/49970
@Xiubo, can you please take a look and confirm?

Actions #4

Updated by Xiubo Li 8 months ago

Neeraj Pratap Singh wrote:

While debugging this issue, I found that it no longer seems to occur.
I also found this PR, which fixed the crash: https://github.com/ceph/ceph/pull/49970
@Xiubo, can you please take a look and confirm?

@Neeraj,

Yeah, they should be the same issue.

Actions #5

Updated by Venky Shankar 8 months ago

  • Related to Bug #58489: mds stuck in 'up:replay' and crashed. added
Actions #6

Updated by Venky Shankar 8 months ago

  • Status changed from New to Duplicate