Project

General

Profile

Bug #59785

crash: void ScatterLock::set_xlock_snap_sync(MDSContext*): assert(state == LOCK_XLOCK || state == LOCK_XLOCKDONE)

Added by Telemetry Bot 10 months ago. Updated 7 months ago.

Status:
Closed
Priority:
Normal
Category:
-
Target version:
-
% Done:

0%

ceph-qa-suite:
Component(FS):
Labels (FS):
Pull request ID:
Crash signature (v1):

0d5474f3dcdc83c39d61885da32fb98ca4365b710389add6e01f2d8dad97e76a
1f4bea291a029fbd031643a239d97bd137e4358eafb566a428e2e2fa24b73e02
3112ad14b627065405e1aa4d3e7d847bcfd4c7f9b83435e5e92486941cef6795
345cc31387bcc10544cc1b037ad8a915981bc12fd2dd425f894a829a1d7c4191
36a124e017ec6d574bd6b6f8dee18fa7575ccbdcfcdeba09bcb79ec235a09785
58af346449e9bd2c3e7512289984d2756d764bb6c99cfd850b4af333f4d597f1
5f310d14ffe4b2600195c874fba3761c268218711ee4a449413862bb5553fb4c
683679603ccaead18ac3489ee1489b27e2794f55b103a69b332f9f93fc58bb1c
6ba4111231440bcbf8eb265bb83bf2b025a6a75ce01bf484ca135da00657846c
7656ec7e2e2057171b2c2951b393ad14c0addc224b6e140dffadbec12a25ccca
7678fc0af9c13d96faa09a482699bb7b33bd96faeb0d06ab359b49ff51f924c0
76d4c6e42702f113cea259903bc4b1c3f5e926a477c6724d503b7d5e4068a0ef
926099b4c7074bf7e7ea9020e46f0e44b9c3ccddd143b2ddf8f61e4d7eb0eefe
93f106979cd421e1642ce757264672aa027d544485e459371de9b228bfeeb00e
957c2dcc66e018c6469f350f0ed664fc78fc5abea20d34a23dffbab781c8706b
aa551d863a992131c929a2f4310cf0a3f63b249b03f64701c6ac74c616bb71cf
aa888d706e78863e0a247551453f12c54841b168c2f04989b0c806cb80918918
bd294db91a300255a59e77db239b7a77ef8a55e418e184313d34064ee0f4baab
cc557c5f35ac8ab3d9e780a4749b4ff5c0bedbdf68996af02ad915701a75b0f0
d54bd212563c0b2d93568f405365fe91e0d276f1a75a26ad20b257441ef09f5c
df0ddb28d887fe8bc930093eec09df5b05bc3ff84d48402025919314a79ecd0c
ea27463a0f321abbb4bb32662d6512a42b8dfc29a36438b5e6e0c729a0cce193
f2be2847705465e10f2cd7106edaa8469eb70cafd534b36dafb8c32f5cdb9641


Description

New crash events were reported via Telemetry with newer versions (['17.2.1', '17.2.5']) than encountered in Tracker (17.0.0).

http://telemetry.front.sepia.ceph.com:4000/d/jByk5HaMz/crash-spec-x-ray?orgId=1&var-sig_v2=0cc462317baee377357357c833913c76af55248fe612e5e634489b1cdd602c8e

Assert condition: state LOCK_XLOCK || state LOCK_XLOCKDONE
Assert function: void ScatterLock::set_xlock_snap_sync(MDSContext*)

Sanitized backtrace:

    MDCache::truncate_inode(CInode*, LogSegment*)
    C_MDS_inode_update_finish::finish(int)
    MDSContext::complete(int)
    MDSIOContextBase::complete(int)
    MDSLogContextBase::complete(int)
    Finisher::finisher_thread_entry()

Crash dump sample:
{
    "archived": "2023-05-03 01:09:57.631954",
    "assert_condition": "state == LOCK_XLOCK || state == LOCK_XLOCKDONE",
    "assert_file": "mds/ScatterLock.h",
    "assert_func": "void ScatterLock::set_xlock_snap_sync(MDSContext*)",
    "assert_line": 59,
    "assert_msg": "mds/ScatterLock.h: In function 'void ScatterLock::set_xlock_snap_sync(MDSContext*)' thread 7f88eb909700 time 2023-05-02T17:59:07.870137-0700\nmds/ScatterLock.h: 59: FAILED ceph_assert(state == LOCK_XLOCK || state == LOCK_XLOCKDONE)",
    "assert_thread_name": "MR_Finisher",
    "backtrace": [
        "(()+0x12cf0) [0x7f88f9aa5cf0]",
        "(gsignal()+0x10f) [0x7f88f8502aff]",
        "(abort()+0x127) [0x7f88f84d5ea5]",
        "(ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x1a9) [0x7f88faec5be3]",
        "(()+0x27ddac) [0x7f88faec5dac]",
        "(MDCache::truncate_inode(CInode*, LogSegment*)+0x32c) [0x5565e7fdd62c]",
        "(C_MDS_inode_update_finish::finish(int)+0x133) [0x5565e7fb4a83]",
        "(MDSContext::complete(int)+0x56) [0x5565e81c6656]",
        "(MDSIOContextBase::complete(int)+0x39c) [0x5565e81c6b5c]",
        "(MDSLogContextBase::complete(int)+0x44) [0x5565e81c6cb4]",
        "(Finisher::finisher_thread_entry()+0x1a5) [0x7f88faf56a95]",
        "(()+0x81ca) [0x7f88f9a9b1ca]",
        "(clone()+0x43) [0x7f88f84ede73]" 
    ],
    "ceph_version": "15.2.17",
    "crash_id": "2023-05-03T00:59:07.872266Z_a3a98d29-fef2-4172-b307-b268db51a9ca",
    "entity_name": "mds.6c892a556f9e0b190562b8cc5ffe5bbcc4c09785",
    "os_id": "rhel",
    "os_name": "Red Hat Enterprise Linux",
    "os_version": "8.7 (Ootpa)",
    "os_version_id": "8.7",
    "process_name": "ceph-mds",
    "stack_sig": "926099b4c7074bf7e7ea9020e46f0e44b9c3ccddd143b2ddf8f61e4d7eb0eefe",
    "timestamp": "2023-05-03T00:59:07.872266Z",
    "utsname_machine": "x86_64",
    "utsname_release": "4.18.0-425.10.1.el8_7.x86_64",
    "utsname_sysname": "Linux",
    "utsname_version": "#1 SMP Wed Dec 14 16:00:01 EST 2022" 
}


Related issues

Related to CephFS - Bug #54730: crash: void ScatterLock::set_xlock_snap_sync(MDSContext*): assert(state == LOCK_XLOCK || state == LOCK_XLOCKDONE) Resolved
Related to CephFS - Bug #49132: mds crashed "assert_condition": "state == LOCK_XLOCK || state == LOCK_XLOCKDONE", Resolved
Related to CephFS - Backport #58322: quincy: mds crashed "assert_condition": "state == LOCK_XLOCK || state == LOCK_XLOCKDONE", Resolved
Related to CephFS - Backport #58323: pacific: mds crashed "assert_condition": "state == LOCK_XLOCK || state == LOCK_XLOCKDONE", Resolved

History

#1 Updated by Telemetry Bot 10 months ago

  • Related to Bug #54730: crash: void ScatterLock::set_xlock_snap_sync(MDSContext*): assert(state == LOCK_XLOCK || state == LOCK_XLOCKDONE) added

#2 Updated by Telemetry Bot 10 months ago

  • Related to Bug #49132: mds crashed "assert_condition": "state == LOCK_XLOCK || state == LOCK_XLOCKDONE", added

#3 Updated by Telemetry Bot 10 months ago

  • Related to Backport #58322: quincy: mds crashed "assert_condition": "state == LOCK_XLOCK || state == LOCK_XLOCKDONE", added

#4 Updated by Telemetry Bot 10 months ago

  • Related to Backport #58323: pacific: mds crashed "assert_condition": "state == LOCK_XLOCK || state == LOCK_XLOCKDONE", added

#5 Updated by Telemetry Bot 10 months ago

  • Crash signature (v1) updated (diff)
  • Crash signature (v2) updated (diff)
  • Affected Versions v14.2.11, v14.2.15, v14.2.16, v14.2.22, v14.2.9, v15.2.10, v15.2.13, v15.2.14, v15.2.15, v15.2.16, v15.2.17, v15.2.8, v16.2.10, v16.2.11, v16.2.7, v16.2.9, v17.2.1, v17.2.5 added

#6 Updated by Milind Changire 8 months ago

  • Assignee set to Milind Changire
  • Crash signature (v1) updated (diff)

#7 Updated by Milind Changire 7 months ago

Venky,
What versions do I backport PR#48743 to?
That PR is only available in version 18.x

This is the older tracker: https://tracker.ceph.com/issues/49132 related to the PR

#8 Updated by Venky Shankar 7 months ago

Milind Changire wrote:

Venky,
What versions do I backport PR#48743 to?
That PR is only available in version 18.x

This is the older tracker: https://tracker.ceph.com/issues/49132 related to the PR

Isn't the fix available in p/q releases?

- https://tracker.ceph.com/issues/58323
- https://tracker.ceph.com/issues/58322

#9 Updated by Milind Changire 7 months ago

Venky Shankar wrote:

Milind Changire wrote:

Venky,
What versions do I backport PR#48743 to?
That PR is only available in version 18.x

This is the older tracker: https://tracker.ceph.com/issues/49132 related to the PR

Isn't the fix available in p/q releases?

- https://tracker.ceph.com/issues/58323
- https://tracker.ceph.com/issues/58322

Wasn't thinking straight.
The backports are indeed available.
This shouldn't be seen in latest builds: v16.2.12 and later ... and ceph-17.2.6-2 and later

#10 Updated by Venky Shankar 7 months ago

Milind Changire wrote:

Venky Shankar wrote:

Milind Changire wrote:

Venky,
What versions do I backport PR#48743 to?
That PR is only available in version 18.x

This is the older tracker: https://tracker.ceph.com/issues/49132 related to the PR

Isn't the fix available in p/q releases?

- https://tracker.ceph.com/issues/58323
- https://tracker.ceph.com/issues/58322

Wasn't thinking straight.
The backports are indeed available.
This shouldn't be seen in latest builds: v16.2.12 and later ... and ceph-17.2.6-2 and later

Correct. I think you can close this tracker mentioning the tags where the fix is available for supported releases.

#11 Updated by Milind Changire 7 months ago

  • Status changed from New to Closed

This issue won't be seen in latest builds for pacific v16.2.12 and later ... and quincy ceph-17.2.6-2 and later.

Also available in: Atom PDF