Project

General

Profile

Actions

Bug #44903

closed

Ceph - CephFS kernel client crashed with kernel BUG at fs/ceph/mds_client.c:2100!

Added by Mathias Lindberg about 4 years ago. Updated about 4 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
-
Category:
fs/ceph
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Crash signature (v1):
Crash signature (v2):

Description

- CentOS Linux release 7.7.1908 (Core)
- kernel 3.10.0-1062.12.1.el7.x86_64
- Ceph 13.2.8

[2935239.684346] kernel BUG at fs/ceph/mds_client.c:2100!
[2935239.689471] invalid opcode: 0000 [#1] SMP
[2935239.693783] Modules linked in: squashfs loop overlay(T) osc(OE) mgc(OE) lustre(OE) lmv(OE) fld(OE) mdc(OE) fid(OE) lov(OE) ksocklnd(OE) ptlrpc(OE) obdclass(OE) lnet(OE) rpcsec_gss_krb5 auth_rpcgss nfsv4 nfs libcfs(OE) lockd grace fscache ceph libceph libcrc32c dns_resolver 8021q garp mrp stp llc ib_isert iscsi_target_mod ib_srpt target_core_mod ib_srp scsi_transport_srp scsi_tgt rpcrdma rdma_ucm ib_iser sunrpc ib_umad ib_ipoib rdma_cm iw_cm libiscsi scsi_transport_iscsi ib_cm mlx5_ib ib_uverbs ib_core iTCO_wdt iTCO_vendor_support zfs(POE) zunicode(POE) zavl(POE) icp(POE) skx_edac intel_powerclamp coretemp intel_rapl iosf_mbi kvm_intel kvm irqbypass crc32_pclmul ghash_clmulni_intel zcommon(POE) znvpair(POE) aesni_intel lrw gf128mul glue_helper spl(OE) ablk_helper cryptd pcspkr mei_me joydev sg mei
[2935239.766074] i2c_i801 lpc_ich wmi ipmi_si ipmi_devintf ipmi_msghandler pcc_cpufreq acpi_power_meter acpi_pad binfmt_misc ip_tables ext4 mbcache jbd2 sd_mod crc_t10dif crct10dif_generic ast i2c_algo_bit drm_kms_helper mlx5_core syscopyarea sysfillrect sysimgblt ixgbe fb_sys_fops ttm mlxfw ahci devlink crct10dif_pclmul mdio crct10dif_common drm crc32c_intel libahci ptp libata pps_core dca drm_panel_orientation_quirks nfit libnvdimm [last unloaded: beegfs]
[2935239.805878] CPU: 38 PID: 35993 Comm: MATLAB Kdump: loaded Tainted: P OE ------------ T 3.10.0-1062.12.1.el7.x86_64 #1
[2935239.817575] Hardware name: Intel Corporation S2600BPB/S2600BPB, BIOS SE5C620.86B.02.01.0010.010620200716 01/06/2020
[2935239.828149] task: ffff9b56cea09070 ti: ffff9b3053ee0000 task.ti: ffff9b3053ee0000
[2935239.835781] RIP: 0010:[<ffffffffc08ba1fd>] [<ffffffffc08ba1fd>] prepare_send_request+0x7fd/0x830 [ceph]
[2935239.845621] RSP: 0018:ffff9b3053ee3af8 EFLAGS: 00010297
[2935239.851096] RAX: ffff9b4aced68ecd RBX: ffff9b5784aaf000 RCX: 000000005e844b48
[2935239.858385] RDX: 0000000035a3bdfc RSI: 0000000000000000 RDI: ffff9b4aced68ec5
[2935239.865672] RBP: ffff9b3053ee3b98 R08: 0000000000000000 R09: 0000000000000000
[2935239.872962] R10: ffff9b297fc07800 R11: 0000000000000000 R12: ffff9b2fb9cc0780
[2935239.880251] R13: 0000000000000000 R14: ffff9b3fdd12e000 R15: ffff9b4aced68e40
[2935239.887540] FS: 00002b7e8c000700(0000) GS:ffff9b3fdf380000(0000) knlGS:0000000000000000
[2935239.895781] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[2935239.901683] CR2: 000000000127b930 CR3: 0000000eccf06000 CR4: 00000000007607e0
[2935239.908975] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[2935239.916265] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[2935239.923552] PKRU: 55555554
[2935239.926430] Call Trace:
[2935239.929072] [<ffffffffc08ba572>] __do_request+0x342/0x430 [ceph]
[2935239.935338] [<ffffffffc08bbccd>] ceph_mdsc_do_request+0x9d/0x280 [ceph]
[2935239.942202] [<ffffffffc089b901>] ceph_d_revalidate+0x221/0x520 [ceph]
[2935239.948894] [<ffffffff8be6620a>] ? __d_lookup+0x7a/0x160
[2935239.954459] [<ffffffff8be56c3a>] lookup_fast+0x1da/0x230
[2935239.960023] [<ffffffff8be5afbd>] path_lookupat+0x16d/0x8b0
[2935239.965756] [<ffffffff8be24f75>] ? kmem_cache_alloc+0x35/0x1f0
[2935239.971839] [<ffffffff8be5c4df>] ? getname_flags+0x4f/0x1a0
[2935239.977665] [<ffffffff8be5b72b>] filename_lookup+0x2b/0xc0
[2935239.983409] [<ffffffff8be5d677>] user_path_at_empty+0x67/0xc0
[2935239.989409] [<ffffffff8bcd38c1>] ? __wake_up_common_lock+0x91/0xc0
[2935239.995840] [<ffffffff8bcd3360>] ? task_rq_unlock+0x20/0x20
[2935240.001668] [<ffffffff8be5d6e1>] user_path_at+0x11/0x20
[2935240.007149] [<ffffffff8be503c3>] vfs_fstatat+0x63/0xc0
[2935240.012542] [<ffffffff8be5077e>] SYSC_newstat+0x2e/0x60
[2935240.018029] [<ffffffff8be4ce0e>] ? _
_fput+0xe/0x10
[2935240.023249] [<ffffffff8bcc2d30>] ? task_work_run+0xc0/0xe0
[2935240.028988] [<ffffffff8be50c3e>] SyS_newstat+0xe/0x10
[2935240.034292] [<ffffffff8c38dede>] system_call_fastpath+0x25/0x2a
[2935240.040457] Code: 48 c7 c6 5f 12 8d c0 48 c7 c7 c0 b4 8d c0 31 c0 e8 29 41 6f cb 31 c0 e9 fa f8 ff ff 44 89 e0 44 89 a3 90 02 00 00 e9 eb f8 ff ff <0f> 0b 49 8b 8c 24 c0 fc ff ff 4d 8b 84 24 c8 fc ff ff 4c 89 e2
[2935240.061008] RIP [<ffffffffc08ba1fd>] __prepare_send_request+0x7fd/0x830 [ceph]
[2935240.068516] RSP <ffff9b3053ee3af8>

Actions #1

Updated by Greg Farnum about 4 years ago

  • Project changed from CephFS to Linux kernel client
  • Category set to fs/ceph
Actions #2

Updated by Jeff Layton about 4 years ago

  • Status changed from New to Resolved

This is probably a duplicate of this bug, and should make the next RHEL7 update. The problem was fixed in mainline quite some time ago (see commit 1bcb344086f3ecf8d6705f6d708441baa823beb3). Marking this resolved.

https://bugzilla.redhat.com/show_bug.cgi?id=1699402

Actions

Also available in: Atom PDF