Bug #58756

closed

qa: error during scrub thrashing

Added by Jos Collin about 1 year ago. Updated about 1 year ago.

Status: Duplicate
Priority: Normal
Assignee: -
Category: -
Target version: -
% Done: 0%
Source:
Tags: quincy
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite: fs
Component(FS):
Labels (FS): qa-failure
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

The fwd_scrub task failed with "error during scrub thrashing" in [1]. The relevant teuthology log output follows the link.

[1] https://pulpito.ceph.com/yuriw-2023-02-16_19:08:52-fs-wip-yuri3-testing-2023-02-16-0752-quincy-distro-default-smithi/7176546/

2023-02-16T20:18:55.286 DEBUG:teuthology.run_tasks:Unwinding manager fwd_scrub
2023-02-16T20:18:55.319 INFO:tasks.fwd_scrub:joining ForwardScrubbers
2023-02-16T20:18:55.320 ERROR:teuthology.run_tasks:Manager failed: fwd_scrub
Traceback (most recent call last):
  File "/home/teuthworker/src/git.ceph.com_teuthology_fbbadb5ff5cfccce0d20e136f8956e65ec955359/teuthology/run_tasks.py", line 188, in run_tasks
    suppress = manager.__exit__(*exc_info)
  File "/usr/lib/python3.8/contextlib.py", line 120, in __exit__
    next(self.gen)
  File "/home/teuthworker/src/github.com_ceph_ceph-c_fd249f18f047d5b86a29796a05bfeba7a7666d84/qa/tasks/fwd_scrub.py", line 151, in task
    stop_all_fwd_scrubbers(ctx.ceph[config['cluster']].thrashers)
  File "/home/teuthworker/src/github.com_ceph_ceph-c_fd249f18f047d5b86a29796a05bfeba7a7666d84/qa/tasks/fwd_scrub.py", line 86, in stop_all_fwd_scrubbers
    raise RuntimeError(f"error during scrub thrashing: {thrasher.exception}")
RuntimeError: error during scrub thrashing: Command failed on smithi073 with status 13: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 120 ceph --cluster ceph tell mds.1:0 scrub start / force,recursive'
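
The traceback above suggests a common pattern: a background thrasher thread records its failure on an `exception` attribute, and the failure only surfaces when the task unwinds and joins the threads. The following is a minimal sketch of that pattern, not the actual `qa/tasks/fwd_scrub.py` code; the class `Thrasher`, the function `stop_all`, and the `work` callable are hypothetical names chosen for illustration.

```python
import threading


class Thrasher(threading.Thread):
    """Hypothetical worker that records its failure instead of raising it."""

    def __init__(self, work):
        super().__init__(daemon=True)
        self.work = work        # callable run in the background (assumed)
        self.exception = None   # set only if work() fails

    def run(self):
        try:
            self.work()
        except Exception as e:
            # Keep the error so the main thread can report it later.
            self.exception = e


def stop_all(thrashers):
    """Join every worker, then surface the first recorded failure."""
    for t in thrashers:
        t.join()
    for t in thrashers:
        if t.exception is not None:
            raise RuntimeError(f"error during scrub thrashing: {t.exception}")
```

Under this pattern the reported error time (here 20:18:55, during unwinding) can be much later than the moment the underlying command actually failed, so it may help to search the log for the earlier failure of the `scrub start` command itself.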
#1

Updated by Xiubo Li about 1 year ago

This should be the known issue tracked in https://tracker.ceph.com/issues/58564. The kernel log on smithi073 shows a lockdep warning:

2023-02-16T19:43:36.006343+00:00 smithi073 kernel:
2023-02-16T19:43:36.006470+00:00 smithi073 kernel: ======================================================
2023-02-16T19:43:36.006499+00:00 smithi073 kernel: WARNING: possible circular locking dependency detected
2023-02-16T19:43:36.006523+00:00 smithi073 kernel: 6.2.0-rc7-ceph-gd9ba97321a89 #1 Tainted: G S
2023-02-16T19:43:36.006546+00:00 smithi073 kernel: ------------------------------------------------------
2023-02-16T19:43:36.006569+00:00 smithi073 kernel: runc/73796 is trying to acquire lock:
2023-02-16T19:43:36.006592+00:00 smithi073 kernel: ffffffff82463f30 (cpu_hotplug_lock){++++}-{0:0}, at: static_key_slow_inc+0x12/0x30
2023-02-16T19:43:36.006625+00:00 smithi073 kernel: #012but task is already holding lock:
2023-02-16T19:43:36.006651+00:00 smithi073 kernel: ffffffff8256abe8 (freezer_mutex){+.+.}-{3:3}, at: freezer_write+0x91/0x560
2023-02-16T19:43:36.006674+00:00 smithi073 kernel: #012which lock already depends on the new lock.
2023-02-16T19:43:36.006698+00:00 smithi073 kernel: #012the existing dependency chain (in reverse order) is:
2023-02-16T19:43:36.006721+00:00 smithi073 kernel: #012-> #2 (freezer_mutex){+.+.}-{3:3}:
2023-02-16T19:43:36.006744+00:00 smithi073 kernel:       __mutex_lock+0x9c/0xf30
2023-02-16T19:43:36.006766+00:00 smithi073 kernel:       freezer_attach+0x30/0xf0
2023-02-16T19:43:36.006790+00:00 smithi073 kernel:       cgroup_migrate_execute+0x3f3/0x4c0
2023-02-16T19:43:36.006824+00:00 smithi073 kernel:       cgroup_attach_task+0x23a/0x3f0
2023-02-16T19:43:36.006852+00:00 smithi073 kernel:       __cgroup1_procs_write.constprop.12+0xfb/0x140
2023-02-16T19:43:36.006876+00:00 smithi073 kernel:       cgroup_file_write+0x95/0x230
2023-02-16T19:43:36.006900+00:00 smithi073 kernel:       kernfs_fop_write_iter+0x13b/0x1d0
2023-02-16T19:43:36.006922+00:00 smithi073 kernel:       vfs_write+0x348/0x4d0
2023-02-16T19:43:36.006945+00:00 smithi073 kernel:       ksys_write+0x60/0xe0
2023-02-16T19:43:36.006967+00:00 smithi073 kernel:       do_syscall_64+0x38/0x80
2023-02-16T19:43:36.006991+00:00 smithi073 kernel:       entry_SYSCALL_64_after_hwframe+0x63/0xcd
2023-02-16T19:43:36.007016+00:00 smithi073 kernel: #012-> #1 (cgroup_threadgroup_rwsem){++++}-{0:0}:
2023-02-16T19:43:36.007040+00:00 smithi073 kernel:       percpu_down_write+0x49/0x2d0
2023-02-16T19:43:36.007079+00:00 smithi073 kernel:       cgroup_procs_write_start+0x88/0x270
2023-02-16T19:43:36.007105+00:00 smithi073 kernel:       __cgroup1_procs_write.constprop.12+0x57/0x140
2023-02-16T19:43:36.007128+00:00 smithi073 kernel:       cgroup_file_write+0x95/0x230
2023-02-16T19:43:36.007149+00:00 smithi073 kernel:       kernfs_fop_write_iter+0x13b/0x1d0
2023-02-16T19:43:36.007171+00:00 smithi073 kernel:       vfs_write+0x348/0x4d0
2023-02-16T19:43:36.007193+00:00 smithi073 kernel:       ksys_write+0x60/0xe0
2023-02-16T19:43:36.007215+00:00 smithi073 kernel:       do_syscall_64+0x38/0x80
2023-02-16T19:43:36.007237+00:00 smithi073 kernel:       entry_SYSCALL_64_after_hwframe+0x63/0xcd
2023-02-16T19:43:36.007258+00:00 smithi073 kernel: #012-> #0 (cpu_hotplug_lock){++++}-{0:0}:
2023-02-16T19:43:36.007280+00:00 smithi073 kernel:       __lock_acquire+0x108b/0x1e40
2023-02-16T19:43:36.007302+00:00 smithi073 kernel:       lock_acquire+0xd8/0x300
2023-02-16T19:43:36.007326+00:00 smithi073 kernel:       cpus_read_lock+0x40/0xd0
2023-02-16T19:43:36.007348+00:00 smithi073 kernel:       static_key_slow_inc+0x12/0x30
2023-02-16T19:43:36.007370+00:00 smithi073 kernel:       freezer_apply_state+0x98/0xb0
2023-02-16T19:43:36.007393+00:00 smithi073 kernel:       freezer_write+0x327/0x560
2023-02-16T19:43:36.007415+00:00 smithi073 kernel:       cgroup_file_write+0x95/0x230
2023-02-16T19:43:36.007437+00:00 smithi073 kernel:       kernfs_fop_write_iter+0x13b/0x1d0
2023-02-16T19:43:36.007458+00:00 smithi073 kernel:       vfs_write+0x348/0x4d0
2023-02-16T19:43:36.007480+00:00 smithi073 kernel:       ksys_write+0x60/0xe0
2023-02-16T19:43:36.007501+00:00 smithi073 kernel:       do_syscall_64+0x38/0x80
2023-02-16T19:43:36.007522+00:00 smithi073 kernel:       entry_SYSCALL_64_after_hwframe+0x63/0xcd
2023-02-16T19:43:36.007544+00:00 smithi073 kernel: #012other info that might help us debug this:
2023-02-16T19:43:36.007565+00:00 smithi073 kernel: Chain exists of:#012  cpu_hotplug_lock --> cgroup_threadgroup_rwsem --> freezer_mutex
2023-02-16T19:43:36.007590+00:00 smithi073 kernel: Possible unsafe locking scenario:
2023-02-16T19:43:36.007613+00:00 smithi073 kernel:       CPU0                    CPU1
2023-02-16T19:43:36.007636+00:00 smithi073 kernel:       ----                    ----
2023-02-16T19:43:36.007657+00:00 smithi073 kernel:  lock(freezer_mutex);
2023-02-16T19:43:36.007679+00:00 smithi073 kernel:                               lock(cgroup_threadgroup_rwsem);
2023-02-16T19:43:36.007700+00:00 smithi073 kernel:                               lock(freezer_mutex);
2023-02-16T19:43:36.007723+00:00 smithi073 kernel:  lock(cpu_hotplug_lock);
2023-02-16T19:43:36.007746+00:00 smithi073 kernel: #012 *** DEADLOCK ***
2023-02-16T19:43:36.007768+00:00 smithi073 kernel: 5 locks held by runc/73796:
2023-02-16T19:43:36.007792+00:00 smithi073 kernel: #0: ffff88810e8104e8 (&f->f_pos_lock){+.+.}-{3:3}, at: __fdget_pos+0x48/0x60
2023-02-16T19:43:36.007842+00:00 smithi073 kernel: #1: ffff88810ed6e448 (sb_writers#6){.+.+}-{0:0}, at: ksys_write+0x60/0xe0
2023-02-16T19:43:36.007866+00:00 smithi073 kernel: #2: ffff88816f8fdc88 (&of->mutex){+.+.}-{3:3}, at: kernfs_fop_write_iter+0x108/0x1d0
2023-02-16T19:43:36.007889+00:00 smithi073 kernel: #3: ffff88810aa27c80 (kn->active#169){.+.+}-{0:0}, at: kernfs_fop_write_iter+0x111/0x1d0
2023-02-16T19:43:36.007912+00:00 smithi073 kernel: #4: ffffffff8256abe8 (freezer_mutex){+.+.}-{3:3}, at: freezer_write+0x91/0x560
2023-02-16T19:43:36.007935+00:00 smithi073 kernel: #012stack backtrace:
2023-02-16T19:43:36.007958+00:00 smithi073 kernel: CPU: 1 PID: 73796 Comm: runc Tainted: G S                 6.2.0-rc7-ceph-gd9ba97321a89 #1
2023-02-16T19:43:36.007981+00:00 smithi073 kernel: Hardware name: Supermicro SYS-5018R-WR/X10SRW-F, BIOS 2.0 12/17/2015
2023-02-16T19:43:36.008004+00:00 smithi073 kernel: Call Trace:
2023-02-16T19:43:36.008029+00:00 smithi073 kernel: <TASK>
2023-02-16T19:43:36.008053+00:00 smithi073 kernel: dump_stack_lvl+0x59/0x71
2023-02-16T19:43:36.008076+00:00 smithi073 kernel: check_noncircular+0xfe/0x110
2023-02-16T19:43:36.008099+00:00 smithi073 kernel: ? __lock_acquire+0xf12/0x1e40
2023-02-16T19:43:36.008138+00:00 smithi073 kernel: __lock_acquire+0x108b/0x1e40
2023-02-16T19:43:36.008162+00:00 smithi073 kernel: lock_acquire+0xd8/0x300
2023-02-16T19:43:36.008186+00:00 smithi073 kernel: ? static_key_slow_inc+0x12/0x30
2023-02-16T19:43:36.008210+00:00 smithi073 kernel: ? freezer_write+0x1d6/0x560
2023-02-16T19:43:36.008231+00:00 smithi073 kernel: cpus_read_lock+0x40/0xd0
2023-02-16T19:43:36.008253+00:00 smithi073 kernel: ? static_key_slow_inc+0x12/0x30
2023-02-16T19:43:36.008274+00:00 smithi073 kernel: static_key_slow_inc+0x12/0x30
2023-02-16T19:43:36.008296+00:00 smithi073 kernel: freezer_apply_state+0x98/0xb0
2023-02-16T19:43:36.008317+00:00 smithi073 kernel: freezer_write+0x327/0x560
2023-02-16T19:43:36.008339+00:00 smithi073 kernel: cgroup_file_write+0x95/0x230
2023-02-16T19:43:36.008360+00:00 smithi073 kernel: kernfs_fop_write_iter+0x13b/0x1d0
2023-02-16T19:43:36.008382+00:00 smithi073 kernel: vfs_write+0x348/0x4d0
2023-02-16T19:43:36.008406+00:00 smithi073 kernel: ksys_write+0x60/0xe0
2023-02-16T19:43:36.008428+00:00 smithi073 kernel: do_syscall_64+0x38/0x80
2023-02-16T19:43:36.008450+00:00 smithi073 kernel: entry_SYSCALL_64_after_hwframe+0x63/0xcd
2023-02-16T19:43:36.008472+00:00 smithi073 kernel: RIP: 0033:0x55d73f7179db
2023-02-16T19:43:36.008494+00:00 smithi073 kernel: Code: fa ff eb bf e8 e6 b4 fa ff e9 61 ff ff ff cc e8 db 83 fa ff 48 8b 7c 24 10 48 8b 74 24 18 48 8b 54 24 20 48 8b 44 24 08 0f 05 <48> 3d 01 f0 ff ff 76 20 48 c7 44 24 28 ff ff ff ff 48 c7 44 24 30
2023-02-16T19:43:36.008518+00:00 smithi073 kernel: RSP: 002b:000000c0001b82e0 EFLAGS: 00000206 ORIG_RAX: 0000000000000001
2023-02-16T19:43:36.008540+00:00 smithi073 kernel: RAX: ffffffffffffffda RBX: 000000c00002e800 RCX: 000055d73f7179db
2023-02-16T19:43:36.008563+00:00 smithi073 kernel: RDX: 0000000000000006 RSI: 000000c0001b8518 RDI: 000000000000000d
2023-02-16T19:43:36.008585+00:00 smithi073 kernel: RBP: 000000c0001b8330 R08: 000000c0001b8301 R09: 0000000000000004
2023-02-16T19:43:36.008609+00:00 smithi073 kernel: R10: 00007fa09c042fd8 R11: 0000000000000206 R12: 00000000000000f2
2023-02-16T19:43:36.008631+00:00 smithi073 kernel: R13: 0000000000000000 R14: 000055d73fb9b35e R15: 0000000000000000
2023-02-16T19:43:36.008654+00:00 smithi073 kernel: </TASK>
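
To make the lockdep report above easier to read: the "Chain exists of" line lists lock-ordering edges already observed (cpu_hotplug_lock before cgroup_threadgroup_rwsem before freezer_mutex), and the new acquisition (cpu_hotplug_lock while holding freezer_mutex) adds an edge that closes a cycle. The sketch below only illustrates that idea with a plain depth-first search over a lock-order graph; it is not how the kernel's lockdep is implemented, and `find_cycle` is a hypothetical helper.

```python
from collections import defaultdict


def find_cycle(edges):
    """Return one cycle in a directed graph as a list of nodes, or None.

    Each edge (held, wanted) means 'wanted' was acquired while 'held'
    was already held.
    """
    graph = defaultdict(list)
    for held, wanted in edges:
        graph[held].append(wanted)

    visiting, done = [], set()

    def dfs(node):
        if node in visiting:
            # Back edge found: slice out the cycle and close it.
            return visiting[visiting.index(node):] + [node]
        if node in done:
            return None
        visiting.append(node)
        for nxt in graph[node]:
            cycle = dfs(nxt)
            if cycle:
                return cycle
        visiting.pop()
        done.add(node)
        return None

    for start in list(graph):
        cycle = dfs(start)
        if cycle:
            return cycle
    return None


# Existing chain taken from the report above.
existing = [
    ("cpu_hotplug_lock", "cgroup_threadgroup_rwsem"),
    ("cgroup_threadgroup_rwsem", "freezer_mutex"),
]
# New acquisition: runc holds freezer_mutex and wants cpu_hotplug_lock.
new_edge = ("freezer_mutex", "cpu_hotplug_lock")

print(find_cycle(existing))               # None
print(find_cycle(existing + [new_edge]))
```

With only the existing chain there is no cycle; adding the new edge yields the loop cpu_hotplug_lock, cgroup_threadgroup_rwsem, freezer_mutex, cpu_hotplug_lock, which is the ordering lockdep flags as a possible deadlock.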

#2

Updated by Jos Collin about 1 year ago

  • Status changed from New to Duplicate