Bug #3040
btrfs: recursive locking of sb_internal#2
Status:
Resolved
Priority:
High
Assignee:
-
Category:
-
Target version:
-
% Done:
0%
Source:
Development
Description
2012-08-24T15:29:12.929530-07:00 plana17 kernel: [ 378.433842] =============================================
2012-08-24T15:29:12.929537-07:00 plana17 kernel: [ 378.433842] [ INFO: possible recursive locking detected ]
2012-08-24T15:29:12.929539-07:00 plana17 kernel: [ 378.433845] 3.6.0-rc2-ceph-00143-g995fc06 #1 Not tainted
2012-08-24T15:29:12.929541-07:00 plana17 kernel: [ 378.433845] ---------------------------------------------
2012-08-24T15:29:12.929542-07:00 plana17 kernel: [ 378.433847] kworker/6:1/238 is trying to acquire lock:
2012-08-24T15:29:12.929546-07:00 plana17 kernel: [ 378.433872] (sb_internal#2){.+.+..}, at: [<ffffffffa0042b74>] start_transaction+0x124/0x430 [btrfs]
2012-08-24T15:29:12.929547-07:00 plana17 kernel: [ 378.433873]
2012-08-24T15:29:12.929548-07:00 plana17 kernel: [ 378.433873] but task is already holding lock:
2012-08-24T15:29:12.929553-07:00 plana17 kernel: [ 378.433890] (sb_internal#2){.+.+..}, at: [<ffffffffa0042590>] do_async_commit+0x0/0x80 [btrfs]
2012-08-24T15:29:12.929554-07:00 plana17 kernel: [ 378.433891]
2012-08-24T15:29:12.929556-07:00 plana17 kernel: [ 378.433891] other info that might help us debug this:
2012-08-24T15:29:12.929557-07:00 plana17 kernel: [ 378.433892] Possible unsafe locking scenario:
2012-08-24T15:29:12.929559-07:00 plana17 kernel: [ 378.433892]
2012-08-24T15:29:12.929560-07:00 plana17 kernel: [ 378.433892] CPU0
2012-08-24T15:29:12.929561-07:00 plana17 kernel: [ 378.433893] ----
2012-08-24T15:29:12.929562-07:00 plana17 kernel: [ 378.433895] lock(sb_internal#2);
2012-08-24T15:29:12.929566-07:00 plana17 kernel: [ 378.433897] lock(sb_internal#2);
2012-08-24T15:29:12.929567-07:00 plana17 kernel: [ 378.433898]
2012-08-24T15:29:12.929568-07:00 plana17 kernel: [ 378.433898] *** DEADLOCK ***
2012-08-24T15:29:12.929570-07:00 plana17 kernel: [ 378.433898]
2012-08-24T15:29:12.929571-07:00 plana17 kernel: [ 378.433898] May be due to missing lock nesting notation
2012-08-24T15:29:12.929572-07:00 plana17 kernel: [ 378.433898]
2012-08-24T15:29:12.929574-07:00 plana17 kernel: [ 378.433899] 3 locks held by kworker/6:1/238:
2012-08-24T15:29:12.929578-07:00 plana17 kernel: [ 378.433906] #0: (events){.+.+.+}, at: [<ffffffff810717d6>] process_one_work+0x136/0x5f0
2012-08-24T15:29:12.929581-07:00 plana17 kernel: [ 378.433911] #1: ((&(&ac->work)->work)){+.+...}, at: [<ffffffff810717d6>] process_one_work+0x136/0x5f0
2012-08-24T15:29:12.929583-07:00 plana17 kernel: [ 378.433929] #2: (sb_internal#2){.+.+..}, at: [<ffffffffa0042590>] do_async_commit+0x0/0x80 [btrfs]
2012-08-24T15:29:12.929585-07:00 plana17 kernel: [ 378.433932]
2012-08-24T15:29:12.929586-07:00 plana17 kernel: [ 378.433932] stack backtrace:
2012-08-24T15:29:12.929588-07:00 plana17 kernel: [ 378.433935] Pid: 238, comm: kworker/6:1 Not tainted 3.6.0-rc2-ceph-00143-g995fc06 #1
2012-08-24T15:29:12.929589-07:00 plana17 kernel: [ 378.433936] Call Trace:
2012-08-24T15:29:12.929593-07:00 plana17 kernel: [ 378.433941] [<ffffffff810b2032>] __lock_acquire+0x1512/0x1b90
2012-08-24T15:29:12.929595-07:00 plana17 kernel: [ 378.433944] [<ffffffff810ada73>] ? __bfs+0x23/0x270
2012-08-24T15:29:12.929598-07:00 plana17 kernel: [ 378.433961] [<ffffffffa0042b74>] ? start_transaction+0x124/0x430 [btrfs]
2012-08-24T15:29:12.929616-07:00 plana17 kernel: [ 378.433964] [<ffffffff810b2c82>] lock_acquire+0xa2/0x140
2012-08-24T15:29:12.929619-07:00 plana17 kernel: [ 378.433980] [<ffffffffa0042b74>] ? start_transaction+0x124/0x430 [btrfs]
2012-08-24T15:29:12.929621-07:00 plana17 kernel: [ 378.433982] [<ffffffff810b3546>] ? mark_held_locks+0x86/0x140
2012-08-24T15:29:12.929623-07:00 plana17 kernel: [ 378.433987] [<ffffffff8117dac6>] __sb_start_write+0xc6/0x1b0
2012-08-24T15:29:12.929625-07:00 plana17 kernel: [ 378.434003] [<ffffffffa0042b74>] ? start_transaction+0x124/0x430 [btrfs]
2012-08-24T15:29:12.929630-07:00 plana17 kernel: [ 378.434019] [<ffffffffa0042b74>] ? start_transaction+0x124/0x430 [btrfs]
2012-08-24T15:29:12.929632-07:00 plana17 kernel: [ 378.434022] [<ffffffff81172e75>] ? kmem_cache_alloc+0xb5/0x160
2012-08-24T15:29:12.929634-07:00 plana17 kernel: [ 378.434024] [<ffffffff81172f9b>] ? kmem_cache_free+0x7b/0x160
2012-08-24T15:29:12.929636-07:00 plana17 kernel: [ 378.434042] [<ffffffffa0058b48>] ? free_extent_state+0x58/0xd0 [btrfs]
2012-08-24T15:29:12.929639-07:00 plana17 kernel: [ 378.434058] [<ffffffffa0042b74>] start_transaction+0x124/0x430 [btrfs]
2012-08-24T15:29:12.929641-07:00 plana17 kernel: [ 378.434076] [<ffffffffa005940d>] ? __set_extent_bit+0x37d/0x4e0 [btrfs]
2012-08-24T15:29:12.929643-07:00 plana17 kernel: [ 378.434092] [<ffffffffa0042ed5>] btrfs_join_transaction+0x15/0x20 [btrfs]
2012-08-24T15:29:12.929647-07:00 plana17 kernel: [ 378.434109] [<ffffffffa00496b7>] cow_file_range+0x87/0x4a0 [btrfs]
2012-08-24T15:29:12.929649-07:00 plana17 kernel: [ 378.434114] [<ffffffff81634c6b>] ? _raw_spin_unlock+0x2b/0x40
2012-08-24T15:29:12.929651-07:00 plana17 kernel: [ 378.434131] [<ffffffffa004a80c>] run_delalloc_range+0x34c/0x370 [btrfs]
2012-08-24T15:29:12.929653-07:00 plana17 kernel: [ 378.434149] [<ffffffffa005cbb0>] __extent_writepage+0x5e0/0x770 [btrfs]
2012-08-24T15:29:12.929655-07:00 plana17 kernel: [ 378.434152] [<ffffffff810b3546>] ? mark_held_locks+0x86/0x140
2012-08-24T15:29:12.929657-07:00 plana17 kernel: [ 378.434155] [<ffffffff8112aa5e>] ? find_get_pages_tag+0x2e/0x1c0
2012-08-24T15:29:12.929660-07:00 plana17 kernel: [ 378.434174] [<ffffffffa005cffa>] extent_write_cache_pages.isra.25.constprop.39+0x2ba/0x410 [btrfs]
2012-08-24T15:29:12.929662-07:00 plana17 kernel: [ 378.434187] [<ffffffffa002f7cc>] ? btrfs_run_delayed_refs+0xac/0x550 [btrfs]
2012-08-24T15:29:12.929666-07:00 plana17 kernel: [ 378.434190] [<ffffffff81196117>] ? igrab+0x27/0x70
2012-08-24T15:29:12.929668-07:00 plana17 kernel: [ 378.434208] [<ffffffffa005d389>] extent_writepages+0x49/0x60 [btrfs]
2012-08-24T15:29:12.929670-07:00 plana17 kernel: [ 378.434224] [<ffffffffa0046a90>] ? btrfs_submit_direct+0x670/0x670 [btrfs]
2012-08-24T15:29:12.929673-07:00 plana17 kernel: [ 378.434240] [<ffffffffa00444c8>] btrfs_writepages+0x28/0x30 [btrfs]
2012-08-24T15:29:12.929674-07:00 plana17 kernel: [ 378.434243] [<ffffffff81136443>] do_writepages+0x23/0x40
2012-08-24T15:29:12.929676-07:00 plana17 kernel: [ 378.434247] [<ffffffff8112b839>] __filemap_fdatawrite_range+0x59/0x60
2012-08-24T15:29:12.929678-07:00 plana17 kernel: [ 378.434249] [<ffffffff8112c6ac>] filemap_flush+0x1c/0x20
2012-08-24T15:29:12.929682-07:00 plana17 kernel: [ 378.434266] [<ffffffffa0050b1e>] btrfs_start_delalloc_inodes+0xbe/0x200 [btrfs]
2012-08-24T15:29:12.929685-07:00 plana17 kernel: [ 378.434270] [<ffffffff8132babd>] ? do_raw_spin_unlock+0x5d/0xb0
2012-08-24T15:29:12.929687-07:00 plana17 kernel: [ 378.434286] [<ffffffffa0041ebd>] btrfs_commit_transaction+0x44d/0xb20 [btrfs]
2012-08-24T15:29:12.929709-07:00 plana17 kernel: [ 378.434290] [<ffffffff81079850>] ? __init_waitqueue_head+0x60/0x60
2012-08-24T15:29:12.929712-07:00 plana17 kernel: [ 378.434293] [<ffffffff810717d6>] ? process_one_work+0x136/0x5f0
2012-08-24T15:29:12.929714-07:00 plana17 kernel: [ 378.434308] [<ffffffffa00425f1>] do_async_commit+0x61/0x80 [btrfs]
2012-08-24T15:29:12.929716-07:00 plana17 kernel: [ 378.434324] [<ffffffffa0042590>] ? btrfs_commit_transaction+0xb20/0xb20 [btrfs]
2012-08-24T15:29:12.929718-07:00 plana17 kernel: [ 378.434327] [<ffffffff81071840>] process_one_work+0x1a0/0x5f0
2012-08-24T15:29:12.929722-07:00 plana17 kernel: [ 378.434330] [<ffffffff810717d6>] ? process_one_work+0x136/0x5f0
2012-08-24T15:29:12.929724-07:00 plana17 kernel: [ 378.434346] [<ffffffffa0042590>] ? btrfs_commit_transaction+0xb20/0xb20 [btrfs]
2012-08-24T15:29:12.929726-07:00 plana17 kernel: [ 378.434350] [<ffffffff8107360d>] worker_thread+0x18d/0x4c0
2012-08-24T15:29:12.929728-07:00 plana17 kernel: [ 378.434354] [<ffffffff81073480>] ? manage_workers.isra.22+0x2c0/0x2c0
2012-08-24T15:29:12.929730-07:00 plana17 kernel: [ 378.434356] [<ffffffff810791ee>] kthread+0xae/0xc0
2012-08-24T15:29:12.929732-07:00 plana17 kernel: [ 378.434359] [<ffffffff810b379d>] ? trace_hardirqs_on+0xd/0x10
2012-08-24T15:29:12.929734-07:00 plana17 kernel: [ 378.434363] [<ffffffff8163e744>] kernel_thread_helper+0x4/0x10
2012-08-24T15:29:12.929738-07:00 plana17 kernel: [ 378.434366] [<ffffffff81635430>] ? retint_restore_args+0x13/0x13
2012-08-24T15:29:12.929740-07:00 plana17 kernel: [ 378.434368] [<ffffffff81079140>] ? flush_kthread_work+0x1a0/0x1a0
2012-08-24T15:29:12.929742-07:00 plana17 kernel: [ 378.434371] [<ffffffff8163e740>] ? gs_change+0x13/0x13
Updated by Sage Weil over 11 years ago
disabled this check in teuthology/tasks/internal.py for now... revert that when this is resolved!
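For illustration only, here is a rough sketch of what a log check with a temporary suppression might look like. It is not the code in teuthology/tasks/internal.py. The function name find_lockdep_reports, the regular expressions, and the IGNORED_LOCKS set are all hypothetical. It scans syslog lines for the lockdep "possible recursive locking detected" banner, pulls out the lock class and acquire site from the next lock line, and skips lock classes on the ignore list.

```python
import re

# Hypothetical names throughout; this is not the teuthology implementation.
LOCKDEP_MARKER = re.compile(r"INFO: possible recursive locking detected")

# Matches lines such as:
#   (sb_internal#2){.+.+..}, at: [<ffffffffa0042b74>] start_transaction+0x124/0x430 [btrfs]
LOCK_LINE = re.compile(
    r"\((?P<lock>[^()\s]+)\)\{[^}]*\}, at: \[<[0-9a-f]+>\] (?P<site>\S+)"
)

# Assumption: the lock class from this bug is suppressed until the bug is fixed.
IGNORED_LOCKS = frozenset({"sb_internal#2"})


def find_lockdep_reports(lines, ignored=IGNORED_LOCKS):
    """Return (lock_class, acquire_site) for each recursive-locking report.

    Reports whose lock class is in `ignored` are dropped.
    """
    reports = []
    pending = False  # True after a banner, until the first lock line is seen
    for line in lines:
        if LOCKDEP_MARKER.search(line):
            pending = True
            continue
        if pending:
            match = LOCK_LINE.search(line)
            if match:
                pending = False
                if match.group("lock") not in ignored:
                    reports.append((match.group("lock"), match.group("site")))
    return reports
```

Passing ignored=frozenset() would correspond to reverting the suppression, so that the report above is flagged again.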