Bug #3040

closed

btrfs: recursive locking of sb_internal#2

Added by Sage Weil over 11 years ago. Updated over 11 years ago.

Status:
Resolved
Priority:
High
Assignee:
-
Category:
-
Target version:
-
% Done:
0%

Source:
Development

Description

2012-08-24T15:29:12.929530-07:00 plana17 kernel: [  378.433842] =============================================
2012-08-24T15:29:12.929537-07:00 plana17 kernel: [  378.433842] [ INFO: possible recursive locking detected ]
2012-08-24T15:29:12.929539-07:00 plana17 kernel: [  378.433845] 3.6.0-rc2-ceph-00143-g995fc06 #1 Not tainted
2012-08-24T15:29:12.929541-07:00 plana17 kernel: [  378.433845] ---------------------------------------------
2012-08-24T15:29:12.929542-07:00 plana17 kernel: [  378.433847] kworker/6:1/238 is trying to acquire lock:
2012-08-24T15:29:12.929546-07:00 plana17 kernel: [  378.433872]  (sb_internal#2){.+.+..}, at: [<ffffffffa0042b74>] start_transaction+0x124/0x430 [btrfs]
2012-08-24T15:29:12.929547-07:00 plana17 kernel: [  378.433873] 
2012-08-24T15:29:12.929548-07:00 plana17 kernel: [  378.433873] but task is already holding lock:
2012-08-24T15:29:12.929553-07:00 plana17 kernel: [  378.433890]  (sb_internal#2){.+.+..}, at: [<ffffffffa0042590>] do_async_commit+0x0/0x80 [btrfs]
2012-08-24T15:29:12.929554-07:00 plana17 kernel: [  378.433891] 
2012-08-24T15:29:12.929556-07:00 plana17 kernel: [  378.433891] other info that might help us debug this:
2012-08-24T15:29:12.929557-07:00 plana17 kernel: [  378.433892]  Possible unsafe locking scenario:
2012-08-24T15:29:12.929559-07:00 plana17 kernel: [  378.433892] 
2012-08-24T15:29:12.929560-07:00 plana17 kernel: [  378.433892]        CPU0
2012-08-24T15:29:12.929561-07:00 plana17 kernel: [  378.433893]        ----
2012-08-24T15:29:12.929562-07:00 plana17 kernel: [  378.433895]   lock(sb_internal#2);
2012-08-24T15:29:12.929566-07:00 plana17 kernel: [  378.433897]   lock(sb_internal#2);
2012-08-24T15:29:12.929567-07:00 plana17 kernel: [  378.433898] 
2012-08-24T15:29:12.929568-07:00 plana17 kernel: [  378.433898]  *** DEADLOCK ***
2012-08-24T15:29:12.929570-07:00 plana17 kernel: [  378.433898] 
2012-08-24T15:29:12.929571-07:00 plana17 kernel: [  378.433898]  May be due to missing lock nesting notation
2012-08-24T15:29:12.929572-07:00 plana17 kernel: [  378.433898] 
2012-08-24T15:29:12.929574-07:00 plana17 kernel: [  378.433899] 3 locks held by kworker/6:1/238:
2012-08-24T15:29:12.929578-07:00 plana17 kernel: [  378.433906]  #0:  (events){.+.+.+}, at: [<ffffffff810717d6>] process_one_work+0x136/0x5f0
2012-08-24T15:29:12.929581-07:00 plana17 kernel: [  378.433911]  #1:  ((&(&ac->work)->work)){+.+...}, at: [<ffffffff810717d6>] process_one_work+0x136/0x5f0
2012-08-24T15:29:12.929583-07:00 plana17 kernel: [  378.433929]  #2:  (sb_internal#2){.+.+..}, at: [<ffffffffa0042590>] do_async_commit+0x0/0x80 [btrfs]
2012-08-24T15:29:12.929585-07:00 plana17 kernel: [  378.433932] 
2012-08-24T15:29:12.929586-07:00 plana17 kernel: [  378.433932] stack backtrace:
2012-08-24T15:29:12.929588-07:00 plana17 kernel: [  378.433935] Pid: 238, comm: kworker/6:1 Not tainted 3.6.0-rc2-ceph-00143-g995fc06 #1
2012-08-24T15:29:12.929589-07:00 plana17 kernel: [  378.433936] Call Trace:
2012-08-24T15:29:12.929593-07:00 plana17 kernel: [  378.433941]  [<ffffffff810b2032>] __lock_acquire+0x1512/0x1b90
2012-08-24T15:29:12.929595-07:00 plana17 kernel: [  378.433944]  [<ffffffff810ada73>] ? __bfs+0x23/0x270
2012-08-24T15:29:12.929598-07:00 plana17 kernel: [  378.433961]  [<ffffffffa0042b74>] ? start_transaction+0x124/0x430 [btrfs]
2012-08-24T15:29:12.929616-07:00 plana17 kernel: [  378.433964]  [<ffffffff810b2c82>] lock_acquire+0xa2/0x140
2012-08-24T15:29:12.929619-07:00 plana17 kernel: [  378.433980]  [<ffffffffa0042b74>] ? start_transaction+0x124/0x430 [btrfs]
2012-08-24T15:29:12.929621-07:00 plana17 kernel: [  378.433982]  [<ffffffff810b3546>] ? mark_held_locks+0x86/0x140
2012-08-24T15:29:12.929623-07:00 plana17 kernel: [  378.433987]  [<ffffffff8117dac6>] __sb_start_write+0xc6/0x1b0
2012-08-24T15:29:12.929625-07:00 plana17 kernel: [  378.434003]  [<ffffffffa0042b74>] ? start_transaction+0x124/0x430 [btrfs]
2012-08-24T15:29:12.929630-07:00 plana17 kernel: [  378.434019]  [<ffffffffa0042b74>] ? start_transaction+0x124/0x430 [btrfs]
2012-08-24T15:29:12.929632-07:00 plana17 kernel: [  378.434022]  [<ffffffff81172e75>] ? kmem_cache_alloc+0xb5/0x160
2012-08-24T15:29:12.929634-07:00 plana17 kernel: [  378.434024]  [<ffffffff81172f9b>] ? kmem_cache_free+0x7b/0x160
2012-08-24T15:29:12.929636-07:00 plana17 kernel: [  378.434042]  [<ffffffffa0058b48>] ? free_extent_state+0x58/0xd0 [btrfs]
2012-08-24T15:29:12.929639-07:00 plana17 kernel: [  378.434058]  [<ffffffffa0042b74>] start_transaction+0x124/0x430 [btrfs]
2012-08-24T15:29:12.929641-07:00 plana17 kernel: [  378.434076]  [<ffffffffa005940d>] ? __set_extent_bit+0x37d/0x4e0 [btrfs]
2012-08-24T15:29:12.929643-07:00 plana17 kernel: [  378.434092]  [<ffffffffa0042ed5>] btrfs_join_transaction+0x15/0x20 [btrfs]
2012-08-24T15:29:12.929647-07:00 plana17 kernel: [  378.434109]  [<ffffffffa00496b7>] cow_file_range+0x87/0x4a0 [btrfs]
2012-08-24T15:29:12.929649-07:00 plana17 kernel: [  378.434114]  [<ffffffff81634c6b>] ? _raw_spin_unlock+0x2b/0x40
2012-08-24T15:29:12.929651-07:00 plana17 kernel: [  378.434131]  [<ffffffffa004a80c>] run_delalloc_range+0x34c/0x370 [btrfs]
2012-08-24T15:29:12.929653-07:00 plana17 kernel: [  378.434149]  [<ffffffffa005cbb0>] __extent_writepage+0x5e0/0x770 [btrfs]
2012-08-24T15:29:12.929655-07:00 plana17 kernel: [  378.434152]  [<ffffffff810b3546>] ? mark_held_locks+0x86/0x140
2012-08-24T15:29:12.929657-07:00 plana17 kernel: [  378.434155]  [<ffffffff8112aa5e>] ? find_get_pages_tag+0x2e/0x1c0
2012-08-24T15:29:12.929660-07:00 plana17 kernel: [  378.434174]  [<ffffffffa005cffa>] extent_write_cache_pages.isra.25.constprop.39+0x2ba/0x410 [btrfs]
2012-08-24T15:29:12.929662-07:00 plana17 kernel: [  378.434187]  [<ffffffffa002f7cc>] ? btrfs_run_delayed_refs+0xac/0x550 [btrfs]
2012-08-24T15:29:12.929666-07:00 plana17 kernel: [  378.434190]  [<ffffffff81196117>] ? igrab+0x27/0x70
2012-08-24T15:29:12.929668-07:00 plana17 kernel: [  378.434208]  [<ffffffffa005d389>] extent_writepages+0x49/0x60 [btrfs]
2012-08-24T15:29:12.929670-07:00 plana17 kernel: [  378.434224]  [<ffffffffa0046a90>] ? btrfs_submit_direct+0x670/0x670 [btrfs]
2012-08-24T15:29:12.929673-07:00 plana17 kernel: [  378.434240]  [<ffffffffa00444c8>] btrfs_writepages+0x28/0x30 [btrfs]
2012-08-24T15:29:12.929674-07:00 plana17 kernel: [  378.434243]  [<ffffffff81136443>] do_writepages+0x23/0x40
2012-08-24T15:29:12.929676-07:00 plana17 kernel: [  378.434247]  [<ffffffff8112b839>] __filemap_fdatawrite_range+0x59/0x60
2012-08-24T15:29:12.929678-07:00 plana17 kernel: [  378.434249]  [<ffffffff8112c6ac>] filemap_flush+0x1c/0x20
2012-08-24T15:29:12.929682-07:00 plana17 kernel: [  378.434266]  [<ffffffffa0050b1e>] btrfs_start_delalloc_inodes+0xbe/0x200 [btrfs]
2012-08-24T15:29:12.929685-07:00 plana17 kernel: [  378.434270]  [<ffffffff8132babd>] ? do_raw_spin_unlock+0x5d/0xb0
2012-08-24T15:29:12.929687-07:00 plana17 kernel: [  378.434286]  [<ffffffffa0041ebd>] btrfs_commit_transaction+0x44d/0xb20 [btrfs]
2012-08-24T15:29:12.929709-07:00 plana17 kernel: [  378.434290]  [<ffffffff81079850>] ? __init_waitqueue_head+0x60/0x60
2012-08-24T15:29:12.929712-07:00 plana17 kernel: [  378.434293]  [<ffffffff810717d6>] ? process_one_work+0x136/0x5f0
2012-08-24T15:29:12.929714-07:00 plana17 kernel: [  378.434308]  [<ffffffffa00425f1>] do_async_commit+0x61/0x80 [btrfs]
2012-08-24T15:29:12.929716-07:00 plana17 kernel: [  378.434324]  [<ffffffffa0042590>] ? btrfs_commit_transaction+0xb20/0xb20 [btrfs]
2012-08-24T15:29:12.929718-07:00 plana17 kernel: [  378.434327]  [<ffffffff81071840>] process_one_work+0x1a0/0x5f0
2012-08-24T15:29:12.929722-07:00 plana17 kernel: [  378.434330]  [<ffffffff810717d6>] ? process_one_work+0x136/0x5f0
2012-08-24T15:29:12.929724-07:00 plana17 kernel: [  378.434346]  [<ffffffffa0042590>] ? btrfs_commit_transaction+0xb20/0xb20 [btrfs]
2012-08-24T15:29:12.929726-07:00 plana17 kernel: [  378.434350]  [<ffffffff8107360d>] worker_thread+0x18d/0x4c0
2012-08-24T15:29:12.929728-07:00 plana17 kernel: [  378.434354]  [<ffffffff81073480>] ? manage_workers.isra.22+0x2c0/0x2c0
2012-08-24T15:29:12.929730-07:00 plana17 kernel: [  378.434356]  [<ffffffff810791ee>] kthread+0xae/0xc0
2012-08-24T15:29:12.929732-07:00 plana17 kernel: [  378.434359]  [<ffffffff810b379d>] ? trace_hardirqs_on+0xd/0x10
2012-08-24T15:29:12.929734-07:00 plana17 kernel: [  378.434363]  [<ffffffff8163e744>] kernel_thread_helper+0x4/0x10
2012-08-24T15:29:12.929738-07:00 plana17 kernel: [  378.434366]  [<ffffffff81635430>] ? retint_restore_args+0x13/0x13
2012-08-24T15:29:12.929740-07:00 plana17 kernel: [  378.434368]  [<ffffffff81079140>] ? flush_kthread_work+0x1a0/0x1a0
2012-08-24T15:29:12.929742-07:00 plana17 kernel: [  378.434371]  [<ffffffff8163e740>] ? gs_change+0x13/0x13
Actions #1

Updated by Sage Weil over 11 years ago

Disabled this check in teuthology/tasks/internal.py for now. Revert that once this is resolved!

Actions #2

Updated by Sage Weil over 11 years ago

  • Status changed from New to 7
Actions #3

Updated by Sage Weil over 11 years ago

  • Status changed from 7 to Resolved