Project

General

Profile

Actions

Bug #5876

closed

Assertion failure in rbd_img_obj_callback() : rbd_assert(which >= img_request->next_completion);

Added by Olivier Bonvalet over 10 years ago. Updated almost 10 years ago.

Status:
Resolved
Priority:
High
Assignee:
Target version:
-
% Done:

0%

Source:
Community (user)
Tags:
Backport:
Regression:
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Hi,

with CuttleFish with RBD kernel client (from Linux 3.9.11), I have this kernel BUG :

Aug  3 03:51:35 murmillia kernel: [772458.641942] Assertion failure in rbd_img_obj_callback() at line 1708:
Aug  3 03:51:35 murmillia kernel: [772458.641942] 
Aug  3 03:51:35 murmillia kernel: [772458.641942]     rbd_assert(which >= img_request->next_completion);
Aug  3 03:51:35 murmillia kernel: [772458.641942] 
Aug  3 03:51:35 murmillia kernel: [772458.642022] ------------[ cut here ]------------
Aug  3 03:51:35 murmillia kernel: [772458.642038] kernel BUG at drivers/block/rbd.c:1708!
Aug  3 03:51:35 murmillia kernel: [772458.642054] invalid opcode: 0000 [#1] SMP 
Aug  3 03:51:35 murmillia kernel: [772458.642103] Modules linked in: xt_physdev iptable_filter ip_tables x_tables cbc rbd libceph libcrc32c loop xen_gntdev bridge coretemp ghash_clmulni_intel aesni_intel aes_x86_64 xts lrw gf128mul ablk_helper cryptd iTCO_wdt gpio_ich iTCO_vendor_support microcode serio_raw sb_edac edac_core evdev i2c_i801 lpc_ich mfd_core ioatdma shpchp wmi ac button dm_mod hid_generic usbhid hid sg sd_mod crc_t10dif crc32c_intel isci megaraid_sas ahci libsas libahci ehci_pci libata ehci_hcd scsi_transport_sas usbcore scsi_mod usb_common igb i2c_algo_bit i2c_core ixgbe dca ptp pps_core mdio
Aug  3 03:51:35 murmillia kernel: [772458.642687] CPU 2 
Aug  3 03:51:35 murmillia kernel: [772458.642698] Pid: 20090, comm: kworker/2:2 Not tainted 3.9-dae-dom0 #1 Supermicro X9DRW-7TPF+/X9DRW-7TPF+
Aug  3 03:51:35 murmillia kernel: [772458.642794] RIP: e030:[<ffffffffa020f1d3>]  [<ffffffffa020f1d3>] rbd_img_obj_callback+0x103/0x29a [rbd]
Aug  3 03:51:35 murmillia kernel: [772458.642876] RSP: e02b:ffff880015e43cf8  EFLAGS: 00010282
Aug  3 03:51:35 murmillia kernel: [772458.642916] RAX: 0000000000000070 RBX: ffff88001586f5c0 RCX: 0000000000000000
Aug  3 03:51:35 murmillia kernel: [772458.642982] RDX: ffff88003f84e8f0 RSI: ffff88003f84dea8 RDI: ffff880015e402b8
Aug  3 03:51:35 murmillia kernel: [772458.643048] RBP: 0000000000000001 R08: 0000000000000000 R09: 0000000000000000
Aug  3 03:51:35 murmillia kernel: [772458.643113] R10: 0000000000000000 R11: 00000000000002d5 R12: ffff880010ca7540
Aug  3 03:51:35 murmillia kernel: [772458.643179] R13: ffff88001c49e030 R14: 0000000000000000 R15: ffff880001c78720
Aug  3 03:51:35 murmillia kernel: [772458.643247] FS:  00007f4764e28700(0000) GS:ffff88003f840000(0000) knlGS:0000000000000000
Aug  3 03:51:35 murmillia kernel: [772458.643314] CS:  e033 DS: 0000 ES: 0000 CR0: 0000000080050033
Aug  3 03:51:35 murmillia kernel: [772458.643356] CR2: 00007f4764e319b8 CR3: 000000000160c000 CR4: 0000000000042660
Aug  3 03:51:35 murmillia kernel: [772458.643421] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Aug  3 03:51:35 murmillia kernel: [772458.643485] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Aug  3 03:51:35 murmillia kernel: [772458.643551] Process kworker/2:2 (pid: 20090, threadinfo ffff880015e42000, task ffff880015e44fe0)
Aug  3 03:51:35 murmillia kernel: [772458.643618] Stack:
Aug  3 03:51:35 murmillia kernel: [772458.643649]  000000000000a000 ffff88001486b480 ffff8800360d9400 ffff8800360d9408
Aug  3 03:51:35 murmillia kernel: [772458.643730]  ffff88001c49e030 0000000000000000 ffff880001c78720 ffffffffa029fd72
Aug  3 03:51:35 murmillia kernel: [772458.643811]  0000000000000015 ffff880001c78778 0020917c00000001 0001da5e00000000
Aug  3 03:51:35 murmillia kernel: [772458.643893] Call Trace:
Aug  3 03:51:35 murmillia kernel: [772458.643933]  [<ffffffffa029fd72>] ? dispatch+0x424/0x590 [libceph]
Aug  3 03:51:35 murmillia kernel: [772458.643980]  [<ffffffffa029ac2f>] ? con_work+0x104d/0x1d05 [libceph]
Aug  3 03:51:35 murmillia kernel: [772458.644026]  [<ffffffff8100721f>] ? __switch_to+0x13e/0x3c0
Aug  3 03:51:35 murmillia kernel: [772458.644069]  [<ffffffff8104b2a6>] ? mmdrop+0xd/0x1c
Aug  3 03:51:35 murmillia kernel: [772458.644109]  [<ffffffff8104be3a>] ? finish_task_switch+0x50/0x8a
Aug  3 03:51:35 murmillia kernel: [772458.644152]  [<ffffffff810410ce>] ? process_one_work+0x156/0x208
Aug  3 03:51:35 murmillia kernel: [772458.644195]  [<ffffffff810427a0>] ? worker_thread+0x114/0x1bb
Aug  3 03:51:35 murmillia kernel: [772458.644237]  [<ffffffff8104268c>] ? manage_workers+0x202/0x202
Aug  3 03:51:35 murmillia kernel: [772458.644279]  [<ffffffff81045711>] ? kthread+0x7d/0x85
Aug  3 03:51:35 murmillia kernel: [772458.644319]  [<ffffffff81045694>] ? __kthread_parkme+0x59/0x59
Aug  3 03:51:35 murmillia kernel: [772458.644364]  [<ffffffff81356e3c>] ? ret_from_fork+0x7c/0xb0
Aug  3 03:51:35 murmillia kernel: [772458.644405]  [<ffffffff81045694>] ? __kthread_parkme+0x59/0x59
Aug  3 03:51:35 murmillia kernel: [772458.644447] Code: d7 13 e1 0f 0b 3b 6b 34 73 23 48 c7 c1 4a 34 21 a0 ba ac 06 00 00 31 c0 48 c7 c6 b0 3f 21 a0 48 c7 c7 bf 30 21 a0 e8 4e d7 13 e1 <0f> 0b 4c 8d 73 30 41 b5 01 4c 89 f7 e8 f9 2d 14 e1 3b 6b 34 0f 
Aug  3 03:51:35 murmillia kernel: [772458.644851] RIP  [<ffffffffa020f1d3>] rbd_img_obj_callback+0x103/0x29a [rbd]
Aug  3 03:51:35 murmillia kernel: [772458.644901]  RSP <ffff880015e43cf8>
Aug  3 03:51:35 murmillia kernel: [772458.645328] ---[ end trace 2a2a66811d33dc9e ]---

followed by :

Aug  3 03:51:35 murmillia kernel: [772458.648317] BUG: unable to handle kernel paging request at ffffffffffffffd8
Aug  3 03:52:45 murmillia kernel: [772458.648485] IP: [<ffffffff81045a20>] kthread_data+0x7/0xc
Aug  3 03:52:45 murmillia kernel: [772458.648611] PGD 160f067 PUD 1611067 PMD 0 
Aug  3 03:52:45 murmillia kernel: [772458.648812] Oops: 0000 [#2] SMP 
Aug  3 03:52:45 murmillia kernel: [772458.648967] Modules linked in: xt_physdev iptable_filter ip_tables x_tables cbc rbd libceph libcrc32c loop xen_gntdev bridge coretemp ghash_clmulni_intel aesni_intel aes_x86_64 xts lrw gf128mul ablk_helper cryptd iTCO_wdt gpio_ich iTCO_vendor_support microcode serio_raw sb_edac edac_core evdev i2c_i801 lpc_ich mfd_core ioatdma shpchp wmi ac button dm_mod hid_generic usbhid hid sg sd_mod crc_t10dif crc32c_intel isci megaraid_sas ahci libsas libahci ehci_pci libata ehci_hcd scsi_transport_sas usbcore scsi_mod usb_common igb i2c_algo_bit i2c_core ixgbe dca ptp pps_core mdio
Aug  3 03:52:45 murmillia kernel: [772458.652063] CPU 2 
Aug  3 03:52:45 murmillia kernel: [772458.652125] Pid: 20090, comm: kworker/2:2 Tainted: G      D      3.9-dae-dom0 #1 Supermicro X9DRW-7TPF+/X9DRW-7TPF+
Aug  3 03:52:45 murmillia kernel: [772458.652307] RIP: e030:[<ffffffff81045a20>]  [<ffffffff81045a20>] kthread_data+0x7/0xc
Aug  3 03:52:45 murmillia kernel: [772458.652458] RSP: e02b:ffff880015e43ab0  EFLAGS: 00010002
Aug  3 03:52:45 murmillia kernel: [772458.652535] RAX: 0000000000000000 RBX: ffff88003f852b00 RCX: ffff88003f852b70
Aug  3 03:52:45 murmillia kernel: [772458.652637] RDX: 0000000000000001 RSI: 0000000000000002 RDI: ffff880015e44fe0
Aug  3 03:52:45 murmillia kernel: [772458.652743] RBP: 0000000000000002 R08: ffffffff817b5910 R09: 0000000000000002
Aug  3 03:52:45 murmillia kernel: [772458.652850] R10: 000000000000b7ec R11: ffff880015e44fe0 R12: ffff880015e45300
Aug  3 03:52:45 murmillia kernel: [772458.652964] R13: ffff88003a349510 R14: 0000000000000002 R15: ffff880015e44fd0
Aug  3 03:52:45 murmillia kernel: [772458.653073] FS:  00007f4764e28700(0000) GS:ffff88003f840000(0000) knlGS:0000000000000000
Aug  3 03:52:45 murmillia kernel: [772458.653176] CS:  e033 DS: 0000 ES: 0000 CR0: 0000000080050033
Aug  3 03:52:45 murmillia kernel: [772458.653261] CR2: ffffffffffffffd8 CR3: 000000000160c000 CR4: 0000000000042660
Aug  3 03:52:45 murmillia kernel: [772458.653370] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Aug  3 03:52:45 murmillia kernel: [772458.653478] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Aug  3 03:52:45 murmillia kernel: [772458.653580] Process kworker/2:2 (pid: 20090, threadinfo ffff880015e42000, task ffff880015e44fe0)
Aug  3 03:52:45 murmillia kernel: [772458.653686] Stack:
Aug  3 03:52:45 murmillia kernel: [772458.653755]  ffffffff810428a1 ffff88003f852b00 ffff880015e44fe0 ffffffff81351294
Aug  3 03:52:45 murmillia kernel: [772458.654031]  0000000000012b00 ffff880015e43fd8 ffff880015e43fd8 ffff880015e44fe0
Aug  3 03:52:45 murmillia kernel: [772458.654315]  ffff880015e45520 0000000000000001 ffff88003a349510 ffff880015e45208
Aug  3 03:52:45 murmillia kernel: [772458.654585] Call Trace:
Aug  3 03:52:45 murmillia kernel: [772458.654660]  [<ffffffff810428a1>] ? wq_worker_sleeping+0x9/0x58
Aug  3 03:52:45 murmillia kernel: [772458.654743]  [<ffffffff81351294>] ? __schedule+0x109/0x47c
Aug  3 03:52:45 murmillia kernel: [772458.654823]  [<ffffffff81032c9b>] ? do_exit+0x8e2/0x8e4
Aug  3 03:52:45 murmillia kernel: [772458.654901]  [<ffffffff81352fd8>] ? oops_end+0x96/0x99
Aug  3 03:52:45 murmillia kernel: [772458.654987]  [<ffffffff8100864f>] ? do_invalid_op+0x84/0x8b
Aug  3 03:52:45 murmillia kernel: [772458.655077]  [<ffffffffa020f1d3>] ? rbd_img_obj_callback+0x103/0x29a [rbd]
Aug  3 03:52:45 murmillia kernel: [772458.655170]  [<ffffffff81005952>] ? check_events+0x12/0x20
Aug  3 03:52:45 murmillia kernel: [772458.655249]  [<ffffffff8100593f>] ? xen_restore_fl_direct_reloc+0x4/0x4
Aug  3 03:52:45 murmillia kernel: [772458.655332]  [<ffffffff8102f3d9>] ? arch_local_irq_restore+0x7/0x8
Aug  3 03:52:45 murmillia kernel: [772458.655412]  [<ffffffff81030ba8>] ? vprintk_emit+0x364/0x388
Aug  3 03:52:45 murmillia kernel: [772458.655492]  [<ffffffff8135801e>] ? invalid_op+0x1e/0x30
Aug  3 03:52:45 murmillia kernel: [772458.655572]  [<ffffffffa020f1d3>] ? rbd_img_obj_callback+0x103/0x29a [rbd]
Aug  3 03:52:45 murmillia kernel: [772458.655657]  [<ffffffffa020f1d3>] ? rbd_img_obj_callback+0x103/0x29a [rbd]
Aug  3 03:52:45 murmillia kernel: [772458.655749]  [<ffffffffa029fd72>] ? dispatch+0x424/0x590 [libceph]
Aug  3 03:52:45 murmillia kernel: [772458.655843]  [<ffffffffa029ac2f>] ? con_work+0x104d/0x1d05 [libceph]
Aug  3 03:52:45 murmillia kernel: [772458.655938]  [<ffffffff8100721f>] ? __switch_to+0x13e/0x3c0
Aug  3 03:52:45 murmillia kernel: [772458.656022]  [<ffffffff8104b2a6>] ? mmdrop+0xd/0x1c
Aug  3 03:52:45 murmillia kernel: [772458.656099]  [<ffffffff8104be3a>] ? finish_task_switch+0x50/0x8a
Aug  3 03:52:45 murmillia kernel: [772458.656182]  [<ffffffff810410ce>] ? process_one_work+0x156/0x208
Aug  3 03:52:45 murmillia kernel: [772458.656266]  [<ffffffff810427a0>] ? worker_thread+0x114/0x1bb
Aug  3 03:52:45 murmillia kernel: [772458.656347]  [<ffffffff8104268c>] ? manage_workers+0x202/0x202
Aug  3 03:52:45 murmillia kernel: [772458.656427]  [<ffffffff81045711>] ? kthread+0x7d/0x85
Aug  3 03:52:45 murmillia kernel: [772458.656504]  [<ffffffff81045694>] ? __kthread_parkme+0x59/0x59
Aug  3 03:52:45 murmillia kernel: [772458.656585]  [<ffffffff81356e3c>] ? ret_from_fork+0x7c/0xb0
Aug  3 03:52:45 murmillia kernel: [772458.656663]  [<ffffffff81045694>] ? __kthread_parkme+0x59/0x59
Aug  3 03:52:45 murmillia kernel: [772458.656741] Code: 78 5b 5d 41 5c 41 5d c3 65 48 8b 04 25 80 c7 00 00 48 8b 80 c8 02 00 00 48 8b 40 c8 48 c1 e8 02 83 e0 01 c3 48 8b 87 c8 02 00 00 <48> 8b 40 d8 c3 65 48 8b 04 25 80 c7 00 00 48 8b b8 c8 02 00 00 
Aug  3 03:52:45 murmillia kernel: [772458.659681] RIP  [<ffffffff81045a20>] kthread_data+0x7/0xc
Aug  3 03:52:45 murmillia kernel: [772458.659805]  RSP <ffff880015e43ab0>
Aug  3 03:52:45 murmillia kernel: [772458.659877] CR2: ffffffffffffffd8
Aug  3 03:52:45 murmillia kernel: [772458.659950] ---[ end trace 2a2a66811d33dc9f ]---
Aug  3 03:52:45 murmillia kernel: [772458.661770] Fixing recursive fault but reboot is needed!


Files

rbd.patch (12.5 KB) rbd.patch Olivier Bonvalet, 10/28/2013 04:42 AM

Related issues 2 (0 open2 closed)

Has duplicate rbd - Bug #7125: Assertion failure in rbd_img_obj_callback() ResolvedIlya Dryomov01/09/2014

Actions
Has duplicate rbd - Bug #8378: krbd: Kernel oops in rbd_img_obj_callbackResolvedIlya Dryomov05/16/2014

Actions
Actions

Also available in: Atom PDF