Bug #109
Closed
kernel bugs out with bad osd caps
% Done: 0%
Description
e.g.,
caps: [mds] allow
caps: [mon] allow r
caps: [osd] allow rw pool data
Updated by Sage Weil almost 14 years ago
[ 146.405507] ceph: tid 1 timed out on osd1, will reset osd
[ 146.411538] BUG: unable to handle kernel NULL pointer dereference at (null)
[ 146.412001] IP: [<ffffffff814d58ed>] tcp_sendpage+0x52d/0x710
[ 146.412001] PGD 11ea35067 PUD 11efc9067 PMD 0
[ 146.412001] Oops: 0000 [#1] SMP
[ 146.412001] last sysfs file: /sys/class/net/lo/operstate
[ 146.412001] CPU 0
[ 146.412001] Modules linked in: ceph
[ 146.412001]
[ 146.412001] Pid: 3475, comm: ceph-msgr/0 Not tainted 2.6.34-rc6 #2 PDSMi+/PDSMi
[ 146.412001] RIP: 0010:[<ffffffff814d58ed>] [<ffffffff814d58ed>] tcp_sendpage+0x52d/0x710
[ 146.412001] RSP: 0018:ffff88011f135cb0 EFLAGS: 00010246
[ 146.412001] RAX: ffffffff8183f100 RBX: 0000000000000001 RCX: 0000000000000000
[ 146.412001] RDX: 0000000000001000 RSI: ffff88011e3b9130 RDI: ffff88011e289140
[ 146.412001] RBP: ffff88011f1c4000 R08: 0000000000000001 R09: 0000000000000000
[ 146.412001] R10: 0000000000000000 R11: 0000000000000000 R12: ffff88011e289140
[ 146.412001] R13: ffff88011e289210 R14: 0000000000000000 R15: 0000000000000005
[ 146.412001] FS: 0000000000000000(0000) GS:ffff880001a00000(0000) knlGS:0000000000000000
[ 146.412001] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[ 146.412001] CR2: 0000000000000000 CR3: 000000011e86e000 CR4: 00000000000006f0
[ 146.412001] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 146.412001] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[ 146.412001] Process ceph-msgr/0 (pid: 3475, threadinfo ffff88011f134000, task ffff88011e828cc0)
[ 146.412001] Stack:
[ 146.412001] ffff88011ea51a47 ffff88011ea51d38 ffff88011ea519a0 ffff88011ea51868
[ 146.412001] <0> 0000000000000005 000000001ea51870 0000000000000000 000005a88146c389
[ 146.412001] <0> 000000000000c040 0000000000000000 0000000000000000 00000b50a001f6a2
[ 146.412001] Call Trace:
[ 146.412001] [<ffffffff8146b7d6>] ? kernel_sendpage+0x16/0x30
[ 146.412001] [<ffffffffa0021d8a>] ? con_work+0x39a/0x1590 [ceph]
[ 146.412001] [<ffffffff81035a94>] ? finish_task_switch+0x34/0xb0
[ 146.412001] [<ffffffff81035a94>] ? finish_task_switch+0x34/0xb0
[ 146.412001] [<ffffffffa00219f0>] ? con_work+0x0/0x1590 [ceph]
[ 146.412001] [<ffffffff8104eb0b>] ? worker_thread+0x13b/0x210
[ 146.412001] [<ffffffff81052db0>] ? autoremove_wake_function+0x0/0x30
[ 146.412001] [<ffffffff8104e9d0>] ? worker_thread+0x0/0x210
[ 146.412001] [<ffffffff8105292e>] ? kthread+0x8e/0xa0
[ 146.412001] [<ffffffff81003194>] ? kernel_thread_helper+0x4/0x10
[ 146.412001] [<ffffffff810528a0>] ? kthread+0x0/0xa0
[ 146.412001] [<ffffffff81003190>] ? kernel_thread_helper+0x0/0x10
[ 146.412001] Code: ff 8b 56 0c 41 ba 01 00 00 00 03 56 08 41 39 d1 0f 85 d1 fc ff ff e9 c6 fd ff ff 0f 1f 00 49 8b 84 24 d0 01 00 00 e9 d7 fe ff ff <66> 83 39 00 0f 88 c6 01 00 00 48 89 c8 f0 ff 40 08 8b 85 b4 00
[ 146.412001] RIP [<ffffffff814d58ed>] tcp_sendpage+0x52d/0x710
[ 146.412001] RSP <ffff88011f135cb0>
[ 146.412001] CR2: 0000000000000000
[ 146.684686] ---[ end trace a6569faac570fbce ]---
Updated by Sage Weil almost 14 years ago
- Target version changed from v2.6.35 to v2.6.34
Updated by Sage Weil almost 14 years ago
OSD error replies weren't unregistering the request. Fixed by commit:a40355b39e006459b1ffba052c53084d20d64209
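
For anyone reading this later, here is a toy illustration of the failure mode, not the kernel code and not the contents of the commit. It assumes the crash came from the timeout path acting on a request whose error reply had already been handled. The class and method names (`RequestTracker`, `handle_reply`, `timed_out`) are hypothetical, and the error value is only an example. The point is that the request is unregistered on every reply, whether it reports success or an error, so nothing stale is left for the timeout handler to resend.

```python
class RequestTracker:
    """Toy stand-in for a client's table of in-flight OSD requests (hypothetical)."""

    def __init__(self):
        self.pending = {}  # tid -> request payload

    def register(self, tid, payload):
        self.pending[tid] = payload

    def unregister(self, tid):
        # pop() with a default makes a repeated unregister harmless
        return self.pending.pop(tid, None)

    def handle_reply(self, tid, result):
        """Complete a request; runs the same way for success and error results."""
        payload = self.unregister(tid)
        if payload is None:
            return None  # unknown or already-completed tid
        return (tid, result)

    def timed_out(self):
        """Tids a timeout handler would still consider in flight."""
        return sorted(self.pending)


tracker = RequestTracker()
tracker.register(1, b"write to pool data")
tracker.handle_reply(1, -1)  # example error result from the OSD
print(tracker.timed_out())   # prints []
```

If `handle_reply` skipped `unregister` for error results, tid 1 would stay in `pending` and a later timeout pass would still see it, which is the shape of the "tid 1 timed out on osd1" line at the top of the log.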