Bug #109

kernel bugs out with bad osd caps

Added by Sage Weil over 9 years ago. Updated almost 9 years ago.

Status: Resolved
Priority: High
Assignee: -
Category: -
Target version:
Start date: 05/06/2010
Due date:
% Done: 0%
Source:
Tags:
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Crash signature:

Description

For example, with the following caps:

        caps: [mds] allow
        caps: [mon] allow r
        caps: [osd] allow rw pool data

History

#1 Updated by Sage Weil over 9 years ago

[  146.405507] ceph:  tid 1 timed out on osd1, will reset osd
[  146.411538] BUG: unable to handle kernel NULL pointer dereference at (null)
[  146.412001] IP: [<ffffffff814d58ed>] tcp_sendpage+0x52d/0x710
[  146.412001] PGD 11ea35067 PUD 11efc9067 PMD 0 
[  146.412001] Oops: 0000 [#1] SMP 
[  146.412001] last sysfs file: /sys/class/net/lo/operstate
[  146.412001] CPU 0 
[  146.412001] Modules linked in: ceph
[  146.412001] 
[  146.412001] Pid: 3475, comm: ceph-msgr/0 Not tainted 2.6.34-rc6 #2 PDSMi+/PDSMi
[  146.412001] RIP: 0010:[<ffffffff814d58ed>]  [<ffffffff814d58ed>] tcp_sendpage+0x52d/0x710
[  146.412001] RSP: 0018:ffff88011f135cb0  EFLAGS: 00010246
[  146.412001] RAX: ffffffff8183f100 RBX: 0000000000000001 RCX: 0000000000000000
[  146.412001] RDX: 0000000000001000 RSI: ffff88011e3b9130 RDI: ffff88011e289140
[  146.412001] RBP: ffff88011f1c4000 R08: 0000000000000001 R09: 0000000000000000
[  146.412001] R10: 0000000000000000 R11: 0000000000000000 R12: ffff88011e289140
[  146.412001] R13: ffff88011e289210 R14: 0000000000000000 R15: 0000000000000005
[  146.412001] FS:  0000000000000000(0000) GS:ffff880001a00000(0000) knlGS:0000000000000000
[  146.412001] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[  146.412001] CR2: 0000000000000000 CR3: 000000011e86e000 CR4: 00000000000006f0
[  146.412001] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  146.412001] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[  146.412001] Process ceph-msgr/0 (pid: 3475, threadinfo ffff88011f134000, task ffff88011e828cc0)
[  146.412001] Stack:
[  146.412001]  ffff88011ea51a47 ffff88011ea51d38 ffff88011ea519a0 ffff88011ea51868
[  146.412001] <0> 0000000000000005 000000001ea51870 0000000000000000 000005a88146c389
[  146.412001] <0> 000000000000c040 0000000000000000 0000000000000000 00000b50a001f6a2
[  146.412001] Call Trace:
[  146.412001]  [<ffffffff8146b7d6>] ? kernel_sendpage+0x16/0x30
[  146.412001]  [<ffffffffa0021d8a>] ? con_work+0x39a/0x1590 [ceph]
[  146.412001]  [<ffffffff81035a94>] ? finish_task_switch+0x34/0xb0
[  146.412001]  [<ffffffff81035a94>] ? finish_task_switch+0x34/0xb0
[  146.412001]  [<ffffffffa00219f0>] ? con_work+0x0/0x1590 [ceph]
[  146.412001]  [<ffffffff8104eb0b>] ? worker_thread+0x13b/0x210
[  146.412001]  [<ffffffff81052db0>] ? autoremove_wake_function+0x0/0x30
[  146.412001]  [<ffffffff8104e9d0>] ? worker_thread+0x0/0x210
[  146.412001]  [<ffffffff8105292e>] ? kthread+0x8e/0xa0
[  146.412001]  [<ffffffff81003194>] ? kernel_thread_helper+0x4/0x10
[  146.412001]  [<ffffffff810528a0>] ? kthread+0x0/0xa0
[  146.412001]  [<ffffffff81003190>] ? kernel_thread_helper+0x0/0x10
[  146.412001] Code: ff 8b 56 0c 41 ba 01 00 00 00 03 56 08 41 39 d1 0f 85 d1 fc ff ff e9 c6 fd ff ff 0f 1f 00 49 8b 84 24 d0 01 00 00 e9 d7 fe ff ff <66> 83 39 00 0f 88 c6 01 00 00 48 89 c8 f0 ff 40 08 8b 85 b4 00 
[  146.412001] RIP  [<ffffffff814d58ed>] tcp_sendpage+0x52d/0x710
[  146.412001]  RSP <ffff88011f135cb0>
[  146.412001] CR2: 0000000000000000
[  146.684686] ---[ end trace a6569faac570fbce ]---

#2 Updated by Sage Weil over 9 years ago

  • Priority changed from Normal to High

#3 Updated by Sage Weil over 9 years ago

  • Target version changed from v2.6.35 to v2.6.34

#4 Updated by Sage Weil over 9 years ago

  • Status changed from New to Resolved

#5 Updated by Sage Weil over 9 years ago

OSD errors weren't unregistering the request. Fixed by commit:a40355b39e006459b1ffba052c53084d20d64209
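
The failure pattern named in the note above (an error reply that leaves the request registered) can be illustrated with a toy model. This is only a sketch in Python, not the kernel code from the referenced commit: `RequestTracker`, its methods, and the error value are hypothetical names chosen for illustration. It shows why a request whose reply carried an error would still be seen by the timeout path, which matches the "tid 1 timed out" line at the start of the log.

```python
class RequestTracker:
    """Toy model of a client's table of in-flight requests, keyed by tid.

    Hypothetical illustration only; it does not mirror the kernel's structures.
    """

    def __init__(self):
        self.in_flight = {}

    def submit(self, tid, payload):
        # Register the request so replies and timeouts can find it.
        self.in_flight[tid] = payload

    def handle_reply_buggy(self, tid, result):
        # Bug pattern: only successful replies unregister the request,
        # so an error reply leaves a stale entry behind.
        if result >= 0:
            self.in_flight.pop(tid, None)
        return result

    def handle_reply_fixed(self, tid, result):
        # Fix pattern: unregister on every reply, success or error.
        self.in_flight.pop(tid, None)
        return result

    def pending_tids(self):
        # Requests a timeout scan would still consider outstanding.
        return sorted(self.in_flight)


EPERM = -1  # assumed error code for a request rejected by the OSD

buggy = RequestTracker()
buggy.submit(1, b"write")
buggy.handle_reply_buggy(1, EPERM)

fixed = RequestTracker()
fixed.submit(1, b"write")
fixed.handle_reply_fixed(1, EPERM)
```

In the buggy variant, tid 1 stays in the table after the error reply, so a later timeout scan would act on a request that has already completed. In the fixed variant, the table is empty.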
