Project

General

Profile

Bug #1866

null pointer dereference after osd went down

Added by Josh Durgin over 7 years ago. Updated over 7 years ago.

Status:
Duplicate
Priority:
Normal
Assignee:
-
Category:
libceph
Target version:
Start date:
12/29/2011
Due date:
% Done:

0%

Source:
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:

Description

This was during a kernel_untar_build workunit on rbd:

Dec 29 00:18:31 sepia87 kernel: [27615.585661] rbd: loaded rbd (rados block device)
Dec 29 00:18:32 sepia87 kernel: [27616.074799] libceph: client4101 fsid c39dbc0b-b3c5-41c9-8323-008a70221f2d
Dec 29 00:18:32 sepia87 kernel: [27616.074985] libceph: mon1 10.3.14.213:6789 session established
Dec 29 00:18:35 sepia87 kernel: [27619.635973]  rbd0: unknown partition table
Dec 29 00:18:35 sepia87 kernel: [27619.636067] rbd: rbd0: added with size 0x280000000
Dec 29 00:19:03 sepia87 kernel: [27647.106474] kjournald starting.  Commit interval 5 seconds
Dec 29 00:19:03 sepia87 kernel: [27647.243689] EXT3-fs (rbd0): using internal journal
Dec 29 00:19:03 sepia87 kernel: [27647.243697] EXT3-fs (rbd0): mounted filesystem with ordered data mode
Dec 29 00:28:22 sepia87 kernel: [28206.391704] libceph: osd1 down
Dec 29 00:28:22 sepia87 kernel: [28206.494189] BUG: unable to handle kernel NULL pointer dereference at 0000000000000048
Dec 29 00:28:22 sepia87 kernel: [28206.502102] IP: [<ffffffffa033ea87>] try_write+0x627/0x1060 [libceph]
Dec 29 00:28:22 sepia87 kernel: [28206.504081] PGD e9cfa067 PUD e4b81067 PMD 0 
Dec 29 00:28:22 sepia87 kernel: [28206.504081] Oops: 0000 [#1] SMP 
Dec 29 00:28:22 sepia87 kernel: [28206.504081] CPU 0 
Dec 29 00:28:22 sepia87 kernel: [28206.504081] Modules linked in: rbd ceph libceph cryptd aes_x86_64 aes_generic btrfs zlib_deflate crc32c libcrc32c ufs qnx4 hfsplus hfs minix ntfs vfat msdos fat jfs xfs exportfs reiserfs psmouse i2c_piix4 amd64_edac_mod edac_core lp edac_mce_amd parport k8temp serio_raw shpchp tg3 floppy sata_svw pata_serverworks [last unloaded: libceph]
Dec 29 00:28:22 sepia87 kernel: [28206.504081] 
Dec 29 00:28:22 sepia87 kernel: [28206.504081] Pid: 8641, comm: kworker/0:2 Not tainted 3.1.0-ceph-08942-g8cd7ede #1 Supermicro H8SSL/H8SSL
Dec 29 00:28:22 sepia87 kernel: [28206.504081] RIP: 0010:[<ffffffffa033ea87>]  [<ffffffffa033ea87>] try_write+0x627/0x1060 [libceph]
Dec 29 00:28:22 sepia87 kernel: [28206.504081] RSP: 0018:ffff8800dfa4db00  EFLAGS: 00010256
Dec 29 00:28:22 sepia87 kernel: [28206.504081] RAX: 0000000000000000 RBX: ffff8800e9db8830 RCX: 0000000000000000
Dec 29 00:28:22 sepia87 kernel: [28206.504081] RDX: 0000000000053000 RSI: 0000000000000001 RDI: 0000000000000000
Dec 29 00:28:22 sepia87 kernel: [28206.504081] RBP: ffff8800dfa4dc20 R08: 0000000000000000 R09: 0000000000000000
Dec 29 00:28:22 sepia87 kernel: [28206.504081] R10: 0000000000000000 R11: 0000000000000000 R12: ffffea0001f9e840
Dec 29 00:28:22 sepia87 kernel: [28206.504081] R13: 000000000002d000 R14: ffff8800e9ccad00 R15: 0000000000000000
Dec 29 00:28:22 sepia87 kernel: [28206.504081] FS:  00007effb6e3e700(0000) GS:ffff8800fbc00000(0000) knlGS:0000000000000000
Dec 29 00:28:22 sepia87 kernel: [28206.504081] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
Dec 29 00:28:22 sepia87 kernel: [28206.504081] CR2: 0000000000000048 CR3: 00000000eca51000 CR4: 00000000000006f0
Dec 29 00:28:22 sepia87 kernel: [28206.504081] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Dec 29 00:28:22 sepia87 kernel: [28206.504081] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Dec 29 00:28:22 sepia87 kernel: [28206.504081] Process kworker/0:2 (pid: 8641, threadinfo ffff8800dfa4c000, task ffff880037141fb0)
Dec 29 00:28:22 sepia87 kernel: [28206.504081] Stack:
Dec 29 00:28:22 sepia87 kernel: [28206.504081]  0000000100004040 ffff880037e41000 0000000000000000 0000000000000000
Dec 29 00:28:22 sepia87 kernel: [28206.504081]  ffff8800dfa4dbd0 0000000000000000 ffff8800e4bc9a91 ffff8800e4bc9a00
Dec 29 00:28:22 sepia87 kernel: [28206.504081]  ffff8800e9db8c10 ffff8800e9db8a40 ffff8800e9db8870 ffff8800e9db8a30
Dec 29 00:28:22 sepia87 kernel: [28206.504081] Call Trace:
Dec 29 00:28:22 sepia87 kernel: [28206.504081]  [<ffffffff814e8db4>] ? kernel_recvmsg+0x44/0x60
Dec 29 00:28:22 sepia87 kernel: [28206.504081]  [<ffffffffa033d748>] ? ceph_tcp_recvmsg+0x48/0x60 [libceph]
Dec 29 00:28:22 sepia87 kernel: [28206.504081]  [<ffffffffa03400d0>] con_work+0xc10/0x1b00 [libceph]
Dec 29 00:28:22 sepia87 kernel: [28206.504081]  [<ffffffff81313f7e>] ? trace_hardirqs_on_thunk+0x3a/0x3f
Dec 29 00:28:22 sepia87 kernel: [28206.504081]  [<ffffffff81603c74>] ? retint_restore_args+0x13/0x13
Dec 29 00:28:22 sepia87 kernel: [28206.504081]  [<ffffffff8107fcb6>] process_one_work+0x1a6/0x520
Dec 29 00:28:22 sepia87 kernel: [28206.504081]  [<ffffffff8107fc47>] ? process_one_work+0x137/0x520


Related issues

Duplicates Linux kernel client - Bug #1793: NULL pointer dereference at try_write+0x627/0x1060 Can't reproduce 12/06/2011

History

#1 Updated by Sage Weil over 7 years ago

  • translation missing: en.field_position set to 1

#2 Updated by Sage Weil over 7 years ago

  • Target version set to v3.3
  • translation missing: en.field_position deleted (1)
  • translation missing: en.field_position set to 698

#3 Updated by Sage Weil over 7 years ago

  • Status changed from New to Duplicate

same as #1793

Also available in: Atom PDF