Project

General

Profile

Actions

Bug #2506

closed

ceph: ceph_add_cap: couldn't find snap realm NNN

Added by Sage Weil almost 12 years ago. Updated over 11 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
-
Category:
fs/ceph
Target version:
-
% Done:

0%

Source:
Community (user)
Tags:
Backport:
Regression:
Severity:
Reviewed:
Affected Versions:
ceph-qa-suite:
Crash signature (v1):
Crash signature (v2):

Description

From Martin Wilderoth on ML:

[ 1828.569091] ceph: ceph_add_cap: couldn't find snap realm 100
[ 1828.569105] ------------[ cut here ]------------
[ 1828.569121] WARNING: at
/build/buildd-linux-2.6_3.2.17-1~bpo60+1-amd64-CJo7Ex/linux-2.6-3.2.17/debian/bu
ild/source_amd64_none/fs/ceph/caps.c:590 ceph_add_cap+0x38e/0x49e [ceph]()
[ 1828.569139] Modules linked in: cryptd aes_x86_64 aes_generic cbc ceph libceph
crc32c libcrc32c evdev snd_pcm snd_timer snd soundcore snd_page_alloc pcspkr
ext3 jbd mbcache xen_netfront xen_blkfront
[ 1828.569182] Pid: 18, comm: kworker/0:1 Tainted: G        W
3.2.0-0.bpo.2-amd64 #1
[ 1828.569193] Call Trace:
[ 1828.569207]  [<ffffffff810497ec>] ? warn_slowpath_common+0x78/0x8c
[ 1828.569221]  [<ffffffffa00db647>] ? ceph_add_cap+0x38e/0x49e [ceph]
[ 1828.569233]  [<ffffffffa00d220a>] ? fill_inode+0x4eb/0x602 [ceph]
[ 1828.569244]  [<ffffffffa00d331b>] ? ceph_dentry_lru_touch+0x2a/0x68 [ceph]
[ 1828.569258]  [<ffffffffa00d317d>] ? ceph_readdir_prepopulate+0x2de/0x375
[ceph]
[ 1828.569271]  [<ffffffffa00e2d3f>] ? dispatch+0xa35/0xef2 [ceph]
[ 1828.569286]  [<ffffffffa00ae841>] ? ceph_tcp_recvmsg+0x43/0x4f [libceph]
[ 1828.569297]  [<ffffffffa00b0821>] ? con_work+0x1070/0x13b8 [libceph]
[ 1828.569308]  [<ffffffff81044580>] ? update_curr+0xbc/0x160
[ 1828.569319]  [<ffffffffa00af7b1>] ? try_write+0xbe1/0xbe1 [libceph]
[ 1828.569332]  [<ffffffff8105f8bb>] ? process_one_work+0x1cc/0x2ea
[ 1828.569342]  [<ffffffff8105fb06>] ? worker_thread+0x12d/0x247
[ 1828.569353]  [<ffffffff8105f9d9>] ? process_one_work+0x2ea/0x2ea
[ 1828.569361]  [<ffffffff8105f9d9>] ? process_one_work+0x2ea/0x2ea
[ 1828.569372]  [<ffffffff81063311>] ? kthread+0x7a/0x82
[ 1828.569384]  [<ffffffff8136bb34>] ? kernel_thread_helper+0x4/0x10
[ 1828.569395]  [<ffffffff81369bf3>] ? int_ret_from_sys_call+0x7/0x1b
[ 1828.569406]  [<ffffffff813646fc>] ? retint_restore_args+0x5/0x6
[ 1828.569417]  [<ffffffff8136bb30>] ? gs_change+0x13/0x13
[ 1828.569423] ---[ end trace 98770cddb79a6a55 ]---

Actions #1

Updated by Anonymous almost 12 years ago

Note that this was triggered frequently by backuppc runs:
http://thread.gmane.org/gmane.comp.file-systems.ceph.devel/6815/focus=6820

Actions #2

Updated by Sage Weil over 11 years ago

  • Status changed from 12 to Resolved

Yan found this! by backuppc

Actions #3

Updated by Samuel Hassine over 11 years ago

Hi Ceph Dev team,

Same problem here, with the last version of Ceph kernel client and the 2 last kernel (linux-image-3.2.0-33-generic and linux-image-3.5.0-18-generic).

Kernel output:

Nov 28 00:37:49 proxy-001 kernel: [  372.996428] ceph: ceph_add_cap: couldn't find snap realm 100
Nov 28 00:37:49 proxy-001 kernel: [  372.996467] ------------[ cut here ]------------
Nov 28 00:37:49 proxy-001 kernel: [  372.996475] WARNING: at /build/buildd/linux-lts-quantal-3.5.0/fs/ceph/caps.c:590 ceph_add_cap+0x3ea/0x3f0 [ceph]()
Nov 28 00:37:49 proxy-001 kernel: [  372.996476] Hardware name: Bochs
Nov 28 00:37:49 proxy-001 kernel: [  372.996477] Modules linked in: ipt_MASQUERADE iptable_nat nf_nat nf_conntrack_ipv4 nf_conntrack nf_defrag_ipv4 ip_tables x_tables c
eph joydev libceph psmouse mac_hid i2c_piix4 microcode hid_generic lp virtio_balloon serio_raw parport usbhid hid floppy btrfs zlib_deflate libcrc32c
Nov 28 00:37:49 proxy-001 kernel: [  372.996493] Pid: 24, comm: kworker/0:1 Tainted: G        W    3.5.0-18-generic #29~precise1-Ubuntu
Nov 28 00:37:49 proxy-001 kernel: [  372.996494] Call Trace:
Nov 28 00:37:49 proxy-001 kernel: [  372.996500]  [<ffffffff81052c9f>] warn_slowpath_common+0x7f/0xc0
Nov 28 00:37:49 proxy-001 kernel: [  372.996506]  [<ffffffff81052cfa>] warn_slowpath_null+0x1a/0x20
Nov 28 00:37:49 proxy-001 kernel: [  372.996524]  [<ffffffffa01b0afa>] ceph_add_cap+0x3ea/0x3f0 [ceph]
Nov 28 00:37:49 proxy-001 kernel: [  372.996528]  [<ffffffffa01a5cc7>] fill_inode.isra.8+0x697/0x770 [ceph]
Nov 28 00:37:49 proxy-001 kernel: [  372.996531]  [<ffffffff8169e1ee>] ? _raw_spin_lock+0xe/0x20
Nov 28 00:37:49 proxy-001 kernel: [  372.996535]  [<ffffffffa01a5f72>] ceph_readdir_prepopulate+0x1d2/0x3d0 [ceph]
Nov 28 00:37:49 proxy-001 kernel: [  372.996539]  [<ffffffffa01b9cc4>] handle_reply+0x394/0x510 [ceph]
Nov 28 00:37:49 proxy-001 kernel: [  372.996543]  [<ffffffffa01bbeab>] dispatch+0xdb/0x180 [ceph]
Nov 28 00:37:49 proxy-001 kernel: [  372.996548]  [<ffffffffa0184d24>] try_read+0x3d4/0x460 [libceph]
Nov 28 00:37:49 proxy-001 kernel: [  372.996564]  [<ffffffffa0184db0>] ? try_read+0x460/0x460 [libceph]
Nov 28 00:37:49 proxy-001 kernel: [  372.996567]  [<ffffffffa0184e2a>] con_work+0x7a/0x160 [libceph]
Nov 28 00:37:49 proxy-001 kernel: [  372.996571]  [<ffffffff81071aea>] process_one_work+0x11a/0x480
Nov 28 00:37:49 proxy-001 kernel: [  372.996573]  [<ffffffff81072b05>] worker_thread+0x165/0x370
Nov 28 00:37:49 proxy-001 kernel: [  372.996575]  [<ffffffff810729a0>] ? manage_workers.isra.29+0x130/0x130
Nov 28 00:37:49 proxy-001 kernel: [  372.996578]  [<ffffffff81077aa3>] kthread+0x93/0xa0
Nov 28 00:37:49 proxy-001 kernel: [  372.996580]  [<ffffffff816a7d64>] kernel_thread_helper+0x4/0x10
Nov 28 00:37:49 proxy-001 kernel: [  372.996582]  [<ffffffff81077a10>] ? flush_kthread_worker+0xb0/0xb0
Nov 28 00:37:49 proxy-001 kernel: [  372.996584]  [<ffffffff816a7d60>] ? gs_change+0x13/0x13
Nov 28 00:37:49 proxy-001 kernel: [  372.996585] ---[ end trace 360f9571ddaff772 ]---

Is this bug already solved or it will be in the next version?

Thanks for your answer.

Best regards.
Sam

Actions #4

Updated by Samuel Hassine over 11 years ago

Re,

Ceph version causes the kernel panics we are encountering:

root@proxy-001:/home/sysmastr# ceph -v
ceph version 0.48.2argonaut (commit:3e02b2fad88c2a95d9c0c86878f10d1beb780bfe)

Best regards.
Sam

Actions #5

Updated by Sage Weil over 11 years ago

commit:3384431b6d889486d19d1a1543626fb1a2cff178, which appears in v0.55, should fix the 'snap realm 100' problem. Thanks!

Actions

Also available in: Atom PDF