Project

General

Profile

Actions

Bug #2867

closed

kclient: crash from ffsb in con_work -> kernel_sendmsg

Added by Sage Weil over 11 years ago. Updated over 11 years ago.

Status:
Resolved
Priority:
Urgent
Assignee:
Category:
libceph
Target version:
-
% Done:

0%

Source:
Development
Tags:
Backport:
Regression:
Severity:
Reviewed:
Affected Versions:
ceph-qa-suite:
Crash signature (v1):
Crash signature (v2):

Description


Stack traceback for pid 10908
0xffff88020b3d5e80    10908        2  1    0   R  0xffff88020b3d62d8 *kworker/0:2
<c> ffff8802205bf910<c> 0000000000000018<c> 0000000000000002<c> 0000000000000000<c>
<c> ffff88022261f500<c> ffff8802205bf990<c> ffffffff81563111<c> 0000000000000800<c>
<c> ffff8802205bf950<c> ffff8800795bf000<c> 0000000000000050<c> 00000000ffffffff<c>
Call Trace:
 [<ffffffff81563111>] ? sk_stream_alloc_skb+0x41/0x120
 [<ffffffff81511dc8>] ? __alloc_skb+0x78/0x230
 [<ffffffff81563111>] ? sk_stream_alloc_skb+0x41/0x120
 [<ffffffff815636c0>] ? tcp_sendmsg+0x4d0/0xe20
 [<ffffffff8158bad0>] ? inet_recvmsg+0xf0/0xf0
 [<ffffffff8158bb8c>] ? inet_sendmsg+0xbc/0xf0
 [<ffffffff8158bad0>] ? inet_recvmsg+0xf0/0xf0
 [<ffffffff810ae29d>] ? trace_hardirqs_on+0xd/0x10
 [<ffffffff81507ac7>] ? sock_sendmsg+0x117/0x130
 [<ffffffff8158bc93>] ? inet_sendpage+0xd3/0x120
 [<ffffffff8150e020>] ? sock_update_classid+0xa0/0x100
 [<ffffffff81507b20>] ? kernel_sendmsg+0x40/0x60
 [<ffffffffa0361335>] ? con_work+0x5e5/0x1620 [libceph]
 [<ffffffff8106d1f8>] ? process_one_work+0x1f8/0x510
 [<ffffffffa0360d50>] ? try_read+0x1860/0x1860 [libceph]
 [<ffffffff8106d18a>] ? process_one_work+0x18a/0x510
[0]more> 
 [<ffffffff8106d11e>] ? process_one_work+0x11e/0x510
 [<ffffffff8106ecdf>] ? worker_thread+0x15f/0x350
 [<ffffffff8106eb80>] ? manage_workers.isra.27+0x230/0x230
 [<ffffffff8107411e>] ? kthread+0xae/0xc0
 [<ffffffff810ae29d>] ? trace_hardirqs_on+0xd/0x10
 [<ffffffff816368f4>] ? kernel_thread_helper+0x4/0x10
 [<ffffffff8162d370>] ? retint_restore_args+0x13/0x13
 [<ffffffff81074070>] ? __init_kthread_worker+0x70/0x70
 [<ffffffff816368f0>] ? gs_change+0x13/0x13

ubuntu@teuthology:/a/sage-2012-07-27_14:57:14-regression-wip-msgr-masterbits-testing-basic/1392$ cat config.yaml 
kernel: &id001
  kdb: true
  sha1: 7f77b3063194c035c7ac6db634e300126d8f5896
nuke-on-error: true
overrides:
  ceph:
    fs: btrfs
    log-whitelist:
    - slow request
    sha1: de4474acbd7d3e2d0d4ec511680bfedb59b0a462
  workunit:
    sha1: de4474acbd7d3e2d0d4ec511680bfedb59b0a462
roles:
- - mon.a
  - mon.c
  - osd.0
  - osd.1
  - osd.2
- - mon.b
  - mds.a
  - osd.3
  - osd.4
  - osd.5
- - client.0
targets:
  ubuntu@plana51.front.sepia.ceph.com: ssh-rsa AAAAB3NzaC1yc2EAAAADAQABAAABAQDLTsxC+nR+xtTXMbOtazCh7MOzgBKjX/oCLMP16k0AtH8Ui92tlqsfNxcHczUol0DNzxCITgrhF8FTvgM3EgkbOUVAxGj+xLqfxsdlf58nTVXbm/pOGYnvOI8CvA4DgISHDbkzuFH4FKtR8qNTTFVmtEXaZ+jpSvn7vrYuI/Uu9XZOQh73phYW8zvVB1x8770czM0Gy2wgxdNguKy6L/Q9ShsLcFfm8Uvxf6aXb3qmuxwGhqYsMlNl0X3AjoOwmow74rodlcMvQP/pAQdjMZfe1lBPqsjmU518BE5eo7zV3O9iF6ahOrm8igOu9bfki0G52R22pA3hE9BPKPfzA0hL
  ubuntu@plana52.front.sepia.ceph.com: ssh-rsa AAAAB3NzaC1yc2EAAAADAQABAAABAQC9kswBp2g5ZV1Qrvlee8MvUOCNdubQFqUBr5WSsmFBODqEuiitWbhuBu2Ucz0lBMf41DpMKLeYDN0lIC94GZmGaiCN+Ak9Ia05d/uRvesT2nDgHB3Z9J/zEFlY8RVxL3xhD+hq4u8dbASlqqoMDiBP+7efZMxt4Ndnzr/yOxge3KenxyQImBUS+OV+BqnfCOHf6BqM33U1leXz2kng7ocxoE91DAMslKD/2DPRSYEhfucUJZk6IYevr/g0JVhbfvjSlZzwUEfTyVmPeqNyls/U+azhKlvQbqpb+ttc02RNydQ1YgOgHFCaqd9Vm8XjUU6vYGlkFHZ+BMJuEwA9AH/D
  ubuntu@plana67.front.sepia.ceph.com: ssh-rsa AAAAB3NzaC1yc2EAAAADAQABAAABAQDqiZsiT7h3fNR9yzwK2WToaotO4olIxmVdh+aSf3ILwEpHjFYbWXymL0C77hn0MdGRbaOzWOSMMng3MAHKy9xR3/CGNXXqO7iEK1fJOvSfmypkvJDyrMY/RuSvdifcXJyREvFsSK6cdmRpO235ODhfui4FC5BLmgv/VvasH/1Ur4ALfe7UE9L+cU4VeoJdl082oYeo1nn1beERgaypX67MXepG2NKbEY77jG5FXbGVpKWmsgIEWiiX8p6+afTOP+8cGsM3vsAG7nTJeFVKkEHc7A8cPkT4l/iXKjSiwWAtU5NV0QmRC/1ad78+xTOWNzJaTrIxoKuuGpB+DjdvrJgN
tasks:
- internal.lock_machines: 3
- internal.save_config: null
- internal.check_lock: null
- internal.connect: null
- internal.check_conflict: null
- kernel: *id001
- internal.base: null
- internal.archive: null
- internal.coredump: null
- internal.syslog: null
- internal.timer: null
- chef: null
- clock: null
- ceph:
    log-whitelist:
    - wrongly marked me down
    - objects unfound and apparently lost
- thrashosds: null
- kclient: null
- workunit:
    clients:
      all:
      - suites/ffsb.sh


Related issues 2 (0 open2 closed)

Related to Linux kernel client - Bug #2790: libceph: crash in read_partial_message_section on ffsbDuplicateSage Weil07/17/2012

Actions
Has duplicate Linux kernel client - Bug #2688: lockup on ffsb + thrashingDuplicate07/02/2012

Actions
Actions #1

Updated by Sage Weil over 11 years ago

  • Project changed from Ceph to Linux kernel client
Actions #2

Updated by Sage Weil over 11 years ago

  • Category set to libceph
Actions #3

Updated by Sage Weil over 11 years ago

  • Status changed from New to 12
  • Assignee set to Sage Weil
  • Priority changed from High to Urgent

This appears to be a regression, so it is effectively blocking sending the pull request to Linus.

Actions #4

Updated by Sage Weil over 11 years ago

  • Status changed from 12 to 7

sigh of relief

Actions #5

Updated by Sage Weil over 11 years ago

  • Status changed from 7 to Resolved
Actions

Also available in: Atom PDF