Project

General

Profile

Actions

Bug #2688

closed

lockup on ffsb + thrashing

Added by Sage Weil almost 12 years ago. Updated over 11 years ago.

Status:
Duplicate
Priority:
High
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
Q/A
Tags:
Backport:
Regression:
Severity:
Reviewed:
Affected Versions:
ceph-qa-suite:
Crash signature (v1):
Crash signature (v2):

Description

[15545.880295] INFO: Stall ended before state dump start
[15572.788376] INFO: rcu_sched detected stalls on CPUs/tasks: { 0} (detected by 7, t=186060 jiffies)
[15572.857113] INFO: Stall ended before state dump start
[15725.552445] INFO: rcu_bh detected stalls on CPUs/tasks: { 0} (detected by 6, t=168062 jiffies)
[15725.623650] INFO: Stall ended before state dump start
[15752.575408] INFO: rcu_sched detected stalls on CPUs/tasks: { 0} (detected by 3, t=204070 jiffies)
[15752.647627] INFO: Stall ended before state dump start
[15905.289610] INFO: rcu_bh detected stalls on CPUs/tasks: { 0} (detected by 6, t=186067 jiffies)
[15905.364026] INFO: Stall ended before state dump start
[15932.352447] INFO: rcu_sched detected stalls on CPUs/tasks: { 0} (detected by 2, t=222079 jiffies)
[15932.429773] INFO: Stall ended before state dump start
[16085.026682] INFO: rcu_bh detected stalls on CPUs/tasks: { 0} (detected by 7, t=204072 jiffies)
[16085.103708] INFO: Stall ended before state dump start

ubuntu@teuthology:/a/sage-2012-07-02_11:21:38-regression-next-testing-basic/4592$ cat config.yaml 
kernel: &id001
  branch: testing
  kdb: true
nuke-on-error: true
overrides:
  ceph:
    branch: next
    fs: btrfs
    log-whitelist:
    - slow request
roles:
- - mon.a
  - osd.0
  - osd.1
  - osd.2
- - mds.a
  - osd.3
  - osd.4
  - osd.5
- - client.0
targets:
  ubuntu@plana49.front.sepia.ceph.com: ssh-rsa AAAAB3NzaC1yc2EAAAADAQABAAABAQCsodFx35LkTvKGlOrQT7Zt/AuvpUOoNz4sM8ovIxIjn/AJliZfup2KAhO/VRUpcndtJMy0eAp/v6wMy9bP0tqTNbeZ3q+zzzLuwnoCOQ0relggohTE7lzLDVLb/MHCQlxpLifwUrwQpVdiUfJ2B5mcGyMr2Lku4TcC3BrgPKuaXLkGHZl0aaShLfr6PbTvDjqI+IDT4E4iKd65KhzLuzDqAIwuZyoifiG+5KYYsoIZmCHidTZYHdf6utnBzP9jTQHroZR5kE/qbVnoL1tT9vvXWLcG7y2PB3UMmIOJmRXlys2/My5iRSC+1Wd9nBxcwE3BgWHknGQhrwkMl0uC+vH3
  ubuntu@plana50.front.sepia.ceph.com: ssh-rsa AAAAB3NzaC1yc2EAAAADAQABAAABAQDVJ+lkgUdkr27WFzrmwSQU22m+pFIiqzhfcO4Hinu8A8uyP4FIephrEcq4Rrt4hp14Syb1pxXisV6UKwAZKikDoD1Wl0LSro4TzOs6HuMEhfvzdnISvyzE3f2w0cj1zE61rHFYfPNF14b9fkE3wBf2Vb4i6ReaN2/Yd12J/xO52tJH1lPxgsFoAIRMjdQMbfVwPU6kK9SY4ngt9iLjge6gZ0O9Jwe2vrgD6+LNoMY9qvNjgRvQdCTi85OQwitU0ZMZdGC0cQ/oNbKd+yW92rW9Wu6dcyKSisesRcm7lbtS6X2uUup+u3vWze7coT+Py3TdNW6nGpIg4muyvqHfSinz
  ubuntu@plana51.front.sepia.ceph.com: ssh-rsa AAAAB3NzaC1yc2EAAAADAQABAAABAQDLTsxC+nR+xtTXMbOtazCh7MOzgBKjX/oCLMP16k0AtH8Ui92tlqsfNxcHczUol0DNzxCITgrhF8FTvgM3EgkbOUVAxGj+xLqfxsdlf58nTVXbm/pOGYnvOI8CvA4DgISHDbkzuFH4FKtR8qNTTFVmtEXaZ+jpSvn7vrYuI/Uu9XZOQh73phYW8zvVB1x8770czM0Gy2wgxdNguKy6L/Q9ShsLcFfm8Uvxf6aXb3qmuxwGhqYsMlNl0X3AjoOwmow74rodlcMvQP/pAQdjMZfe1lBPqsjmU518BE5eo7zV3O9iF6ahOrm8igOu9bfki0G52R22pA3hE9BPKPfzA0hL
tasks:
- internal.lock_machines: 3
- internal.save_config: null
- internal.check_lock: null
- internal.connect: null
- internal.check_conflict: null
- kernel: *id001
- internal.base: null
- internal.archive: null
- internal.coredump: null
- internal.syslog: null
- internal.timer: null
- chef: null
- clock: null
- ceph:
    log-whitelist:
    - wrongly marked me down or wrong addr
    - objects unfound and apparently lost
- thrashosds: null
- kclient: null
- workunit:
    clients:
      all:
      - suites/ffsb.sh

pretty sure i've seen this several times.  this is on v3.5-rc1-55-g84f5ea9


Related issues 2 (0 open2 closed)

Related to Linux kernel client - Bug #2868: kclient: crash in __kick_osd_requests -> __reset_osd -> __remove_osdResolved07/28/2012

Actions
Is duplicate of Linux kernel client - Bug #2867: kclient: crash from ffsb in con_work -> kernel_sendmsg ResolvedSage Weil07/27/2012

Actions
Actions #1

Updated by Sage Weil over 11 years ago

  • Status changed from 12 to Duplicate
Actions

Also available in: Atom PDF