Project

General

Profile

Actions

Bug #5215

closed

mon: hang during sync with mon thrashing

Added by Sage Weil almost 11 years ago. Updated almost 11 years ago.

Status:
Resolved
Priority:
Immediate
Category:
Monitor
Target version:
-
% Done:

0%

Source:
Q/A
Tags:
Backport:
Regression:
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

mon syncs for a while and then stops/get stuck. i think this job failed yesterday, too, so it is likely easy to reproduce. full logs are available.

ubuntu@teuthology:/a/teuthology-2013-05-30_18:12:14-rados-next-testing-basic/26825$ cat orig.config.yaml 
kernel:
  kdb: true
  sha1: 81d7ac5e09dd6bce8de43272de102aafdb3481c6
machine_type: plana
nuke-on-error: true
overrides:
  ceph:
    conf:
      global:
        ms inject socket failures: 5000
      mon:
        debug mon: 20
        debug ms: 1
        debug paxos: 20
    fs: xfs
    log-whitelist:
    - slow request
    sha1: cf9aa7a0037e56eada8b3c1bb59d59d0bfe7bba5
  install:
    ceph:
      sha1: cf9aa7a0037e56eada8b3c1bb59d59d0bfe7bba5
  s3tests:
    branch: next
  workunit:
    sha1: cf9aa7a0037e56eada8b3c1bb59d59d0bfe7bba5
roles:
- - mon.a
  - mon.c
  - osd.0
  - osd.1
  - osd.2
- - mon.b
  - mds.a
  - osd.3
  - osd.4
  - osd.5
  - client.0
tasks:
- chef: null
- clock.check: null
- install: null
- ceph: null
- mon_thrash:
    revive_delay: 20
    thrash_delay: 1
- rados:
    clients:
    - client.0
    objects: 50
    op_weights:
      delete: 50
      read: 100
      rollback: 50
      snap_create: 50
      snap_remove: 50
      write: 100
    ops: 4000


Files

gdb.txt (43.9 KB) gdb.txt Joao Eduardo Luis, 06/04/2013 07:17 AM
Actions

Also available in: Atom PDF