Project

General

Profile

Bug #6073

osd: mark_me_down sequence is racy

Added by Sage Weil over 10 years ago. Updated almost 7 years ago.

Status:
Can't reproduce
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
Q/A
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

2013-08-21 02:58:06.798796 mon.0 10.214.131.10:6789/0 294 : [INF] osd.0 marked itself down
2013-08-21 02:58:07.760169 mon.0 10.214.131.10:6789/0 295 : [INF] osdmap e98: 6 osds: 4 up, 6 in
2013-08-21 02:58:07.821448 osd.0 10.214.131.33:6800/12089 1 : [WRN] map e98 wrongly marked me down
2013-08-21 02:58:06.798796 mon.0 10.214.131.10:6789/0 294 : [INF] osd.0 marked itself down
2013-08-21 02:58:07.760169 mon.0 10.214.131.10:6789/0 295 : [INF] osdmap e98: 6 osds: 4 up, 6 in
2013-08-21 02:58:07.821448 osd.0 10.214.131.33:6800/12089 1 : [WRN] map e98 wrongly marked me down

job was
ubuntu@teuthology:/a/teuthology-2013-08-21_01:00:14-rados-master-testing-basic-plana/4751$ cat orig.config.yaml 
kernel:
  kdb: true
  sha1: 546140dd51e9ec7e34fe0b0a5814240828f68f7d
machine_type: plana
nuke-on-error: true
os_type: ubuntu
overrides:
  admin_socket:
    branch: master
  ceph:
    conf:
      global:
        ms inject socket failures: 5000
      mon:
        debug mon: 20
        debug ms: 1
        debug paxos: 20
      osd:
        osd op thread timeout: 60
    fs: btrfs
    log-whitelist:
    - slow request
    sha1: a35ab949fd8e6e2e259076d76d0d41742f045398
    valgrind:
      mds:
      - --tool=memcheck
      mon:
      - --tool=memcheck
      - --leak-check=full
      - --show-reachable=yes
      osd:
      - --tool=memcheck
  ceph-deploy:
    branch:
      dev: master
    conf:
      client:
        log file: /var/log/ceph/ceph-$name.$pid.log
      mon:
        debug mon: 1
        debug ms: 20
        debug paxos: 20
  install:
    ceph:
      flavor: notcmalloc
      sha1: a35ab949fd8e6e2e259076d76d0d41742f045398
  s3tests:
    branch: master
  workunit:
    sha1: a35ab949fd8e6e2e259076d76d0d41742f045398
roles:
- - mon.a
  - mon.c
  - osd.0
  - osd.1
  - osd.2
  - client.0
- - mon.b
  - mds.a
  - osd.3
  - osd.4
  - osd.5
  - client.1
tasks:
- chef: null
- clock.check: null
- install: null
- ceph: null
- workunit:
    clients:
      client.0:
      - cls
teuthology_branch: master

History

#1 Updated by Sage Weil over 10 years ago

  • Priority changed from High to Normal

#2 Updated by Sage Weil almost 7 years ago

  • Status changed from New to Can't reproduce

Also available in: Atom PDF