Project

General

Profile

Actions

Bug #5069

closed

monitor crashed during mon thrash in nightlies

Added by Tamilarasi muthamizhan almost 11 years ago. Updated almost 11 years ago.

Status:
Resolved
Priority:
High
Category:
Monitor
Target version:
-
% Done:

0%

Source:
Q/A
Tags:
Backport:
Regression:
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

logs: ubuntu@teuthology:/a/teuthology-2013-05-14_01:00:05-rados-next-testing-basic/12938

2013-05-14T01:28:35.267 INFO:teuthology.task.workunit.client.0.err:2013-05-14 01:28:56.156297 7f64397fa700  0 monclient: hunting for new mon
2013-05-14T01:29:02.589 INFO:teuthology.task.ceph.mon.a.err:mon/MDSMonitor.cc: In function 'virtual void MDSMonitor::update_from_paxos()' thread 7fde038cb700 time 2013-05-14 01:29:23.476858
2013-05-14T01:29:02.589 INFO:teuthology.task.ceph.mon.a.err:mon/MDSMonitor.cc: 88: FAILED assert(version >= mdsmap.epoch)
2013-05-14T01:29:02.589 INFO:teuthology.task.ceph.mon.a.err: ceph version 0.61-277-gc5deb5d (c5deb5dbe2a1e7ada1cab778da41ddb30d9c880b)
2013-05-14T01:29:02.589 INFO:teuthology.task.ceph.mon.a.err: 1: (MDSMonitor::update_from_paxos()+0x80d) [0x52d84d]
2013-05-14T01:29:02.590 INFO:teuthology.task.ceph.mon.a.err: 2: (PaxosService::_active()+0x28c) [0x4f40bc]
2013-05-14T01:29:02.590 INFO:teuthology.task.ceph.mon.a.err: 3: (Context::complete(int)+0xa) [0x4cafda]
2013-05-14T01:29:02.590 INFO:teuthology.task.ceph.mon.a.err: 4: (finish_contexts(CephContext*, std::list<Context*, std::allocator<Context*> >&, int)+0x95) [0x4cd7f5]
2013-05-14T01:29:02.590 INFO:teuthology.task.ceph.mon.a.err: 5: (Paxos::handle_last(MMonPaxos*)+0xc72) [0x4eda22]
2013-05-14T01:29:02.590 INFO:teuthology.task.ceph.mon.a.err: 6: (Paxos::dispatch(PaxosServiceMessage*)+0x26b) [0x4ede7b]
2013-05-14T01:29:02.591 INFO:teuthology.task.ceph.mon.a.err: 7: (Monitor::_ms_dispatch(Message*)+0x10ac) [0x4c30ac]
2013-05-14T01:29:02.591 INFO:teuthology.task.ceph.mon.a.err: 8: (Monitor::ms_dispatch(Message*)+0x32) [0x4dcc62]
2013-05-14T01:29:02.591 INFO:teuthology.task.ceph.mon.a.err: 9: (DispatchQueue::entry()+0x3f1) [0x6b4ca1]
2013-05-14T01:29:02.591 INFO:teuthology.task.ceph.mon.a.err: 10: (DispatchQueue::DispatchThread::entry()+0xd) [0x64263d]
2013-05-14T01:29:02.591 INFO:teuthology.task.ceph.mon.a.err: 11: (()+0x7e9a) [0x7fde0862ee9a]
2013-05-14T01:29:02.591 INFO:teuthology.task.ceph.mon.a.err: 12: (clone()+0x6d) [0x7fde06bdeccd]
2013-05-14T01:29:02.592 INFO:teuthology.task.ceph.mon.a.err: NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

ubuntu@teuthology:/a/teuthology-2013-05-14_01:00:05-rados-next-testing-basic/12938$ cat config.yaml 
kernel: &id001
  kdb: true
  sha1: 4ebfb52255fd25a987154e0e8847a4155532f760
machine_type: plana
nuke-on-error: true
overrides:
  ceph:
    conf:
      global:
        ms inject socket failures: 500
      mon:
        debug mon: 20
        debug ms: 20
        debug paxos: 20
      osd:
        osd op thread timeout: 60
    fs: btrfs
    log-whitelist:
    - slow request
    sha1: 72bf5f4813c273210b5ced7f7793bc1bf813690c
  s3tests:
    branch: next
  workunit:
    sha1: 72bf5f4813c273210b5ced7f7793bc1bf813690c
roles:
- - mon.a
  - mon.b
  - mon.c
  - osd.0
  - osd.1
  - mds.0
  - client.0
targets:
  ubuntu@plana88.front.sepia.ceph.com: ssh-rsa AAAAB3NzaC1yc2EAAAADAQABAAABAQCn53/HM+LXJ+Lp7U6+zxGv4uzsKWyTsUR33DqdlN90Lh5mRIqohFq/oitlEvTuSywmu0Cfo6wOeMF7mRrsGCfeOHE7qPCDlyv99P+K96nS0EvRE/+hGhcQZjrJveGTk9IYgu83UXRjYXjVkPR1lJpBc8Kl+has7uL0vnxnNiZmkKFT+2X9tfLxrvV622eN2keXg4qLc7kRWbPw7Tevk2OV5/Lw591aIlIDQo7DhsVaM002YhFbIw591B4FogwqGtY3X43WcRD5DlyONKFxXSy7Pp1F9W6p0bNEMC2JeP1JRxQeKUgajGlsR2bxLHgFTGRJNygRlgW2gIMceFPrUNHv
tasks:
- internal.lock_machines:
  - 1
  - plana
- internal.save_config: null
- internal.check_lock: null
- internal.connect: null
- internal.check_conflict: null
- internal.check_ceph_data: null
- internal.vm_setup: null
- kernel: *id001
- internal.base: null
- internal.archive: null
- internal.coredump: null
- internal.syslog: null
- internal.timer: null
- chef: null
- clock.check: null
- install: null
- ceph: null
- mon_thrash:
    revive_delay: 20
    thrash_delay: 1
- ceph-fuse: null
- workunit:
    clients:
      all:
      - mon/workloadgen.sh
    env:
      DURATION: '600'
      LOADGEN_NUM_OSDS: '5'
      VERBOSE: '1'


Files

mdsmonitor_assert_mon.1.log (29.8 KB) mdsmonitor_assert_mon.1.log Florian Wiessner, 05/16/2013 03:25 AM
bug_5069_all_mon_logs.txt (226 KB) bug_5069_all_mon_logs.txt Florian Wiessner, 05/16/2013 03:25 AM
Actions

Also available in: Atom PDF