Actions
Bug #5069
closedmonitor crashed during mon thrash in nightlies
% Done:
0%
Source:
Q/A
Tags:
Backport:
Regression:
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):
Description
logs: ubuntu@teuthology:/a/teuthology-2013-05-14_01:00:05-rados-next-testing-basic/12938
2013-05-14T01:28:35.267 INFO:teuthology.task.workunit.client.0.err:2013-05-14 01:28:56.156297 7f64397fa700 0 monclient: hunting for new mon 2013-05-14T01:29:02.589 INFO:teuthology.task.ceph.mon.a.err:mon/MDSMonitor.cc: In function 'virtual void MDSMonitor::update_from_paxos()' thread 7fde038cb700 time 2013-05-14 01:29:23.476858 2013-05-14T01:29:02.589 INFO:teuthology.task.ceph.mon.a.err:mon/MDSMonitor.cc: 88: FAILED assert(version >= mdsmap.epoch) 2013-05-14T01:29:02.589 INFO:teuthology.task.ceph.mon.a.err: ceph version 0.61-277-gc5deb5d (c5deb5dbe2a1e7ada1cab778da41ddb30d9c880b) 2013-05-14T01:29:02.589 INFO:teuthology.task.ceph.mon.a.err: 1: (MDSMonitor::update_from_paxos()+0x80d) [0x52d84d] 2013-05-14T01:29:02.590 INFO:teuthology.task.ceph.mon.a.err: 2: (PaxosService::_active()+0x28c) [0x4f40bc] 2013-05-14T01:29:02.590 INFO:teuthology.task.ceph.mon.a.err: 3: (Context::complete(int)+0xa) [0x4cafda] 2013-05-14T01:29:02.590 INFO:teuthology.task.ceph.mon.a.err: 4: (finish_contexts(CephContext*, std::list<Context*, std::allocator<Context*> >&, int)+0x95) [0x4cd7f5] 2013-05-14T01:29:02.590 INFO:teuthology.task.ceph.mon.a.err: 5: (Paxos::handle_last(MMonPaxos*)+0xc72) [0x4eda22] 2013-05-14T01:29:02.590 INFO:teuthology.task.ceph.mon.a.err: 6: (Paxos::dispatch(PaxosServiceMessage*)+0x26b) [0x4ede7b] 2013-05-14T01:29:02.591 INFO:teuthology.task.ceph.mon.a.err: 7: (Monitor::_ms_dispatch(Message*)+0x10ac) [0x4c30ac] 2013-05-14T01:29:02.591 INFO:teuthology.task.ceph.mon.a.err: 8: (Monitor::ms_dispatch(Message*)+0x32) [0x4dcc62] 2013-05-14T01:29:02.591 INFO:teuthology.task.ceph.mon.a.err: 9: (DispatchQueue::entry()+0x3f1) [0x6b4ca1] 2013-05-14T01:29:02.591 INFO:teuthology.task.ceph.mon.a.err: 10: (DispatchQueue::DispatchThread::entry()+0xd) [0x64263d] 2013-05-14T01:29:02.591 INFO:teuthology.task.ceph.mon.a.err: 11: (()+0x7e9a) [0x7fde0862ee9a] 2013-05-14T01:29:02.591 INFO:teuthology.task.ceph.mon.a.err: 12: (clone()+0x6d) [0x7fde06bdeccd] 2013-05-14T01:29:02.592 INFO:teuthology.task.ceph.mon.a.err: NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this. ubuntu@teuthology:/a/teuthology-2013-05-14_01:00:05-rados-next-testing-basic/12938$ cat config.yaml kernel: &id001 kdb: true sha1: 4ebfb52255fd25a987154e0e8847a4155532f760 machine_type: plana nuke-on-error: true overrides: ceph: conf: global: ms inject socket failures: 500 mon: debug mon: 20 debug ms: 20 debug paxos: 20 osd: osd op thread timeout: 60 fs: btrfs log-whitelist: - slow request sha1: 72bf5f4813c273210b5ced7f7793bc1bf813690c s3tests: branch: next workunit: sha1: 72bf5f4813c273210b5ced7f7793bc1bf813690c roles: - - mon.a - mon.b - mon.c - osd.0 - osd.1 - mds.0 - client.0 targets: ubuntu@plana88.front.sepia.ceph.com: ssh-rsa AAAAB3NzaC1yc2EAAAADAQABAAABAQCn53/HM+LXJ+Lp7U6+zxGv4uzsKWyTsUR33DqdlN90Lh5mRIqohFq/oitlEvTuSywmu0Cfo6wOeMF7mRrsGCfeOHE7qPCDlyv99P+K96nS0EvRE/+hGhcQZjrJveGTk9IYgu83UXRjYXjVkPR1lJpBc8Kl+has7uL0vnxnNiZmkKFT+2X9tfLxrvV622eN2keXg4qLc7kRWbPw7Tevk2OV5/Lw591aIlIDQo7DhsVaM002YhFbIw591B4FogwqGtY3X43WcRD5DlyONKFxXSy7Pp1F9W6p0bNEMC2JeP1JRxQeKUgajGlsR2bxLHgFTGRJNygRlgW2gIMceFPrUNHv tasks: - internal.lock_machines: - 1 - plana - internal.save_config: null - internal.check_lock: null - internal.connect: null - internal.check_conflict: null - internal.check_ceph_data: null - internal.vm_setup: null - kernel: *id001 - internal.base: null - internal.archive: null - internal.coredump: null - internal.syslog: null - internal.timer: null - chef: null - clock.check: null - install: null - ceph: null - mon_thrash: revive_delay: 20 thrash_delay: 1 - ceph-fuse: null - workunit: clients: all: - mon/workloadgen.sh env: DURATION: '600' LOADGEN_NUM_OSDS: '5' VERBOSE: '1'
Files
Actions