Project

General

Profile

Actions

Bug #3587

closed

mon: election doesn't finish during heavy mon thrashing

Added by Joao Eduardo Luis over 11 years ago. Updated over 11 years ago.

Status:
Resolved
Priority:
High
Category:
Monitor
Target version:
% Done:

0%

Source:
Development
Tags:
Backport:
Regression:
Severity:
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

While trying to trigger #3495 using

$ while [ 1 ]; do ./init-ceph restart mon.a ; sleep 30 ; done
$ while [ 1 ]; do ./init-ceph restart mds.a ; sleep 2 ; done
$ while [ 1 ]; do ./init-ceph restart osd.1 ; sleep 2 ; done

At a certain point in time, mon.a got stuck electing (which was noticed after canceling its restart loop). My suspicion is that it happened after #3495 was triggered on mon.b during, or right before, an election cycle.

I've attached both mon.a's and mon.b's logs; mon.b's log does have the stack trace from #3495, but might be useful to further inquire what has happened in case its failure had anything to do with the infinite election cycle.


Files

mon.a.log (6.25 MB) mon.a.log Joao Eduardo Luis, 12/07/2012 04:19 AM
mon.b.log (4.26 MB) mon.b.log Joao Eduardo Luis, 12/07/2012 04:19 AM
mon.c.log (4.42 MB) mon.c.log Joao Eduardo Luis, 12/07/2012 05:18 AM
Actions

Also available in: Atom PDF