Project

General

Profile

Actions

Bug #17466

closed

MDSMonitor: non-existent standby_for_fscid not caught

Added by Greg Farnum over 7 years ago. Updated over 7 years ago.

Status:
Resolved
Priority:
Urgent
Assignee:
Category:
Correctness/Safety
Target version:
-
% Done:

0%

Source:
Community (user)
Tags:
Backport:
jewel
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS):
MDSMonitor
Labels (FS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

We've got it using aggregate initialization (that is, the explicity {a, b} syntax), but we're ordering the rank and fscid the way they are in the constructor — which is backwards from how they are declared in the struct. So it's swapped and if anything hits this code path it goes dramatically wrong.

I pushed a quick fix to wip-fixup-mds-standby-init that should resolve it by swapping them into the right sequence, but we probably want to make the struct declaration match the constructor order instead. Adam Tygart/m0zes__ ran into this bug and will be testing my fix.


Files

mdsmap (1022 Bytes) mdsmap Current mdsmap from the affected cluster Adam Tygart, 10/03/2016 03:18 PM
fsmap-903419 (1018 Bytes) fsmap-903419 FSMap from epoch 903419 Adam Tygart, 10/04/2016 07:36 PM
mds_addfailed.patch (2.1 KB) mds_addfailed.patch John Spray, 10/05/2016 01:23 PM

Related issues 1 (0 open1 closed)

Copied to CephFS - Backport #17557: jewel: MDSMonitor: non-existent standby_for_fscid not caughtResolvedJohn SprayActions
Actions

Also available in: Atom PDF