Project

General

Profile

Bug #10762

mon: osd gets marked down twice

Added by Sage Weil about 9 years ago. Updated about 9 years ago.

Status:
Resolved
Priority:
Urgent
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
Q/A
Tags:
Backport:
giant,firefly
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

- osd has intermittnet network issue
- gets marked down
- network fixed
- osd comes back up
- mon election (or something loady on the mons)
- osd marked down again
- osd comes back

The bug is that hte preprocess_failure checks for is_down() but doesn't verify that the reproting epoch is after get_up_from, leading to dups.

Associated revisions

Revision 400ac237 (diff)
Added by Sage Weil about 9 years ago

mon: ignore osd failures from before up_from

If the failure was generated for an instance of the OSD prior to when
it came up, ignore it.

This probably causes a fair bit of unnecessary flapping in the wild...

Backport: giant, firefly
Fixes: #10762
Reported-by: Dan van der Ster <>
Signed-off-by: Sage Weil <>

Revision 3782b8b3 (diff)
Added by Sage Weil about 9 years ago

mon: ignore osd failures from before up_from

If the failure was generated for an instance of the OSD prior to when
it came up, ignore it.

This probably causes a fair bit of unnecessary flapping in the wild...

Backport: giant, firefly
Fixes: #10762
Reported-by: Dan van der Ster <>
Signed-off-by: Sage Weil <>
(cherry picked from commit 400ac237d35d0d1d53f240fea87e8483c0e2a7f5)

Revision e7af52a1 (diff)
Added by Sage Weil about 9 years ago

mon: ignore osd failures from before up_from

If the failure was generated for an instance of the OSD prior to when
it came up, ignore it.

This probably causes a fair bit of unnecessary flapping in the wild...

Backport: giant, firefly
Fixes: #10762
Reported-by: Dan van der Ster <>
Signed-off-by: Sage Weil <>
(cherry picked from commit 400ac237d35d0d1d53f240fea87e8483c0e2a7f5)

History

#2 Updated by Sage Weil about 9 years ago

  • Status changed from New to Fix Under Review

#3 Updated by Sage Weil about 9 years ago

  • Status changed from Fix Under Review to Pending Backport

#7 Updated by Loïc Dachary about 9 years ago

e7af52a mon: ignore osd failures from before up_from (in giant), 3782b8b mon: ignore osd failures from before up_from (in firefly),

#8 Updated by Loïc Dachary about 9 years ago

  • Status changed from Pending Backport to Resolved

Also available in: Atom PDF