Bug #57852

open

osd: unhealthy osd cannot be marked down in time

Added by wencong wan over 1 year ago. Updated about 1 year ago.

Status:
Need More Info
Priority:
Normal
Assignee:
Category:
-
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
OSD
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Before an unhealthy OSD is marked down by the mon, other OSDs may choose it as
a heartbeat peer and then report an incorrect failure time (first_tx) to the mon.
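
The effect described above can be illustrated with a toy model. This is only a sketch under stated assumptions, not Ceph code: the names PeerState, failed_since and mon_marks_down are hypothetical, and the rule that the mon compares the latest reported failure start against the grace period is an assumption made for illustration. The 20-second value matches the default of osd_heartbeat_grace.

```python
from dataclasses import dataclass
from typing import List, Optional

GRACE = 20.0  # seconds; same value as the default osd_heartbeat_grace


@dataclass
class PeerState:
    """Hypothetical per-peer heartbeat state kept by a reporting OSD."""
    first_tx: float                  # time of the first ping sent to this peer
    last_rx: Optional[float] = None  # time of the last reply, None if never


def failed_since(peer: PeerState) -> float:
    """Failure start a reporter would claim for this peer.

    Assumption: with no reply ever received, the reporter can only
    fall back to first_tx, i.e. the moment it picked the peer.
    """
    return peer.first_tx if peer.last_rx is None else peer.last_rx


def mon_marks_down(reports: List[float], now: float,
                   grace: float = GRACE) -> bool:
    """Toy mon rule (assumed): use the latest claimed failure start."""
    return now - max(reports) >= grace


# The target OSD stops answering at t=0.
old_peer = PeerState(first_tx=-100.0, last_rx=0.0)  # was a peer before t=0
new_peer = PeerState(first_tx=15.0)                 # picked the target at t=15

reports_old = [failed_since(old_peer)]
reports_both = [failed_since(old_peer), failed_since(new_peer)]

print(mon_marks_down(reports_old, now=25.0))   # True: 25 s without a reply
print(mon_marks_down(reports_both, now=25.0))  # False: late first_tx resets the clock
print(mon_marks_down(reports_both, now=35.0))  # True: only 20 s after the late first_tx
```

In this model, every OSD that newly selects the unhealthy OSD as a heartbeat peer pushes the claimed failure start forward, so repeated peer selection could keep the OSD in the up state well past the grace period.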

To reproduce:
Shut down the cluster_network and public_network of an OSD node several times.


Files

p1.png (63.1 KB): ifdown net at 13:10. wencong wan, 10/12/2022 02:13 AM
p2.png (246 KB): after 10 minutes, the unhealthy OSD is still in the up state. wencong wan, 10/12/2022 02:15 AM