Bug #23371

open

OSDs flap when the cluster network is taken down

Added by Nokia ceph-users about 6 years ago. Updated almost 6 years ago.

Status:
New
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:
0%

Source:
Tags:
Backport:
Regression:
No
Severity:
2 - major
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

We have a 5-node cluster with 5 mons and 120 OSDs, distributed equally across the nodes.

As part of our resiliency testing, we took down the cluster network on one node. The OSDs on that node are not marked down immediately; instead, they flap: OSDs that have been marked down boot back up. It takes a very long time for all of that node's OSDs to go down, and during this entire period Ceph is unable to complete any writes.
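To put a number on the flapping described above, the following is a minimal sketch that counts up/down transitions per OSD across a series of state snapshots. The function name `count_flaps` and the snapshot shape (a dict mapping OSD id to an "up" boolean) are assumptions made for illustration; in practice the snapshots might be collected by polling the cluster's OSD map during the test, which is not shown here.

```python
from collections import defaultdict


def count_flaps(samples):
    """Count state transitions (up->down or down->up) per OSD.

    `samples` is a time-ordered list of snapshots; each snapshot is a
    dict mapping an OSD id to True (up) or False (down). This shape is
    a hypothetical one chosen for the sketch.
    """
    transitions = defaultdict(int)
    previous = {}
    for sample in samples:
        for osd_id, is_up in sample.items():
            # A transition is any change relative to the last known state.
            if osd_id in previous and previous[osd_id] != is_up:
                transitions[osd_id] += 1
            previous[osd_id] = is_up
    return dict(transitions)


# Illustrative data only, not taken from the affected cluster.
samples = [
    {0: True, 1: True},
    {0: False, 1: True},
    {0: True, 1: True},
    {0: False, 1: True},
]
print(count_flaps(samples))  # {0: 3}
```

An OSD that goes down once and stays down would show a count of 1; a higher count for the OSDs on the isolated node would match the flapping behaviour reported here.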

We see this issue only in Luminous.

Our ceph.conf is attached.


Files

ceph.conf (3.01 KB), Nokia ceph-users, 03/15/2018 05:20 AM