Bug #19700

OSD remained up despite cluster network being inactive?

Added by Patrick McLean about 7 years ago. Updated almost 4 years ago.

Status: Closed
Priority: High
Assignee: -
Category: -
Target version: -
% Done: 0%
Source:
Tags:
Backport:
Regression: No
Severity: 2 - major
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS): Monitor, OSD
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

We have a Ceph cluster with a segregated cluster network that the OSDs use to communicate with each other, and a "public" network that clients use to talk to the cluster. The monitors are on the "public" network, and the OSDs talk to the monitors through that interface. We had an issue where the private (cluster) network interface went down on one of our OSD nodes, but the cluster kept reporting everything as normal even though the OSDs on that node couldn't talk to any other OSDs. The monitor reported the cluster as healthy, and 'ceph osd tree' showed the OSDs on the affected node as up.

We have 12 OSDs per node and 6 nodes in the cluster (72 OSDs in total).
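For anyone trying to reproduce the observation, here is a minimal sketch of how the monitor's view of "up" OSDs per host could be collected from 'ceph osd tree'. It is an illustration, not part of the original report. The helper names `up_osds_by_host` and `fetch_osd_tree` are hypothetical, and the JSON field names (`nodes`, `type`, `children`, `status`) are assumed from the output of `ceph osd tree --format json`, so they may differ between Ceph versions. Note that this only shows what the monitor believes; it says nothing about whether an OSD is actually reachable over the cluster network, which is the discrepancy described above.

```python
import json
import subprocess


def up_osds_by_host(tree):
    """Map each host bucket name to the OSD names the monitor reports as up.

    `tree` is the parsed JSON of `ceph osd tree --format json`
    (field names are an assumption and may vary by Ceph version).
    """
    nodes = {node["id"]: node for node in tree.get("nodes", [])}
    result = {}
    for node in nodes.values():
        if node.get("type") != "host":
            continue
        # Resolve the host's child IDs to their node entries.
        children = [nodes[c] for c in node.get("children", []) if c in nodes]
        result[node["name"]] = [
            child["name"]
            for child in children
            if child.get("type") == "osd" and child.get("status") == "up"
        ]
    return result


def fetch_osd_tree():
    """Run the ceph CLI and parse its JSON output (needs a working client keyring)."""
    out = subprocess.run(
        ["ceph", "osd", "tree", "--format", "json"],
        check=True, capture_output=True, text=True,
    ).stdout
    return json.loads(out)
```

On a cluster, `up_osds_by_host(fetch_osd_tree())` would list the OSDs per host that the monitor considers up. In the situation reported here, the affected node would presumably still show all 12 of its OSDs, so a separate check of cluster-network connectivity is needed to spot the failure.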
