Project

General

Profile

Actions

Bug #52467

open

mgr/dashboard: when more than N/2 monitors go down, the Dashboard mon report is incorrect

Added by Ernesto Puerta over 2 years ago. Updated over 2 years ago.

Status:
In Progress
Priority:
Normal
Category:
Component - Cluster
Target version:
-
% Done:

0%

Source:
Q/A
Tags:
Backport:
pacific
Regression:
No
Severity:
2 - major
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Description of problem

when more than N/2+1 monitors go down, the Dashboard mon report is incorrect

Environment

Reproduced in Nautilus-Pacific.

How reproducible

Steps:

  1. Launch 3-mon (a, b, c) vstart cluster
  2. Kill 1 mon (b). Dashboard reports only 2 mons in quorum (a, c)
  3. Kill another mon (c)

Actual results

Dashboard keeps reporting (in Landing page and in cluster->mon page) 2 mons in quorum (a, c).

Expected results

Dashboard reports should report no quorum and only 1 mon (a) out of quorum, while the others are simply unavailable.

Actions #1

Updated by Ernesto Puerta over 2 years ago

  • Subject changed from mgr/dashboard: when more than N/2+1 monitors go down, the Dashboard mon report is incorrect to mgr/dashboard: when more than N/2 monitors go down, the Dashboard mon report is incorrect
Actions #2

Updated by Ernesto Puerta over 2 years ago

  • Priority changed from High to Normal
Actions #3

Updated by Ernesto Puerta over 2 years ago

  • Assignee changed from Ernesto Puerta to Aashish Sharma
Actions #4

Updated by Ernesto Puerta over 2 years ago

  • Status changed from New to Triaged
Actions #5

Updated by Loïc Dachary over 2 years ago

  • Target version deleted (v14.2.23)
Actions #6

Updated by Pere Díaz Bou over 2 years ago

  • Assignee changed from Aashish Sharma to Pere Díaz Bou
  • Pull request ID set to 44026
Actions #7

Updated by Ernesto Puerta over 2 years ago

As discussed in today's CDM, Neha suggested exploring the use the beacon approach (heartbeat-like) to detect the out-of-quorum situation. RADOS team will check and come up with a proposal.

Actions #8

Updated by Ernesto Puerta over 2 years ago

  • Status changed from Triaged to In Progress
  • Backport changed from pacific,octopus,nautilus to pacific
Actions

Also available in: Atom PDF