Project

General

Profile

Actions

Bug #64863

closed

rados/thrash-old-clients: Health detail: HEALTH_WARN 1/3 mons down, quorum a,c in cluster log

Added by Sridhar Seshasayee 2 months ago. Updated 3 days ago.

Status:
Resolved
Priority:
Normal
Assignee:
Category:
-
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
rados
Component(RADOS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

The following tests in the rados suite failed with the warning:

/a/yuriw-2024-03-08_16:20:46-rados-wip-yuri4-testing-2024-03-05-0854-distro-default-smithi/7587298
/a/yuriw-2024-03-08_16:20:46-rados-wip-yuri4-testing-2024-03-05-0854-distro-default-smithi/7587528
/a/yuriw-2024-03-08_16:20:46-rados-wip-yuri4-testing-2024-03-05-0854-distro-default-smithi/7587807
/a/yuriw-2024-03-08_16:20:46-rados-wip-yuri4-testing-2024-03-05-0854-distro-default-smithi/7587883

All the tests above add "MON_DOWN" to the ignore list as it's expected. In addition to the health
warning, the health detail is also logged by all the tests shown below:

"cluster [WRN] Health detail: HEALTH_WARN 1/3 mons down, quorum a,b" in cluster log

All the tests failed due to the above warning not present in the ignorelist.

Therefore, this tracker may be used to track the addition of "mons down" warning
as well to the ignore list.

Actions #1

Updated by Sridhar Seshasayee 2 months ago

  • Description updated (diff)
Actions #2

Updated by Sridhar Seshasayee 2 months ago

  • Translation missing: en.field_tag_list set to test-failure
  • Tags deleted (test-failure)
Actions #3

Updated by Radoslaw Zarzynski 2 months ago

  • Assignee set to Laura Flores

Hmm, I think I saw Laura's PR for MON_DOWN.

Actions #4

Updated by Laura Flores about 1 month ago

  • Status changed from New to Resolved
  • Pull request ID set to 56619

https://github.com/ceph/ceph/pull/56619

Radoslaw Zarzynski wrote in #note-3:

Hmm, I think I saw Laura's PR for MON_DOWN.

I attached the PR. We can reopen this if there are other occurrences that this PR didn't fix.

Actions #5

Updated by Sridhar Seshasayee 3 days ago ยท Edited

Seen on Squid:

/a/yuriw-2024-05-14_00:32:08-rados-wip-yuri4-testing-2024-04-29-0642-distro-default-smithi/7705403
/a/yuriw-2024-05-14_00:32:08-rados-wip-yuri4-testing-2024-04-29-0642-distro-default-smithi/7705410
/a/yuriw-2024-05-14_00:32:08-rados-wip-yuri4-testing-2024-04-29-0642-distro-default-smithi/7705417
/a/yuriw-2024-05-14_00:32:08-rados-wip-yuri4-testing-2024-04-29-0642-distro-default-smithi/7705423
/a/yuriw-2024-05-14_00:32:08-rados-wip-yuri4-testing-2024-04-29-0642-distro-default-smithi/7705447

Needs back-porting to Squid. Need to check if the fix is needed for Reef too.

Actions

Also available in: Atom PDF