Project

General

Profile

Actions

Bug #40112

closed

mon: rados/multimon tests fail with clock skew

Added by Sage Weil almost 5 years ago. Updated almost 4 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
nautilus, mimic
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

See

http://pulpito.ceph.com/sage-2019-05-30_21:14:09-rados:multimon-master-distro-basic-smithi/

or

http://pulpito.ceph.com/sage-2019-06-03_14:22:18-rados:multimon-master-distro-basic-smithi/
http://pulpito.ceph.com/sage-2019-06-03_14:24:56-rados:multimon-nautilus-distro-basic-smithi/

The second variant, where we don't see the SKEW failure, appears to be caused because the ceph setup command to set the crush tunables takes foreeever due to no quorum, but eventually the clocks correct themselves and then the job proceeds (probably NTP)?

The underlying problem appears to be that mons can't form quorum if there is too much skew, which prevents a meaningful test.

whenever mon.b is the rank 0 mon.


Related issues 3 (0 open3 closed)

Has duplicate RADOS - Bug #38893: RuntimeError: expected MON_CLOCK_SKEW but got noneResolved03/22/2019

Actions
Copied to RADOS - Backport #40228: nautilus: mon: rados/multimon tests fail with clock skewResolvedNathan CutlerActions
Copied to RADOS - Backport #44908: mimic: mon: rados/multimon tests fail with clock skewResolvedNathan CutlerActions
Actions #1

Updated by Sage Weil almost 5 years ago

  • Status changed from 12 to Pending Backport
  • Backport set to nautilus
Actions #2

Updated by Nathan Cutler almost 5 years ago

  • Copied to Backport #40228: nautilus: mon: rados/multimon tests fail with clock skew added
Actions #3

Updated by Nathan Cutler over 4 years ago

  • Pull request ID set to 28353
Actions #4

Updated by Nathan Cutler over 4 years ago

  • Has duplicate Bug #38893: RuntimeError: expected MON_CLOCK_SKEW but got none added
Actions #5

Updated by Nathan Cutler over 4 years ago

  • Status changed from Pending Backport to Resolved

While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are in status "Resolved".

Actions #6

Updated by Nathan Cutler about 4 years ago

  • Backport changed from nautilus to nautilus, mimic
Actions #7

Updated by Nathan Cutler about 4 years ago

  • Status changed from Resolved to Pending Backport
Actions #8

Updated by Nathan Cutler about 4 years ago

  • Copied to Backport #44908: mimic: mon: rados/multimon tests fail with clock skew added
Actions #9

Updated by Nathan Cutler almost 4 years ago

  • Status changed from Pending Backport to Resolved

While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are in status "Resolved" or "Rejected".

Actions

Also available in: Atom PDF