Project

General

Profile

Actions

Bug #23460

closed

mgr deadlock: _check_auth_rotating possible clock skew, rotating keys expired way too early

Added by Wido den Hollander about 6 years ago. Updated over 3 years ago.

Status:
Resolved
Priority:
High
Assignee:
-
Category:
ceph-mgr
Target version:
-
% Done:

0%

Source:
Tags:
cephx,mgr,monclient
Backport:
luminous, mimic
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

On Luminous v12.2.2 and v12.2.4 clusters running either CentOS or Ubuntu I've seen many Manager going offline with these messages in their logs:

2018-03-26 05:32:50.976826 7f596859d700 -1 monclient: _check_auth_rotating possible clock skew, rotating keys expired way too early (before 2018-03-26 04:32:50.976818)
2018-03-26 05:33:00.977091 7f596859d700 -1 monclient: _check_auth_rotating possible clock skew, rotating keys expired way too early (before 2018-03-26 04:33:00.977083)
2018-03-26 05:33:10.977244 7f596859d700 -1 monclient: _check_auth_rotating possible clock skew, rotating keys expired way too early (before 2018-03-26 04:33:10.977238)
2018-03-26 05:33:20.977398 7f596859d700 -1 monclient: _check_auth_rotating possible clock skew, rotating keys expired way too early (before 2018-03-26 04:33:20.977392)
2018-03-26 05:33:30.977584 7f596859d700 -1 monclient: _check_auth_rotating possible clock skew, rotating keys expired way too early (before 2018-03-26 04:33:30.977577)
Mar 23 09:18:22 mon01 ceph-mgr[2324150]: 2018-03-23 09:18:22.451311 7fb9e8ac7700 -1 monclient: _check_auth_rotating possible clock skew, rotating keys expired way too early (before 2018-03-23 8:18:22.451287)

The first thing to check is if the time is correct on these systems, but it is, they are all in sync thanks to NTP.

No clock drift warnings or such, the Mgr just goes down with these messages.


Files

gdb.txt (69 KB) gdb.txt Wido den Hollander, 04/05/2018 06:27 AM

Related issues 4 (1 open3 closed)

Related to mgr - Feature #23574: Add a HeartbeatMap to ceph-mgr (die on deadlocks)New04/06/2018

Actions
Has duplicate mgr - Bug #36266: mgr: deadlock in ClusterStateDuplicate09/30/2018

Actions
Copied to mgr - Backport #38318: luminous: mgr deadlock: _check_auth_rotating possible clock skew, rotating keys expired way too earlyResolvedNathan CutlerActions
Copied to mgr - Backport #38319: mimic: mgr deadlock: _check_auth_rotating possible clock skew, rotating keys expired way too earlyResolvedNathan CutlerActions
Actions

Also available in: Atom PDF