Bug #50390

closed

mds: monclient: wait_auth_rotating timed out after 30

Added by Patrick Donnelly about 3 years ago. Updated almost 3 years ago.

Status: Resolved
Priority: High
Assignee:
Category: -
Target version:
% Done: 0%
Source: Q/A
Tags:
Backport: nautilus,octopus,pacific
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS):
Labels (FS): qa, qa-failure
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Symptom:

2021-04-15T15:54:59.586 INFO:teuthology.orchestra.run.smithi001.stdout:2021-04-15T15:41:41.849793+0000 mon.a (mon.0) 3465 : cluster [WRN] Replacing daemon mds.b as rank 0 with standby daemon mds.a

From: /ceph/teuthology-archive/pdonnell-2021-04-15_01:35:57-fs-wip-pdonnell-testing-20210414.230315-distro-basic-smithi/6047521/teuthology.log

The MDS was restarted between tests. Its prior incarnation was up:standby in the MDSMap, but after the restart the new instance got stuck on:

2021-04-15T15:41:41.824+0000 7fd1952a9780  0 monclient: wait_auth_rotating timed out after 30
2021-04-15T15:41:41.824+0000 7fd1952a9780 -1 mds.b unable to obtain rotating service keys; retrying

From: /ceph/teuthology-archive/pdonnell-2021-04-15_01:35:57-fs-wip-pdonnell-testing-20210414.230315-distro-basic-smithi/6047521/remote/smithi110/log/ceph-mds.b.log.gz
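To check whether other MDS logs show the same stall, the two log lines above can be matched mechanically. The following is only a minimal sketch: the regular expression assumes the line layout seen in this report (timestamp, thread id, debug level, message), and the names `TIMEOUT_RE` and `find_rotating_key_timeouts` are hypothetical helpers, not part of Ceph or teuthology.

```python
import re

# Assumed layout, taken from the excerpt in this report:
#   <timestamp> <thread id> <debug level> monclient: wait_auth_rotating timed out after <secs>
TIMEOUT_RE = re.compile(
    r"^(?P<ts>\S+)\s+(?P<thread>[0-9a-f]+)\s+-?\d+\s+monclient: "
    r"wait_auth_rotating timed out after (?P<secs>\d+)"
)


def find_rotating_key_timeouts(lines):
    """Return (timestamp, timeout_seconds) for each wait_auth_rotating timeout line."""
    hits = []
    for line in lines:
        match = TIMEOUT_RE.match(line)
        if match:
            hits.append((match.group("ts"), int(match.group("secs"))))
    return hits


# The two lines quoted in this report; only the first is a monclient timeout.
sample = [
    "2021-04-15T15:41:41.824+0000 7fd1952a9780  0 monclient: wait_auth_rotating timed out after 30",
    "2021-04-15T15:41:41.824+0000 7fd1952a9780 -1 mds.b unable to obtain rotating service keys; retrying",
]
timeouts = find_rotating_key_timeouts(sample)
```

For a compressed log such as ceph-mds.b.log.gz, the same function can be fed from `gzip.open(path, "rt")`, since it only iterates over text lines.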

As a result, the restarted MDS never got to send its up:boot message to the mons at startup, which would have replaced its old instance (still in up:standby). The warning follows because the mons had assigned that old instance to be up:creating but never heard back from it.
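The ordering problem described here can be illustrated with a toy model. This is not the MDS startup code: `start_daemon`, its callbacks, and the `max_attempts` parameter are all hypothetical and only show that a boot message gated on key retrieval is never sent while every wait times out.

```python
def start_daemon(wait_auth_rotating, send_boot, max_attempts=3, timeout=30):
    """Toy model: the boot message is sent only after rotating keys are obtained.

    wait_auth_rotating(timeout) -> bool and send_boot() are hypothetical callbacks.
    """
    for _ in range(max_attempts):
        if wait_auth_rotating(timeout):
            send_boot()  # stands in for the up:boot message to the mons
            return True
        # Corresponds to "unable to obtain rotating service keys; retrying".
    return False


# Every wait times out, so the boot message is never sent.
sent_stuck = []
ok_stuck = start_daemon(lambda timeout: False, lambda: sent_stuck.append("up:boot"))

# Keys arrive on the first wait, so the boot message goes out once.
sent_ok = []
ok_ok = start_daemon(lambda timeout: True, lambda: sent_ok.append("up:boot"))
```

In the stuck case the mons keep seeing only the old up:standby instance, which matches the symptom reported above.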


Related issues 5 (1 open, 4 closed)

Related to RADOS - Bug #50775: mds and osd unable to obtain rotating service keys (Fix Under Review)
Has duplicate CephFS - Bug #50755: mds restart but unable to obtain rotating service keys (Duplicate)
Copied to CephFS - Backport #50897: nautilus: mds: monclient: wait_auth_rotating timed out after 30 (Resolved, Ilya Dryomov)
Copied to CephFS - Backport #50898: octopus: mds: monclient: wait_auth_rotating timed out after 30 (Resolved, Ilya Dryomov)
Copied to CephFS - Backport #50899: pacific: mds: monclient: wait_auth_rotating timed out after 30 (Resolved, Ilya Dryomov)
