Bug #44159

closed

[rbd-mirror] Mirror daemon never recovers from being blacklisted

Added by Oliver Freyermuth about 4 years ago. Updated about 3 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
luminous,mimic,nautilus
Regression:
No
Severity:
2 - major
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

I can reproduce this rather reliably as follows:
- Restarting many OSDs (old nodes with slow spinning disks, so the restart likely exceeds the default blacklist timeout).
- Restarting other RBD mirror daemons (we have 3); this only triggers it sometimes.

The attached log, recorded at log level 15, is extracted from one blacklisted RBD mirror daemon that was unable to recover.
RBD volume names and domains are sanitized; otherwise the log is untouched.


Files

ceph-client.rbd_mirror_backup.log.gz (479 KB), Oliver Freyermuth, 02/15/2020 05:18 PM

Related issues 3 (0 open, 3 closed)

Copied to rbd - Backport #44262: mimic: [rbd-mirror] Mirror daemon never recovers from being blacklisted (Resolved, Mykola Golub)
Copied to rbd - Backport #44263: nautilus: [rbd-mirror] Mirror daemon never recovers from being blacklisted (Resolved, Mykola Golub)
Copied to rbd - Backport #44264: luminous: [rbd-mirror] Mirror daemon never recovers from being blacklisted (Rejected, Mykola Golub)