Actions
Bug #24533
closedPurgeQueue sometimes ignores Journaler errors
Status:
Resolved
Priority:
Normal
Assignee:
Category:
Correctness/Safety
Target version:
% Done:
0%
Source:
Development
Tags:
Backport:
jewel,luminous,mimic
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS):
MDS
Labels (FS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):
Description
We check journaler.get_error() in PurgeQueue::_recover, but never later in _consume -- if something like a decode error happens, the MDS may silently stop progressing the queue.
Ticket inspired by thread "[ceph-users] MDS: journaler.pq decode error"
Actions