Bug #36268

closed

Unable to recover from ENOSPC in BlueFS

Added by Igor Fedotov over 5 years ago. Updated almost 4 years ago.

Status: Resolved
Priority: Normal
Assignee:
Target version: -
% Done: 0%
Source:
Tags:
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Under heavy load with a full DB volume, BlueStore can end up in a state where it has no additional space to give to BlueFS, even though space is still available on the block device.
This is caused by the "lazy" behavior of free-space rebalancing: it runs periodically in the background rather than on demand.
On the first allocation failure the OSD asserts. It is then unable to restart, because log replay during BlueFS open needs space as well, but the rebalance still has not been executed.
The OSD then asserts again, which leaves it in a kind of unrecoverable deadlock.
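
The failure loop described above can be illustrated with a small toy model. This is only a sketch and not Ceph code: the class `ToyStore`, its methods, the `on_demand` switch and all numbers are hypothetical. It assumes that startup replay must allocate before any periodic rebalance gets a chance to run. The on-demand variant is included only as a contrast and is not claimed to be the fix that was eventually applied.

```python
import errno


class ToyStore:
    """Toy model: a block device that lends free space to an embedded FS."""

    def __init__(self, block_free, fs_free, on_demand):
        self.block_free = block_free  # free units on the block device
        self.fs_free = fs_free        # free units currently granted to the FS
        self.on_demand = on_demand    # hypothetical: rebalance when an allocation falls short

    def rebalance(self, want):
        """Move up to `want` units from the block device to the FS."""
        moved = min(want, self.block_free)
        self.block_free -= moved
        self.fs_free += moved
        return moved

    def fs_allocate(self, need):
        """Allocate `need` units for the FS, or raise ENOSPC."""
        if self.fs_free < need and self.on_demand:
            self.rebalance(need - self.fs_free)
        if self.fs_free < need:
            raise OSError(errno.ENOSPC, "no space left for FS")
        self.fs_free -= need

    def start(self, replay_need):
        """Startup: log replay allocates first (assumption of this model)."""
        self.fs_allocate(replay_need)
        # A periodic background rebalance would only be scheduled from here on.
        return True


def try_restarts(store, replay_need, attempts=3):
    """Count failed startup attempts; stop at the first success."""
    failures = 0
    for _ in range(attempts):
        try:
            store.start(replay_need)
            break
        except OSError as e:
            if e.errno != errno.ENOSPC:
                raise
            failures += 1
    return failures


lazy = ToyStore(block_free=100, fs_free=0, on_demand=False)
eager = ToyStore(block_free=100, fs_free=0, on_demand=True)
print(try_restarts(lazy, replay_need=4))   # prints 3: every restart hits ENOSPC
print(try_restarts(eager, replay_need=4))  # prints 0: space is moved when needed
```

In the lazy case the block device still has 100 free units, yet every restart fails, because nothing moves space to the FS before replay asks for it.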


Related issues 2 (0 open, 2 closed)

Copied to bluestore - Backport #36640: luminous: Unable to recover from ENOSPC in BlueFS (Rejected)
Copied to bluestore - Backport #36641: mimic: Unable to recover from ENOSPC in BlueFS (Rejected, Igor Fedotov)