Project

General

Profile

Actions

Bug #19103

closed

cephfs: Out of space handling

Added by David Zafman about 7 years ago. Updated about 7 years ago.

Status:
Won't Fix
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS):
Labels (FS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Cephfs needs to be more careful on a cluster with almost full OSDs. There is a delay in OSDs reporting stats, a MON marking CEPH_OSDMAP_FULL based on mon_osd_full_ratio and the MDS getting the map. The current default for mon_osd_full_ratio (95%) may or may not be sufficient to prevent running an OSD filesystem out of space before the full flag is noticed.

Currently, there is no code in MDS that checks CEPH_OSDMAP_NEARFULL. Starting to do sync writes alone at nearfull won't work because OSDs never return ENOSPC to MDS. We'd also need a flag to indicate to the OSD that MDS wants to get ENOSPC (if osd_failsafe_full_ratio has been reached) which it MDS set on file data writes only.

Actions

Also available in: Atom PDF