Project

General

Profile

Actions

Fix #6109

open

pg <pgid> mark_unfound_lost fails if a completely-gone OSD still in map

Added by Dan Mick over 10 years ago. Updated about 1 year ago.

Status:
New
Priority:
High
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
Development
Tags:
Backport:
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Monitor
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

cluster on mira045 et. al. had bad disk on osd.25; marked out, much data extracted, but for some
reason one pgid (2.1b7) wouldn't recover. osd.25 taken down; mark_unfound_lost revert tried to repair;
fails with

Error EINVAL: pg has 32 objects but we haven't probed all sources, not marking lost

apparently because the OSDmap still thinks osd.25 is a possible source, even though it's no longer
in crush and in fact has been "osd rm"ed.

Actions

Also available in: Atom PDF