Project

General

Profile

Actions

Feature #19657

closed

An EIO from a single device should not be a client-visible failure.

Added by David Zafman about 7 years ago. Updated almost 7 years ago.

Status:
Resolved
Priority:
Urgent
Assignee:
David Zafman
Category:
-
Target version:
% Done:

0%

Source:
Tags:
Backport:
Reviewed:
Affected Versions:
Pull request ID:

Description

Sage:
the underlying goal: any EIO from a single device should not be a client-visible failure.
(the same way an OSD failure shouldn't be visible to client workloads)

we get this by enabling the eio on read config option, but the goal would be to do better than that: trigger a repair on an object that gives EIO on read, just like we do during scrub.
(well, during repair)

or, if we don't repair, work around it by reading from a replica that doesn't return eio.
(which might make more sense since repair after scrub pbly won't be driven directly by the osd in the future?)

davidz
ok. yes… eventually repair is moving out of osd and we won't be using recovery to do it (hopefully). We could use that mechanism for now I guess.
Or as you say may we just read from another replica


Related issues 1 (0 open1 closed)

Has duplicate RADOS - Bug #20863: CRC error does not mark PG as inconsistent or queue for repairDuplicate07/31/2017

Actions
Actions #1

Updated by David Zafman about 7 years ago

  • Priority changed from Normal to Urgent
Actions #2

Updated by David Zafman about 7 years ago

  • Target version set to v12.1.0
Actions #3

Updated by David Zafman almost 7 years ago

  • Status changed from New to In Progress
Actions #5

Updated by David Zafman almost 7 years ago

  • Status changed from In Progress to Resolved
Actions #6

Updated by David Zafman over 6 years ago

  • Has duplicate Bug #20863: CRC error does not mark PG as inconsistent or queue for repair added
Actions

Also available in: Atom PDF