Fix #6059: osd: block reads while repgather is writing across replicas - Ceph - Ceph

Actions

Copy link

Fix #6059

closed

osd: block reads while repgather is writing across replicas

Added by Sage Weil over 10 years ago. Updated over 10 years ago.

Status:

Resolved

Priority:

High

Assignee:

Samuel Just

Category:

OSD

Target version:

v0.71

% Done:

Source:

Development

Tags:

Backport:

Reviewed:

Affected Versions:

ceph-qa-suite:

Pull request ID:

Crash signature (v1):

Crash signature (v2):

Description

Currently we use the ondisk_write/read locks to do mutual exclusion over the local filestore which avoids reading data that is being modified. We also need to block reads while we are waiting for replcias to commit. Otherwise object state can appear to go back in time.

- start a write locally, and on remotes
- finish local write
- another op reads (new) locally written value
- all osds crash
- replica osds restart, but old primary does not
- they recover
- client reads prior object value

we do not see this in our testing because on failure/peering our clients resubmit their requests.

Actions

Copy link

Also available in: Atom PDF

Project

General

Profile

Ceph

Custom queries

Fix #6059

osd: block reads while repgather is writing across replicas

Updated by Ian Colle over 10 years ago

Updated by Samuel Just over 10 years ago

Updated by Ian Colle over 10 years ago

Updated by Sage Weil over 10 years ago

Updated by Samuel Just over 10 years ago

Updated by Ian Colle over 10 years ago

Updated by Ian Colle over 10 years ago

Updated by Sage Weil over 10 years ago