Project

General

Profile

Actions

Bug #1564

closed

osd: osd should not be primary before data is replicated

Added by Greg Farnum over 12 years ago. Updated over 12 years ago.

Status:
Won't Fix
Priority:
Normal
Assignee:
-
Category:
OSD
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
Regression:
Severity:
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Right now an OSD is kicked out of the acting set only if it needs a backlog, and it's allowed back in to the acting set once it has the backlog. This means that it can become primary for a PG while it still has hours of data to transfer, though! This is pretty bad since any reads or writes to that PG are likely to block...
Now, the OSD is supposed to prioritize objects which have waiting operations, but that's still a pretty significant latency increase, and sometimes it breaks (see #1563).

Actions

Also available in: Atom PDF