Bug #11155

closed

degraded writes can increase the window for generating an unfound object with small min_size

Added by Samuel Just about 9 years ago. Updated about 9 years ago.

Status: Resolved
Priority: Urgent
Assignee:
Category: -
Target version: -
% Done: 0%
Source: other
Tags:
Backport:
Regression:
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

[0,1] is the acting set
Object foo is present on 0, missing on 1.
We accept a write on foo, submitting it to both 0 and 1.
0 restarts before persisting the write; 1 remembers the log entry.
foo is now unfound.
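
The sequence above can be replayed in a toy model. This is only a sketch: ToyOSD and is_unfound are hypothetical names, plain integers stand in for object versions, and nothing here is Ceph's actual code. An object counts as unfound when the newest version recorded in any log is not stored on any OSD.

```python
from dataclasses import dataclass, field


@dataclass
class ToyOSD:
    """Hypothetical stand-in for an OSD; not Ceph's real data structures."""
    osd_id: int
    store: dict = field(default_factory=dict)  # object name -> version persisted on disk
    log: list = field(default_factory=list)    # persisted (object name, version) entries


def is_unfound(osds, obj):
    """True if the newest logged version of obj is not stored on any OSD."""
    logged = [v for osd in osds for (name, v) in osd.log if name == obj]
    if not logged:
        return False
    newest = max(logged)
    return all(osd.store.get(obj) != newest for osd in osds)


# Acting set [0, 1]: foo (version 1) is present on 0 and missing on 1.
osd0 = ToyOSD(0, store={"foo": 1}, log=[("foo", 1)])
osd1 = ToyOSD(1, store={}, log=[("foo", 1)])
before = is_unfound([osd0, osd1], "foo")

# Degraded write of version 2: osd1 persists only the log entry, because it
# still lacks the object itself; osd0 restarts before persisting anything.
osd1.log.append(("foo", 2))
after = is_unfound([osd0, osd1], "foo")

print(before, after)  # prints: False True
```

The only copy of foo on disk is version 1 on osd 0, while the log on osd 1 says version 2 should exist, so no OSD can supply the version the log demands.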

This isn't actually a bug as such. Even without degraded writes, there is a similar window when writing to a size-one acting set (min_size 1, size 2): the single osd propagates the log to another osd, and the first osd is then killed. It might be worth changing find_best_info to prefer a log with an older last_update if the holder of the newer last_update happens to have a lower last_complete.
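
A minimal sketch of that selection rule, under stated assumptions: ToyInfo and pick_best are hypothetical names, integers stand in for versions, and the real find_best_info weighs more than this toy does. It only illustrates the idea of preferring an older but more complete log over a newer log whose holder has a lower last_complete.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ToyInfo:
    """Hypothetical per-OSD summary; integers stand in for Ceph versions."""
    osd_id: int
    last_update: int    # newest log entry this OSD holds
    last_complete: int  # every object up to this version is present locally


def pick_best(infos):
    """Pick a log source, preferring completeness over a newer log (toy rule)."""
    newest = max(infos, key=lambda i: i.last_update)
    more_complete = [
        i for i in infos
        if i.last_update < newest.last_update
        and i.last_complete > newest.last_complete
    ]
    if more_complete:
        return max(more_complete, key=lambda i: (i.last_complete, i.last_update))
    return newest


# The scenario from the description: osd 1 logged the write (version 2) but
# is still missing foo, so its last_complete lags behind osd 0's.
scenario = [ToyInfo(0, last_update=1, last_complete=1),
            ToyInfo(1, last_update=2, last_complete=0)]
chosen = pick_best(scenario)

# When the newest log holder is also fully complete, it is still chosen.
healthy = [ToyInfo(0, 1, 1), ToyInfo(1, 2, 2)]
chosen_healthy = pick_best(healthy)

print(chosen.osd_id, chosen_healthy.osd_id)  # prints: 0 1
```

In this toy the newer log entry is simply ignored when osd 0 is chosen; whether discarding it is safe in a real cluster is outside what the sketch models.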

Actions #1

Updated by Samuel Just about 9 years ago

  • Status changed from New to 7
Actions #2

Updated by Samuel Just about 9 years ago

ubuntu@teuthology:/a/samuelj-2015-03-22_22:05:00-rados-wip-sam-testing-distro-basic-multi/816986/remote

This one is actually with wip-11155; it didn't help, since the degraded peer was the only one to go active in the next interval.

Actions #3

Updated by Samuel Just about 9 years ago

  • Status changed from 7 to Resolved
Actions #4

Updated by Samuel Just about 9 years ago

Reverted the degraded writes patches.
