Bug #5692

closed

RADOS performance regression in 0.65

Added by Mark Nelson almost 11 years ago. Updated almost 11 years ago.

Status: Resolved
Priority: Urgent
Assignee: -
Category: -
Target version: -
% Done: 0%
Source: other
Severity: 3 - minor

Description

After some narrowing down, it looks like we have a pretty serious write performance regression starting in 0.65. It mostly affects XFS and EXT4. RADOS Bench performance is impacted across all IO sizes, though it is worse at smaller IO sizes: small writes are about 3x slower, while large writes are about 1/3rd slower.
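To put the two figures on the same scale, here is a rough sketch of what they mean as remaining throughput. It assumes "3x slower" means throughput divided by 3 and "1/3rd slower" means one third of the throughput is lost (it could also be read as each write taking one third longer). The baseline value and the function names are made up for illustration, not measurements or code from this bug.

```python
from fractions import Fraction


def remaining_after_times_slower(factor):
    """Read 'N times slower' as throughput divided by N."""
    return Fraction(1) / Fraction(factor)


def remaining_after_fraction_slower(fraction):
    """Read 'a fraction slower' as that fraction of throughput lost."""
    return Fraction(1) - Fraction(fraction)


# Small writes: about 3x slower.
small_write_ratio = remaining_after_times_slower(3)

# Large writes: about 1/3rd slower (assumed to mean 1/3 of throughput lost).
large_write_ratio = remaining_after_fraction_slower(Fraction(1, 3))

# Hypothetical pre-regression throughput in MB/s, for illustration only.
baseline_mb_s = 90

for label, ratio in (("small writes", small_write_ratio),
                     ("large writes", large_write_ratio)):
    after = float(baseline_mb_s * ratio)
    print(f"{label}: {ratio} of baseline, {baseline_mb_s} MB/s -> {after:.0f} MB/s")
```

Under these assumptions, small writes keep roughly one third of their earlier throughput and large writes roughly two thirds.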

According to Sage's release notes, here's what we changed in the OSD in 0.65:

- osd: do not use fadvise(DONTNEED) on XFS (data corruption on power cycle)
- osd: recovery and peering performance improvements
- osd: new writeback throttling (for less bursty write performance) (Sam Just)
- osd: ping/heartbeat on public and private interfaces
- osd: avoid osd flapping from asymmetric network failure
- osd: re-use partially deleted PG contents when present (Sam Just)
- osd: break blacklisted client watches (David Zafman)

See more at: http://ceph.com/releases/v0-65-released/#sthash.pzr25JaM.dpuf
