Feature #47718

closed

Introduce means to detect/work around spurious read errors in BlueFS

Added by Igor Fedotov over 3 years ago. Updated almost 3 years ago.

Status: Resolved
Priority: Normal
Assignee: -
Target version:
% Done: 0%
Source:
Tags:
Backport:
Reviewed:
Affected Versions:
Pull request ID:

Description

We have seen such errors for user data on the main device and worked around them by retrying the read, see https://tracker.ceph.com/issues/22464

But the DB volume (and hence RocksDB) is still exposed to this issue, and it is hard for us to diagnose it properly.
Hence the suggestion to introduce a detector that triggers on read blocks consisting entirely of zeros and unconditionally retries such reads. If the all-zeros pattern does not persist on retry, this indicates the same spurious read error. The detector is to be disabled by default and turned on when suspicious read errors are observed.

Unfortunately, this looks like the only solution available so far that requires no updates to RocksDB...
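
For illustration only, the proposed detector could be sketched roughly as below. This is not BlueFS code: `read_with_zero_retry`, `is_all_zeros`, the `enabled` switch and the `max_retries` limit are hypothetical names and values chosen for this sketch, and `read_fn` stands in for whatever performs the actual device read.

```python
from typing import Callable, Tuple


def is_all_zeros(buf: bytes) -> bool:
    """Return True if the buffer is non-empty and every byte is zero."""
    return len(buf) > 0 and not any(buf)


def read_with_zero_retry(
    read_fn: Callable[[int, int], bytes],
    offset: int,
    length: int,
    enabled: bool = False,   # assumption: detector is off by default
    max_retries: int = 3,    # assumption: retry limit is arbitrary here
) -> Tuple[bytes, bool]:
    """Read a block and, if it comes back all zeros, re-read it.

    Returns (data, spurious). `spurious` is True when the first read
    was all zeros but a later retry returned different data, which
    suggests a transient read error rather than genuinely zeroed data.
    """
    buf = read_fn(offset, length)
    if not enabled or not is_all_zeros(buf):
        return buf, False

    for _ in range(max_retries):
        retry = read_fn(offset, length)
        if not is_all_zeros(retry):
            # The zeros did not persist: treat the first read as spurious.
            return retry, True

    # The zeros persisted across all retries: assume they are real data.
    return buf, False
```

A block that legitimately contains only zeros costs extra reads under this scheme, which is one reason to keep such a detector disabled unless suspicious read errors are being investigated.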


Related issues 1 (0 open, 1 closed)

Related to bluestore - Bug #47271: ceph version 14.2.10-OSD fails (Closed)

#1

Updated by Igor Fedotov over 3 years ago

  • Related to Bug #47271: ceph version 14.2.10-OSD fails added
#2

Updated by Igor Fedotov almost 3 years ago

  • Status changed from New to Resolved
  • Target version set to v17.0.0
  • Pull request ID set to 39185