Bug #1898

closed

very long scrub blocked write operation

Added by Greg Farnum over 12 years ago. Updated over 12 years ago.

Status: Duplicate
Priority: High
Assignee:
Category: OSD
Target version:
% Done: 0%


Description

On cephstore6235 we saw a write operation blocked for 4 minutes by scrub. The log is available in /var/log/ceph/long_scrub.log; the operation is client.898189.0:13131563.


Related issues: 1 (0 open, 1 closed)

Is duplicate of Ceph - Feature #1783: osd: scrub incrementally across hash range using MOSDPGScan (Resolved, 12/03/2011)

Actions #1

Updated by Sage Weil over 12 years ago

  • Position set to 3
Actions #2

Updated by Sage Weil over 12 years ago

  • Target version deleted (v0.41)
  • Position deleted (10)
  • Position set to 5
Actions #3

Updated by Sage Weil over 12 years ago

  • Priority changed from Immediate to Urgent
Actions #4

Updated by Sage Weil over 12 years ago

  • Target version set to v0.41
Actions #5

Updated by Samuel Just over 12 years ago

It seems that osd.220 took 5 minutes between starting _scan_list and relocking the pg.

Actions #6

Updated by Samuel Just over 12 years ago

At 2012-01-06 16:22:52.45141 there is another case where osd.220 required 4 minutes to scan the 150000-object pg. osd.97 seems to require closer to 2 minutes to scan the same pg. We can reduce the severity of this problem by waiting for all replicas to respond to the first scan before stopping writes.
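
As a rough illustration of these figures, the sketch below derives the average scan rate of each OSD and compares how long writes would stay blocked in a deliberately simplified model. The model is an assumption made for illustration, not a description of the OSD code: writes stop at some point after the scrub starts and resume once the slowest scan has finished. `scan_rate`, `blocked_seconds`, and `final_pass` are hypothetical names.

```python
# Durations reported in this ticket, in seconds (approximate).
SCAN_SECONDS = {"osd.220": 4 * 60, "osd.97": 2 * 60}
PG_OBJECTS = 150_000


def scan_rate(objects, seconds):
    """Average number of objects scanned per second."""
    return objects / seconds


def blocked_seconds(scan_seconds, block_at, final_pass=0.0):
    """Toy model (assumption): writes stop `block_at` seconds after the
    scrub starts and resume once the slowest scan has finished and an
    optional final pass of `final_pass` seconds has run."""
    slowest = max(scan_seconds.values())
    return max(slowest - block_at, 0.0) + final_pass


rates = {osd: scan_rate(PG_OBJECTS, s) for osd, s in SCAN_SECONDS.items()}

# Writes stopped as soon as the scrub starts.
block_immediately = blocked_seconds(SCAN_SECONDS, block_at=0)

# Writes stopped only after every replica has answered the first scan.
block_after_all = blocked_seconds(
    SCAN_SECONDS, block_at=max(SCAN_SECONDS.values())
)

for osd, rate in rates.items():
    print(f"{osd}: about {rate:.0f} objects/s")
print(f"blocked when stopping writes immediately: {block_immediately:.0f} s")
print(f"blocked when waiting for all replicas:    {block_after_all:.0f} s")
```

In this model the stall shrinks to whatever the final pass costs, which is the effect the proposed mitigation aims for; the real saving depends on how much work remains after the first scan.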

Actions #7

Updated by Samuel Just over 12 years ago

  • Status changed from New to Won't Fix

We will fix this as part of the incremental scrubbing #1783
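
To illustrate why scrubbing incrementally would bound the stall, here is a minimal sketch under stated assumptions: if only one chunk of the pg is scanned (and blocked) at a time, a write waits at most for the chunk that contains its object. The chunk size of 1000 objects is hypothetical, and the scan rate is derived from the 150000 objects in 4 minutes reported for osd.220. The actual design of #1783 is not described in this ticket.

```python
def chunk_ranges(total_objects, chunk_size):
    """Yield (start, end) index ranges that cover all objects in order."""
    for start in range(0, total_objects, chunk_size):
        yield start, min(start + chunk_size, total_objects)


def worst_case_stall(objects_locked, objects_per_second):
    """Seconds a write may wait if `objects_locked` objects are scanned
    while writes to them are blocked."""
    return objects_locked / objects_per_second


PG_OBJECTS = 150_000
RATE = PG_OBJECTS / (4 * 60)  # objects/s, from the osd.220 figures
CHUNK = 1_000                 # hypothetical chunk size

chunks = list(chunk_ranges(PG_OBJECTS, CHUNK))
stall_per_chunk = worst_case_stall(CHUNK, RATE)
stall_whole_pg = worst_case_stall(PG_OBJECTS, RATE)

print(f"{len(chunks)} chunks, at most {stall_per_chunk:.1f} s blocked per chunk")
print(f"whole-pg scan blocks for {stall_whole_pg:.0f} s")
```

The total scan time is unchanged in this sketch; only the longest interval during which any single write can be blocked gets shorter.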

Actions #8

Updated by Anonymous over 12 years ago

  • Status changed from Won't Fix to Duplicate
  • Priority changed from Urgent to High

Even though this is not the same complaint as 1783, we plan on
fixing it with the same changes, so I am calling this a DUP of
1783.
