Bug #57951: rgw: lc: lc for a single large bucket can run too long - rgw - Ceph

Actions

Copy link

Bug #57951

open

rgw: lc: lc for a single large bucket can run too long

Added by Matt Benjamin over 1 year ago. Updated 7 months ago.

Status:

Pending Backport

Priority:

Normal

Assignee:

Matt Benjamin

Target version:

% Done:

Source:

Tags:

lifecycle backport_processed

Backport:

quincy, pacific

Regression:

Severity:

3 - minor

Reviewed:

Affected Versions:

ceph-qa-suite:

Pull request ID:

48683

Crash signature (v1):

Crash signature (v2):

Description

If this happens, other lc hosts/threads can attempt to process the same bucket, which inflates overhead without any corresponding benefit. In practice, this may be likely only during test runs with a debug interval (which, in turn, should never be used in production), but we should prevent the behavior anyway.

Related issues 2 (1 open — 1 closed)

Actions

Copy link

Updated by Casey Bodley over 1 year ago

Status changed from In Progress to Fix Under Review
Tags set to lifecycle
Pull request ID set to 48683

Actions

Copy link

Updated by J. Eric Ivancich about 1 year ago

Status changed from Fix Under Review to Pending Backport

Actions

Copy link

Updated by Backport Bot about 1 year ago

Copied to Backport #58787: pacific: rgw: lc: lc for a single large bucket can run too long added

Actions

Copy link

Updated by Backport Bot about 1 year ago

Copied to Backport #58788: quincy: rgw: lc: lc for a single large bucket can run too long added

Actions

Copy link

Updated by Backport Bot about 1 year ago

Tags changed from lifecycle to lifecycle backport_processed

Actions

Copy link

Updated by Cory Snyder 11 months ago

We seem to have encountered this scenario many times on production Pacific clusters. At least, we have a lot of error logs that indicate LC is processing the same bucket on multiple hosts concurrently over the period of multiple days. Can I assist with backports on this one, Matt?

Actions

Copy link

Updated by Matt Benjamin 11 months ago

Hi Cory,

Sure, I'd be delighted to have you do that, and will review and assist if needed.

Matt

Actions

Copy link

Updated by Cory Snyder 7 months ago

For what it's worth, I'll note that we've commonly seen lifecycle rules be used to delete everything in a bucket when a client wishes to empty it so that the bucket itself may be deleted. For buckets with tens or hundreds of millions of objects, and LC rules that are designed to immediately expire every object in the bucket, the processing can easily take multiple days.

Actions

Copy link

Also available in: Atom PDF

Project

General

Profile

Ceph » rgw

Custom queries

Bug #57951

rgw: lc: lc for a single large bucket can run too long

Updated by Casey Bodley over 1 year ago

Updated by J. Eric Ivancich about 1 year ago

Updated by Backport Bot about 1 year ago

Updated by Backport Bot about 1 year ago

Updated by Backport Bot about 1 year ago

Updated by Cory Snyder 11 months ago

Updated by Matt Benjamin 11 months ago

Updated by Cory Snyder 7 months ago