Project

General

Profile

Actions

Bug #19236

closed

multisite: some 'radosgw-admin data sync' commands hang

Added by Casey Bodley about 7 years ago. Updated about 7 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
-
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
jewel, kraken
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

RGWRemoteDataLog::init_sync_status() and read_sync_status() use their own RGWCoroutinesManager, because RGWCoroutinesManager::run() is not reentrant. The rest api needs to call these functions while the sync thread is in RGWRemoteDataLog::run_sync().

However, those coroutines were still using RGWRemoteDataLog::http_manager, which was sending completions to RGWRemoteDataLog's RGWCoroutinesManager. This prevented the coroutines from waking up when the replies came back, and results in hangs.

https://github.com/ceph/ceph/pull/13660


Related issues 2 (0 open2 closed)

Copied to rgw - Backport #19353: jewel: multisite: some 'radosgw-admin data sync' commands hangResolvedCasey BodleyActions
Copied to rgw - Backport #19354: kraken: multisite: some 'radosgw-admin data sync' commands hangResolvedNathan CutlerActions
Actions #1

Updated by Casey Bodley about 7 years ago

  • Backport set to jewel kraken
Actions #2

Updated by Orit Wasserman about 7 years ago

  • Status changed from Fix Under Review to Pending Backport
Actions #3

Updated by Nathan Cutler about 7 years ago

  • Copied to Backport #19353: jewel: multisite: some 'radosgw-admin data sync' commands hang added
Actions #4

Updated by Nathan Cutler about 7 years ago

  • Copied to Backport #19354: kraken: multisite: some 'radosgw-admin data sync' commands hang added
Actions #5

Updated by Nathan Cutler about 7 years ago

  • Status changed from Pending Backport to Resolved
  • Backport changed from jewel kraken to jewel, kraken
Actions

Also available in: Atom PDF