Bug #24336

open

rgw-multisite: Segmentation fault when zones use different rgw_data_log_max_shards values

Added by Xinying Song almost 6 years ago. Updated almost 5 years ago.

Status: Fix Under Review
Priority: Normal
Assignee:
Target version: -
% Done: 0%
Source:
Tags: multisite
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

RGW crashes with a segmentation fault (and dumps core) when zones within the same zonegroup are configured with different rgw_data_log_max_shards values.
Log output:
2018-05-29 18:27:47.560 7fba29dd1700 -1 *** Caught signal (Segmentation fault) **
 in thread 7fba29dd1700 thread_name:data-sync

ceph version 13.1.0-490-gfbf0e10 (fbf0e10af014a15a47e92dda53e7d3fce890421e) nautilus (dev)
1: (()+0x272be0) [0x55f1e3467be0]
2: (()+0x10330) [0x7fba57f55330]
3: (std::__detail::_List_node_base::_M_hook(std::__detail::_List_node_base*)+0xf) [0x7fba4dbf771f]
4: (RGWOmapAppend::append(std::string const&)+0x57) [0x55f1e351caf7]
5: (RGWListBucketIndexesCR::operate()+0x9fd) [0x55f1e375017d]
6: (RGWCoroutinesStack::operate(RGWCoroutinesEnv*)+0x46) [0x55f1e3514d56]
7: (RGWCoroutinesManager::run(std::list<RGWCoroutinesStack*, std::allocator<RGWCoroutinesStack*> >&)+0x27b) [0x55f1e3517f0b]
8: (RGWCoroutinesManager::run(RGWCoroutine*)+0x68) [0x55f1e3518d48]
9: (RGWRemoteDataLog::run_sync(int)+0x1d7) [0x55f1e3737017]
10: (RGWDataSyncProcessorThread::process()+0x46) [0x55f1e35c2d46]
11: (RGWRadosThread::Worker::entry()+0x21b) [0x55f1e356820b]
12: (()+0x8184) [0x7fba57f4d184]
13: (clone()+0x6d) [0x7fba4d69a03d]
NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

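The cause is that we initialize an omap manager with the size read from the remote zone (rgw_data_log_max_shards), but access it with a shard ID computed from the local config (rgw_data_log_max_shards). When the local value is larger than the remote one, that shard ID can exceed the size of the omap manager, so the append touches memory that was never set up.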
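
To make the mismatch concrete, here is a simplified sketch of the pattern and of one possible guard. It is not the actual Ceph code and not the fix under review: `ShardWriter`, `ShardWriters`, and `shard_for` are hypothetical stand-ins, and the hashing is an assumption (only the modulo by the shard count matters for the illustration).

```cpp
#include <cstddef>
#include <functional>
#include <string>
#include <vector>

// Hypothetical stand-in for a per-shard omap writer (not the real RGWOmapAppend).
struct ShardWriter {
    std::vector<std::string> entries;
    void append(const std::string& key) { entries.push_back(key); }
};

// Hypothetical helper: map a key to a shard ID in [0, num_shards).
// The real RGW hashing differs; num_shards must be greater than zero.
inline std::size_t shard_for(const std::string& key, std::size_t num_shards) {
    return std::hash<std::string>{}(key) % num_shards;
}

// Container sized from the REMOTE zone's shard count.
class ShardWriters {
public:
    explicit ShardWriters(std::size_t remote_num_shards)
        : writers_(remote_num_shards) {}

    std::size_t size() const { return writers_.size(); }

    // Reject shard IDs outside the remote range instead of indexing blindly.
    // A shard ID derived from a larger LOCAL shard count would otherwise
    // index past the end of writers_, which is undefined behavior.
    bool append(std::size_t shard_id, const std::string& key) {
        if (shard_id >= writers_.size()) {
            return false;
        }
        writers_[shard_id].append(key);
        return true;
    }

private:
    std::vector<ShardWriter> writers_;
};
```

With a container built for 4 remote shards, `append(100, ...)` (an ID that could come from a local count of 128) is rejected, whereas computing the ID as `shard_for(key, writers.size())` always stays in range. Which of the two approaches suits the real code is a design question for the fix itself.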

Actions #2

Updated by Nathan Cutler almost 6 years ago

  • Status changed from New to Fix Under Review
Actions #3

Updated by Casey Bodley almost 6 years ago

  • Assignee set to Casey Bodley
Actions #4

Updated by Casey Bodley almost 5 years ago

  • Tags set to multisite