Bug #17044

multisite: assert(next) failed in RGWMetaSyncCR

Added by Casey Bodley about 1 year ago. Updated 7 months ago.

Status: Resolved
Priority: Normal
Assignee:
Target version: -
Start date: 08/16/2016
Due date:
% Done: 0%
Source: other
Tags:
Backport: jewel
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Release:
Needs Doc: No

Description

When testing with multiple gateways per zone, I'm seeing some omap-get-vals requests fail with EIO. RGWMetaSyncCR doesn't notice these errors, and ends up hitting an assertion:


2016-08-16 10:44:18.083635 7f11e5cac700 -1 /home/cbodley/ceph/src/rgw/rgw_sync.cc: In function 'virtual int RGWMetaSyncCR::operate()' thread 7f11e5cac700 time 2016-08-16 10:44:18.077186
/home/cbodley/ceph/src/rgw/rgw_sync.cc: 1757: FAILED assert(next)

 ceph version Development (no_version)
 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x95) [0x1195f92]
 2: (RGWMetaSyncCR::operate()+0xc64) [0x109e11c]
 3: (RGWCoroutinesStack::operate(RGWCoroutinesEnv*)+0x191) [0xe6e77d]
 4: (RGWCoroutinesManager::run(std::__cxx11::list<RGWCoroutinesStack*, std::allocator<RGWCoroutinesStack*> >&)+0x287) [0xe7033f]
 5: (RGWCoroutinesManager::run(RGWCoroutine*)+0xbc) [0xe71438]
 6: (RGWRemoteMetaLog::run_sync()+0x1638) [0x108b8a4]
 7: (RGWMetaSyncStatusManager::run()+0x1c) [0xf3c03e]
 8: (RGWMetaSyncProcessorThread::process()+0x1c) [0xf3e322]
 9: (RGWRadosThread::Worker::entry()+0xf6) [0xee4c70]
 10: (Thread::entry_wrapper()+0xc1) [0x11b9f29]
 11: (Thread::_entry_func(void*)+0x18) [0x11b9e5e]
 12: (()+0x761a) [0x7f1205c3061a]
 13: (clone()+0x6d) [0x7f120442259d]
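
The failure mode described above, where an error return goes unchecked and a later assertion fires on state that only exists after a successful pass, can be illustrated with a minimal C++ sketch. The names PassResult, run_pass, and advance are hypothetical and are not taken from rgw_sync.cc. This is not the actual fix, only an illustration of checking the return code before relying on the asserted state.

```cpp
#include <cassert>
#include <cerrno>

// Hypothetical result of one sync pass: a return code plus a "next"
// position that is only populated when the pass succeeded.
struct PassResult {
  int ret;        // 0 on success, negative errno (e.g. -EIO) on failure
  bool has_next;  // true only when ret == 0
  int next;       // meaningful only when has_next is true
};

// Hypothetical stand-in for a pass whose underlying read may fail with EIO.
PassResult run_pass(bool read_fails) {
  PassResult r;
  if (read_fails) {
    r.ret = -EIO;
    r.has_next = false;
    r.next = 0;
  } else {
    r.ret = 0;
    r.has_next = true;
    r.next = 1;
  }
  return r;
}

// Checks the return code before touching `next`, so a failed read is
// reported to the caller instead of tripping the assertion.
int advance(const PassResult& r, int* cursor) {
  if (r.ret < 0) {
    return r.ret;       // propagate the error; cursor is left unchanged
  }
  assert(r.has_next);   // holds here because the pass succeeded
  *cursor = r.next;
  return 0;
}
```

With this ordering, advance(run_pass(true), &cursor) returns -EIO and leaves cursor untouched. Without the early return, the same input would reach the assertion with has_next false, which is the shape of the crash in the trace above.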


Related issues

Copied to rgw - Backport #17119: jewel: multisite: assert(next) failed in RGWMetaSyncCR Resolved

History

#1 Updated by Casey Bodley about 1 year ago

  • Status changed from New to Need Review

#2 Updated by Casey Bodley 12 months ago

  • Status changed from Need Review to Pending Backport

#3 Updated by Loic Dachary 12 months ago

  • Copied to Backport #17119: jewel: multisite: assert(next) failed in RGWMetaSyncCR added

#4 Updated by Yehuda Sadeh 12 months ago

  • Assignee changed from Casey Bodley to Yehuda Sadeh

#5 Updated by Yehuda Sadeh 12 months ago

  • Assignee changed from Yehuda Sadeh to Casey Bodley

#6 Updated by Nathan Cutler 7 months ago

  • Status changed from Pending Backport to Resolved
