Project

General

Profile

Actions

Bug #17044

closed

multisite: assert(next) failed in RGWMetaSyncCR

Added by Casey Bodley over 7 years ago. Updated about 7 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
Target version:
-
% Done:

0%

Source:
other
Tags:
Backport:
jewel
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

When testing with multiple gateways per zone, I'm seeing some omap-get-vals requests fail with EIO. RGWMetaSyncCR doesn't notice these errors, and ends up hitting an assertion:


2016-08-16 10:44:18.083635 7f11e5cac700 -1 /home/cbodley/ceph/src/rgw/rgw_sync.cc: In function 'virtual int RGWMetaSyncCR::operate()' thread 7f11e5cac700 time 2016-08-16 10:44:18.077186
/home/cbodley/ceph/src/rgw/rgw_sync.cc: 1757: FAILED assert(next)

 ceph version Development (no_version)
 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x95) [0x1195f92]
 2: (RGWMetaSyncCR::operate()+0xc64) [0x109e11c]
 3: (RGWCoroutinesStack::operate(RGWCoroutinesEnv*)+0x191) [0xe6e77d]
 4: (RGWCoroutinesManager::run(std::__cxx11::list<RGWCoroutinesStack*, std::allocator<RGWCoroutinesStack*> >&)+0x287) [0xe7033f]
 5: (RGWCoroutinesManager::run(RGWCoroutine*)+0xbc) [0xe71438]
 6: (RGWRemoteMetaLog::run_sync()+0x1638) [0x108b8a4]
 7: (RGWMetaSyncStatusManager::run()+0x1c) [0xf3c03e]
 8: (RGWMetaSyncProcessorThread::process()+0x1c) [0xf3e322]
 9: (RGWRadosThread::Worker::entry()+0xf6) [0xee4c70]
 10: (Thread::entry_wrapper()+0xc1) [0x11b9f29]
 11: (Thread::_entry_func(void*)+0x18) [0x11b9e5e]
 12: (()+0x761a) [0x7f1205c3061a]
 13: (clone()+0x6d) [0x7f120442259d]


Related issues 1 (0 open1 closed)

Copied to rgw - Backport #17119: jewel: multisite: assert(next) failed in RGWMetaSyncCRResolvedLoïc DacharyActions
Actions #1

Updated by Casey Bodley over 7 years ago

  • Status changed from New to Fix Under Review
Actions #2

Updated by Casey Bodley over 7 years ago

  • Status changed from Fix Under Review to Pending Backport
Actions #3

Updated by Loïc Dachary over 7 years ago

  • Copied to Backport #17119: jewel: multisite: assert(next) failed in RGWMetaSyncCR added
Actions #4

Updated by Yehuda Sadeh over 7 years ago

  • Assignee changed from Casey Bodley to Yehuda Sadeh
Actions #5

Updated by Yehuda Sadeh over 7 years ago

  • Assignee changed from Yehuda Sadeh to Casey Bodley
Actions #6

Updated by Nathan Cutler about 7 years ago

  • Status changed from Pending Backport to Resolved
Actions

Also available in: Atom PDF