Bug #7511: "set noup; set down <n>; unset noup" doesn't bring <n> back up - Ceph - Ceph

Added by Dan Mick about 10 years ago. Updated about 10 years ago.

Status: Resolved
Priority: Immediate
Category: OSD
Source: Development
Severity: 3 - minor

Description

Experiment:
ceph osd set noup
ceph osd down 1
<osd 1 is down>
ceph osd unset noup

Expected:
<osd 1 is back up>

Observed:
<osd 1 stays down>
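
The experiment above can be scripted for repeated runs. The following is a minimal sketch, not part of the original report. It assumes the `ceph` CLI is on the PATH and that `ceph osd dump --format json` returns an `osds` list whose entries carry `osd` and `up` fields. The helper names (`ceph`, `osd_is_up`, `reproduce`) and the 30-second wait are illustrative choices. It changes cluster state, so it should only be pointed at a test cluster.

```python
import json
import subprocess
import time


def ceph(*args):
    """Run a ceph CLI command and return its stdout (assumes `ceph` is on PATH)."""
    return subprocess.run(
        ["ceph", *args], check=True, capture_output=True, text=True
    ).stdout


def osd_is_up(dump_json, osd_id):
    """Return True if osd_id is marked up in `ceph osd dump --format json` output.

    Assumption: the dump has an "osds" list with "osd" and "up" fields per entry.
    """
    for entry in json.loads(dump_json)["osds"]:
        if entry["osd"] == osd_id:
            return bool(entry["up"])
    raise KeyError(f"osd.{osd_id} not found in osd dump")


def reproduce(osd_id=1, wait=30.0, run=ceph, sleep=time.sleep):
    """Replay the steps from the report; return True if the OSD came back up.

    `run` and `sleep` are injectable so the logic can be exercised without a cluster.
    """
    run("osd", "set", "noup")
    run("osd", "down", str(osd_id))
    run("osd", "unset", "noup")
    remaining = wait
    while remaining > 0:
        # Poll the OSD map until the OSD is up again or the wait runs out.
        if osd_is_up(run("osd", "dump", "--format", "json"), osd_id):
            return True
        sleep(1.0)
        remaining -= 1.0
    return False
```

With the behavior described in this report, `reproduce()` would be expected to return `False`; once the OSD retries its boot after `unset noup`, it should return `True`.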

Sage theorizes that when the OSD tries to come back up and finds that noup is set, it should subscribe to new maps from the monitor, so that it sees the "unset noup" and tries to come back again (or words to that effect).

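The retry-on-new-map idea described above can be illustrated with a toy model. This is a sketch under stated assumptions, not Ceph's actual code: the `Monitor` and `OSD` classes, their methods, and the `resubscribe_on_noup` switch are all hypothetical names invented for illustration. The monitor refuses boot requests while noup is set; an OSD that subscribes to the next map after a refusal gets another chance once the flag is cleared.

```python
class Monitor:
    """Toy stand-in for the monitor: tracks noup, the up set, and map epochs."""

    def __init__(self):
        self.epoch = 1
        self.noup = False
        self.up = set()
        self.subscribers = []

    def _publish(self):
        """Bump the epoch and notify one-shot subscribers of the new map."""
        self.epoch += 1
        # Swap the list out first so callbacks can safely re-subscribe.
        pending, self.subscribers = self.subscribers, []
        for callback in pending:
            callback()

    def set_noup(self, value):
        self.noup = value
        self._publish()

    def mark_down(self, osd_id):
        self.up.discard(osd_id)
        self._publish()

    def handle_boot(self, osd_id):
        """Mark the OSD up unless noup is set; return True if it was marked up."""
        if self.noup:
            return False
        self.up.add(osd_id)
        self._publish()
        return True

    def subscribe(self, callback):
        self.subscribers.append(callback)


class OSD:
    """Toy OSD; resubscribe_on_noup toggles the proposed behavior."""

    def __init__(self, osd_id, mon, resubscribe_on_noup):
        self.osd_id = osd_id
        self.mon = mon
        self.resubscribe_on_noup = resubscribe_on_noup

    def try_boot(self):
        if self.osd_id in self.mon.up:
            return
        refused = not self.mon.handle_boot(self.osd_id)
        if refused and self.resubscribe_on_noup:
            # Boot was refused: wait for the next map and try again.
            self.mon.subscribe(self.try_boot)


def run_experiment(resubscribe_on_noup):
    """Replay the report's steps against the toy model; return True if osd 1 ends up."""
    mon = Monitor()
    osd = OSD(1, mon, resubscribe_on_noup)
    osd.try_boot()        # initial boot
    mon.set_noup(True)    # ceph osd set noup
    mon.mark_down(1)      # ceph osd down 1
    osd.try_boot()        # OSD tries to come back; refused because of noup
    mon.set_noup(False)   # ceph osd unset noup
    return 1 in mon.up
```

In this model, `run_experiment(False)` leaves osd 1 down, matching the observed behavior, while `run_experiment(True)` brings it back up after the flag is cleared.
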
Updated by Sage Weil about 10 years ago

Status changed from New to 12
Priority changed from Normal to Immediate

ha, this appears to be what is breaking the unit tests on firefly/master:
http://gitbuilder.sepia.ceph.com/gitbuilder-ceph-tarball-precise-amd64-notcmalloc/log.cgi?log=72f260e07cf5f7bc5264a7f8ab292b9dc9ec3222

Updated by Sage Weil about 10 years ago

     0> 2014-02-21 21:13:56.428703 7f0258ff9700 -1 osd/PG.cc: In function 'void PG::scrub_finish()' thread 7f0258ff9700 time 2014-02-21 21:13:56.393127
osd/PG.cc: 4294: FAILED assert(deep_scrub)

 ceph version 0.77-488-g4a2c0af (4a2c0af22acb51584ccd008b1e991d4943e6b557)
 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x95) [0x193f819]
 2: (PG::scrub_finish()+0x3a2) [0x15f5bc4]
 3: (PG::scrub_finalize()+0x2fb) [0x15f56cd]
 4: (OSD::ScrubFinalizeWQ::_process(PG*)+0x1d) [0x139d9c7]
 5: (ThreadPool::WorkQueue<PG>::_process(PG*, ThreadPool::TPHandle&)+0x32) [0x1429714]
 6: (ThreadPool::WorkQueue<PG>::_void_process(void*, ThreadPool::TPHandle&)+0x37) [0x1428cbf]
 7: (ThreadPool::worker(ThreadPool::WorkThread*)+0x715) [0x1932283]
 8: (ThreadPool::WorkThread::entry()+0x23) [0x1933f21]
