Bug #9700

closed

cephtool mon_osd intermittent failure

Added by John Spray over 9 years ago. Updated over 9 years ago.

Status: Resolved
Priority: Normal
Assignee:
Category: qa
Target version: -
% Done: 80%
Source: other
Tags:
Backport:
Regression:
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Hit this once on a gitbuilder. It's not clear to me why we have a 5-time retry here; is there some timeout raciness in the test? Log attached: grep for "echo retried with non zero exit status, 5 times".

Test code was added in:

commit f1becf9ad7237f36cf65e2b8dc95ee43946fe1fd
Author: Loic Dachary <loic-201408@dachary.org>
Date:   Sat Oct 4 11:34:27 2014 +0200

    qa: ceph tell must retry on ENXIO

    It is expected for ceph tell to fail with ENXIO if the daemon it is
    trying to join is not ready for some reason. This should be handled as a
    transient error instead of a fatal error.

    Add two shell functions to help with retry. They may prove useful if
    other cases requiring a few retries show up.

    http://tracker.ceph.com/issues/9655 Fixes: #9655

    Signed-off-by: Loic Dachary <loic-201408@dachary.org>

Or perhaps this is just the same thing as #9655 and the change to the test wasn't a sufficient wait?
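
For reference, the retry approach the commit message describes can be sketched roughly as below. This is an illustration only, not the code from the commit: the function name `retry_on_enxio`, the `RETRY_DELAY` variable, and the assumption that the failing command prints the string "ENXIO" are all hypothetical. It shows why a fixed retry count can still fail intermittently: if the daemon takes longer to come up than `max` attempts times the delay, the helper gives up.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not the function from the commit): run a command up to
# $1 times, retrying only while its output mentions ENXIO, which is treated
# here as a transient "daemon not ready yet" error.
retry_on_enxio() {
    local max=$1
    shift
    local status=0 output="" attempt
    for attempt in $(seq 1 "$max"); do
        status=0
        # Capture stdout and stderr together; remember the exit status.
        output=$("$@" 2>&1) || status=$?
        # Stop on success, or on a failure that is not ENXIO.
        if [ "$status" -eq 0 ] || ! grep -q ENXIO <<<"$output"; then
            break
        fi
        if [ "$attempt" -eq "$max" ]; then
            # Retries exhausted: report it and fall through with the failure.
            echo "gave up after $max attempts: $*" >&2
        else
            # Assumed fixed delay between attempts (seconds).
            sleep "${RETRY_DELAY:-1}"
        fi
    done
    if [ -n "$output" ]; then
        printf '%s\n' "$output"
    fi
    return "$status"
}
```

A call such as `retry_on_enxio 5 ceph tell osd.0 version` would then wait at most about four seconds in total with the default delay, which may or may not be enough on a loaded gitbuilder.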


Files

fail.log (785 KB), John Spray, 10/08/2014 06:11 AM

Related issues: 2 (0 open, 2 closed)

Related to Ceph - Bug #9655: tests: qa/workunits/cephtool/test.sh fails ENXIO (Resolved, Loïc Dachary, 10/03/2014)
Related to Ceph - Bug #8630: test osd-config.sh ENXIO (Resolved, Loïc Dachary, 06/19/2014)
