Bug #15211
tests: test-erasure-code.sh failing sometimes during 'make check' on master
Status:
Resolved
Priority:
Urgent
Assignee:
-
Category:
qa
Target version:
-
% Done:
0%
Source:
other
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):
Description
One example:
https://jenkins.ceph.com/job/ceph-pull-requests/3328/consoleFull
./test/erasure-code/test-erasure-code.sh:46: run: TEST_rados_put_get_jerasure testdir/test-erasure-code ./test/erasure-code/test-erasure-code.sh:212: TEST_rados_put_get_jerasure: local dir=testdir/test-erasure-code ./test/erasure-code/test-erasure-code.sh:214: TEST_rados_put_get_jerasure: rados_put_get testdir/test-erasure-code ecpool ./test/erasure-code/test-erasure-code.sh:70: rados_put_get: local dir=testdir/test-erasure-code ./test/erasure-code/test-erasure-code.sh:71: rados_put_get: local poolname=ecpool ./test/erasure-code/test-erasure-code.sh:72: rados_put_get: local objname=SOMETHING ./test/erasure-code/test-erasure-code.sh:75: rados_put_get: for marker in AAA BBB CCCC DDDD ./test/erasure-code/test-erasure-code.sh:76: rados_put_get: printf '%*s' 1024 AAA ./test/erasure-code/test-erasure-code.sh:75: rados_put_get: for marker in AAA BBB CCCC DDDD ./test/erasure-code/test-erasure-code.sh:76: rados_put_get: printf '%*s' 1024 BBB ./test/erasure-code/test-erasure-code.sh:75: rados_put_get: for marker in AAA BBB CCCC DDDD ./test/erasure-code/test-erasure-code.sh:76: rados_put_get: printf '%*s' 1024 CCCC ./test/erasure-code/test-erasure-code.sh:75: rados_put_get: for marker in AAA BBB CCCC DDDD ./test/erasure-code/test-erasure-code.sh:76: rados_put_get: printf '%*s' 1024 DDDD ./test/erasure-code/test-erasure-code.sh:82: rados_put_get: ./rados --pool ecpool put SOMETHING testdir/test-erasure-code/ORIGINAL ./test/erasure-code/test-erasure-code.sh:83: rados_put_get: ./rados --pool ecpool get SOMETHING testdir/test-erasure-code/COPY ./test/erasure-code/test-erasure-code.sh:84: rados_put_get: diff testdir/test-erasure-code/ORIGINAL testdir/test-erasure-code/COPY ./test/erasure-code/test-erasure-code.sh:85: rados_put_get: rm testdir/test-erasure-code/COPY ./test/erasure-code/test-erasure-code.sh:92: rados_put_get: initial_osds=($(get_osds $poolname $objname)) ../test/erasure-code/test-erasure-code.sh:92: rados_put_get: get_osds ecpool SOMETHING .../qa/workunits/ceph-helpers.sh:640: get_osds: local poolname=ecpool .../qa/workunits/ceph-helpers.sh:641: get_osds: local objectname=SOMETHING ..../qa/workunits/ceph-helpers.sh:644: get_osds: ceph --format xml osd map ecpool SOMETHING ..../qa/workunits/ceph-helpers.sh:644: get_osds: xmlstarlet sel -t -m //acting/osd -v . -o ' ' .../qa/workunits/ceph-helpers.sh:644: get_osds: local 'osds=6 0 4 ' .../qa/workunits/ceph-helpers.sh:646: get_osds: echo 6 0 4 ./test/erasure-code/test-erasure-code.sh:92: rados_put_get: local -a initial_osds ./test/erasure-code/test-erasure-code.sh:93: rados_put_get: local last=2 ./test/erasure-code/test-erasure-code.sh:94: rados_put_get: ./ceph osd out 4 *** DEVELOPER MODE: setting PATH, PYTHONPATH and LD_LIBRARY_PATH *** marked out osd.4. ./test/erasure-code/test-erasure-code.sh:95: rados_put_get: grep '\<4\>' ../qa/workunits/ceph-helpers.sh:640: get_osds: local poolname=ecpool ../qa/workunits/ceph-helpers.sh:641: get_osds: local objectname=SOMETHING .../qa/workunits/ceph-helpers.sh:644: get_osds: ceph --format xml osd map ecpool SOMETHING .../qa/workunits/ceph-helpers.sh:644: get_osds: xmlstarlet sel -t -m //acting/osd -v . -o ' ' ../qa/workunits/ceph-helpers.sh:644: get_osds: local 'osds=6 0 4 ' ../qa/workunits/ceph-helpers.sh:646: get_osds: echo 6 0 4 6 0 4 ./test/erasure-code/test-erasure-code.sh:95: rados_put_get: return 1 TEST_rados_put_get_jerasure: return 1
Related issues
Associated revisions
test/test-erasure-code.sh: disable pg temp priming
If we prime pg_temp, there is a race where this test expects an out OSD
to not be in the acting set but it will be there initially.
Fixes: #15211
Signed-off-by: Sage Weil <sage@redhat.com>
History
#1 Updated by Josh Durgin about 8 years ago
- Subject changed from tests: test-erasure-code.sh failing sometimes on during 'make check' on master to tests: test-erasure-code.sh failing sometimes during 'make check' on master
#2 Updated by Josh Durgin about 8 years ago
Another example: https://jenkins.ceph.com/job/ceph-pull-requests/3373/consoleFull
#3 Updated by Kefu Chai about 8 years ago
- Status changed from New to Fix Under Review
#4 Updated by Kefu Chai about 8 years ago
- Status changed from Fix Under Review to Resolved
#5 Updated by Laura Flores over 1 year ago
- Related to Bug #57883: test-erasure-code.sh: TEST_rados_put_get_jerasure fails on "rados_put_get: grep '\<5\>'" added