Bug #13592

closed

ceph-helpers: TEST_auto_repair_erasure_coded intermittent failures

Added by Loïc Dachary over 8 years ago. Updated over 8 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
Category:
-
Target version:
-
% Done:

0%

Source:
other
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

./test/osd/osd-scrub-repair.sh:182: TEST_auto_repair_erasure_coded:  objectstore_tool testdir/osd-scrub-repair 0 SOMETHING list-attrs
../qa/workunits/ceph-helpers.sh:792: objectstore_tool:  local dir=testdir/osd-scrub-repair
../qa/workunits/ceph-helpers.sh:793: objectstore_tool:  shift
../qa/workunits/ceph-helpers.sh:794: objectstore_tool:  local id=0
../qa/workunits/ceph-helpers.sh:795: objectstore_tool:  shift
../qa/workunits/ceph-helpers.sh:796: objectstore_tool:  local osd_data=testdir/osd-scrub-repair/0
../qa/workunits/ceph-helpers.sh:798: objectstore_tool:  kill_daemons testdir/osd-scrub-repair TERM osd.0
.../qa/workunits/ceph-helpers.sh:192: kill_daemons:  shopt -q -o xtrace
.../qa/workunits/ceph-helpers.sh:192: kill_daemons:  echo true
../qa/workunits/ceph-helpers.sh:192: kill_daemons:  local trace=true
../qa/workunits/ceph-helpers.sh:193: kill_daemons:  true
../qa/workunits/ceph-helpers.sh:193: kill_daemons:  shopt -u -o xtrace
../qa/workunits/ceph-helpers.sh:219: kill_daemons:  return 0
../qa/workunits/ceph-helpers.sh:800: objectstore_tool:  ceph-objectstore-tool --data-path testdir/osd-scrub-repair/0 --journal-path testdir/osd-scrub-repair/0/journal SOMETHING list-attrs
No object id 'SOMETHING' found
../qa/workunits/ceph-helpers.sh:802: objectstore_tool:  return 1

Files

log.txt (416 KB) log.txt Loïc Dachary, 10/25/2015 11:11 PM
#8

Updated by Sage Weil over 8 years ago

  • Assignee set to Loïc Dachary
#9

Updated by Loïc Dachary over 8 years ago

  • Status changed from New to Need More Info
#13

Updated by Xinze Chi over 8 years ago

I think the reason is that the scrub is not being scheduled by the OSD:
sched_scrub is only called on about 33% of ticks (OSD::scrub_random_backoff()), and after that it may take additional time before the scrub of this particular PG is scheduled (the OSD loops through all PGs).
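The 1-in-3 gate described above can be sketched in shell (a hypothetical stand-in for the C++ OSD::scrub_random_backoff(), not the actual implementation), to show why a fixed 20 s wait can miss the scheduling window:

```shell
#!/bin/bash
# Hypothetical sketch of a 1-in-3 random backoff, mimicking the behaviour
# attributed to OSD::scrub_random_backoff(): on each tick, scrub
# scheduling only proceeds about one time in three.
scrub_random_backoff() {
    # RANDOM is 0..32767, so this succeeds roughly 1/3 of the time.
    (( RANDOM % 3 == 0 ))
}

# Count how often scheduling would proceed over many simulated ticks.
ticks=30000
scheduled=0
for ((i = 0; i < ticks; i++)); do
    if scrub_random_backoff; then
        scheduled=$((scheduled + 1))
    fi
done
echo "scheduled $scheduled of $ticks ticks"
```

Over many ticks the pass rate converges to ~33%, but any individual short window (such as 20 s of ticks) can see far fewer passes, which is consistent with the intermittent failures.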

#14

Updated by Xinze Chi over 8 years ago

I think we could schedule the scrub manually (using the ceph pg scrub command).

#15

Updated by Xinze Chi over 8 years ago

  1. Remove the object from one shard physically:
     objectstore_tool $dir $(get_not_primary $poolname SOMETHING) SOMETHING remove || return 1
  2. Give some time for auto repair:
     sleep 20

20 s may not be enough to schedule the scrub. Maybe we could wait until the scrub stamp changes instead (with a timeout strategy allowing more than 20 s)?
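A minimal sketch of that wait-until-the-stamp-changes loop, with a timeout. The helper names here are hypothetical, and the stamp source is stubbed with a file; a real test would query the cluster for the PG's last_scrub_stamp:

```shell
#!/bin/bash
# Hypothetical sketch: poll the scrub stamp until it changes, with a
# timeout, instead of a fixed "sleep 20".
stamp_file=$(mktemp)
echo "stamp-1" > "$stamp_file"

get_last_scrub_stamp() {
    # Stub: a real helper would ask the cluster for the PG's
    # last_scrub_stamp rather than read a file.
    cat "$stamp_file"
}

wait_for_scrub() {
    local initial=$1 timeout=$2 waited=0
    # Loop until the stamp differs from its initial value, or give up
    # after $timeout seconds.
    while [ "$(get_last_scrub_stamp)" = "$initial" ]; do
        if [ "$waited" -ge "$timeout" ]; then
            return 1
        fi
        sleep 1
        waited=$((waited + 1))
    done
    return 0
}

# Simulate the scrub completing (stamp changing) after 2 seconds.
( sleep 2; echo "stamp-2" > "$stamp_file" ) &

initial=$(get_last_scrub_stamp)
if wait_for_scrub "$initial" 30; then
    result="scrub observed"
else
    result="timed out"
fi
echo "$result"
wait
rm -f "$stamp_file"
```

The loop returns as soon as the stamp changes and only fails after the full timeout, so the test no longer depends on the scrub landing inside an arbitrary 20 s window.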

#16

Updated by Loïc Dachary over 8 years ago

  • Status changed from Need More Info to Fix Under Review
#18

Updated by Loïc Dachary over 8 years ago

  • Status changed from Fix Under Review to Resolved