Bug #3697

closed

rbd copy.sh test failing in nightly

Added by Sage Weil over 11 years ago. Updated about 11 years ago.

Status: Duplicate
Priority: High
Assignee:
Target version: -
% Done: 0%
Source: Q/A
Tags:
Backport:
Regression:
Severity:
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

2012-12-28T22:27:46.767 INFO:teuthology.task.workunit.client.0.out:testing remove...
2012-12-28T22:27:46.767 INFO:teuthology.task.workunit.client.0.err:+ test_remove
2012-12-28T22:27:46.767 INFO:teuthology.task.workunit.client.0.err:+ echo testing remove...
2012-12-28T22:27:46.767 INFO:teuthology.task.workunit.client.0.err:+ remove_images
2012-12-28T22:27:50.792 INFO:teuthology.task.workunit.client.0.err:+ rbd create -s 1 test1
2012-12-28T22:27:50.852 INFO:teuthology.task.workunit.client.0.err:+ rbd rm test1
2012-12-28T22:27:51.014 INFO:teuthology.task.workunit.client.0.out:^MRemoving image: 100% complete...done.
2012-12-28T22:27:51.017 INFO:teuthology.task.workunit.client.0.err:+ rbd ls
2012-12-28T22:27:51.017 INFO:teuthology.task.workunit.client.0.err:+ wc -l
2012-12-28T22:27:51.018 INFO:teuthology.task.workunit.client.0.err:+ grep ^0$
2012-12-28T22:27:51.063 INFO:teuthology.task.workunit:Stopping rbd/copy.sh on client.0...
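
The last commands in the trace (`rbd ls`, `wc -l`, `grep ^0$`) look like one pipeline asserting that no images remain after `rbd rm`. A minimal sketch of that check, reconstructed from the trace rather than copied from copy.sh; the `RBD` override and the `pool_is_empty` name are hypothetical, added only so the check can be exercised against a stub:

```shell
# Hypothetical override: point RBD at a stub to try the check without a cluster.
RBD="${RBD:-rbd}"

# Succeeds only when the listing command prints zero lines,
# which is what `rbd ls | wc -l | grep ^0$` tests in the trace.
pool_is_empty() {
    # Strip whitespace because some wc implementations pad the count.
    _pie_count=$("$RBD" ls | wc -l | tr -d '[:space:]')
    [ "$_pie_count" = "0" ]
}
```

If the image just removed is still listed, the count is nonzero and the check fails, which would be consistent with the workunit stopping right after the `grep`.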

ubuntu@teuthology:/a/teuthology-2012-12-28_19:00:03-regression-next-testing-basic/29749

Related issues 1 (0 open, 1 closed)

Is duplicate of rbd - Bug #3958: rbd fsx fails with EBUSY | Resolved | Sage Weil | 01/29/2013

Actions #1

Updated by Sage Weil over 11 years ago

  • Project changed from Ceph to rbd
  • Category deleted (librbd)
Actions #2

Updated by Dan Mick over 11 years ago

  • Assignee set to Dan Mick
Actions #3

Updated by Dan Mick over 11 years ago

Trying to reproduce now

Actions #4

Updated by Dan Mick over 11 years ago

Hm, doesn't reproduce on local vstart cluster. Pondering possible failure modes.

Actions #5

Updated by Dan Mick over 11 years ago

  • Status changed from 12 to Can't reproduce
Actions #6

Updated by Sage Weil over 11 years ago

FWIW I ran this in a loop and reproduced it after 7 iterations (well, a slightly different error actually, when it removes a snapshot).
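
Running the test in a loop until it fails can be sketched roughly as below. This is an assumed harness, not the command actually used; `run_until_failure` and `FAILED_ITERATION` are hypothetical names.

```shell
# Re-run a command until it fails or the iteration budget is used up.
# Usage: run_until_failure MAX_ITERATIONS COMMAND [ARGS...]
# Returns 1 on the first failure and records the iteration in
# FAILED_ITERATION; returns 0 (FAILED_ITERATION=0) if every run passed.
run_until_failure() {
    _ruf_max="$1"
    shift
    _ruf_i=1
    while [ "$_ruf_i" -le "$_ruf_max" ]; do
        if ! "$@"; then
            echo "failed on iteration $_ruf_i" >&2
            FAILED_ITERATION=$_ruf_i
            return 1
        fi
        _ruf_i=$((_ruf_i + 1))
    done
    FAILED_ITERATION=0
    return 0
}
```

For example, `run_until_failure 50 ./copy.sh` would stop at the first failing run of a local copy of the workunit (the path here is illustrative).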

Actions #7

Updated by Sage Weil over 11 years ago

  • Status changed from Can't reproduce to In Progress
Actions #8

Updated by Dan Mick over 11 years ago

Reproduces OK on plana cluster, indeed. This seems to point toward some sort of OSD bug where committed state isn't properly read back by the next client. Have detailed logs to examine, hopefully with Sam's help.

Actions #9

Updated by Dan Mick over 11 years ago

When reproducing with lots of error logging to stderr, the error occurs on snapshots instead, because the snap rm/snap info test tries to parse stderr and the log output gets mixed in. :( So the logging is creating a different failure.

I'll change the test so that log_to_stderr is off and try running with full logging.
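
To illustrate why log output on stderr breaks a test that inspects stderr, here is a small self-contained sketch. Everything in it is hypothetical: `fake_snap_cmd` is a stub, the messages are invented, and the real check in copy.sh may be written differently.

```shell
# Stub standing in for a CLI whose error output the test inspects.
# NOISY=1 simulates debug logging being written to the same stream.
fake_snap_cmd() {
    if [ "${NOISY:-0}" = "1" ]; then
        echo "debug: some log line" >&2
    fi
    echo "error: snapshot not found" >&2
    return 1
}

# Brittle check: passes only if stderr is exactly the expected message.
# Usage: stderr_is_exactly EXPECTED COMMAND [ARGS...]
stderr_is_exactly() {
    _sie_expected="$1"
    shift
    # Capture stderr only; stdout is discarded.
    _sie_actual=$("$@" 2>&1 >/dev/null) || true
    [ "$_sie_actual" = "$_sie_expected" ]
}
```

With `NOISY=0` the check passes; with `NOISY=1` the extra log line makes it fail even though the command behaved the same, which is the kind of secondary failure described above. Keeping log output off stderr avoids it.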

Actions #10

Updated by Ian Colle over 11 years ago

  • Priority changed from Urgent to High
Actions #11

Updated by Dan Mick over 11 years ago

  • Status changed from In Progress to Can't reproduce

Unable to reproduce so far.

Actions #12

Updated by Tamilarasi muthamizhan about 11 years ago

Recent log: ubuntu@teuthology:/a/teuthology-2013-02-04_20:00:03-regression-bobtail-master-basic/15773

Actions #13

Updated by Tamilarasi muthamizhan about 11 years ago

  • Status changed from Can't reproduce to In Progress
  • Assignee changed from Dan Mick to Josh Durgin
Actions #14

Updated by Josh Durgin about 11 years ago

  • Status changed from In Progress to Duplicate