Project

General

Profile

Feature #1007

qa: osd failure and cluster recovery test(s)

Added by Sage Weil over 10 years ago. Updated over 9 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
-
Category:
qa
Target version:
% Done:

0%

Source:
Tags:
Backport:
Reviewed:
Affected Versions:
Pull request ID:

Description

We need tests of OSD failures that verify the cluster is able to recover. Eventually this will need to be fleshed out to include a variety of failure scenarios that try to get good coverage on the peering and recovery code. We can start out with some pretty simple tests, though:

- restart an osd once, or every few minutes.  verify get back to all active+clean.  maybe within some time bound?
- stop an osd, mark it out. continue operation for a while (dirty lots of objects). then re-add the osd. (this exercises code paths similar to a regular cluster expansion)
- restart multiple (or all) osds simultaneously.

Related issues

Blocked by Ceph - Feature #1212: teuthology: ability to restart daemons while other tasks are running Resolved 06/21/2011

History

#1 Updated by Sage Weil over 10 years ago

  • translation missing: en.field_position set to 8

#2 Updated by Sage Weil over 10 years ago

  • translation missing: en.field_story_points set to 33
  • translation missing: en.field_position deleted (9)
  • translation missing: en.field_position set to 9

#3 Updated by Sage Weil over 10 years ago

  • translation missing: en.field_story_points changed from 33 to 3
  • translation missing: en.field_position deleted (9)
  • translation missing: en.field_position set to 9

#4 Updated by Greg Farnum over 10 years ago

  • Assignee set to Samuel Just

#5 Updated by Anonymous over 10 years ago

  • Assignee changed from Samuel Just to Anonymous

#6 Updated by Sage Weil about 10 years ago

  • Target version changed from v0.28 to v0.29

#7 Updated by Sage Weil about 10 years ago

  • Assignee deleted (Anonymous)

#8 Updated by Sage Weil about 10 years ago

  • Target version changed from v0.29 to v0.30

#9 Updated by Sage Weil about 10 years ago

  • Subject changed from autotest: osd failure and cluster recovery test(s) to qa: osd failure and cluster recovery test(s)

#10 Updated by Sage Weil about 10 years ago

  • translation missing: en.field_position deleted (39)
  • translation missing: en.field_position set to 3

#11 Updated by Sage Weil about 10 years ago

  • Target version changed from v0.30 to v0.31

#12 Updated by Sage Weil about 10 years ago

  • translation missing: en.field_position deleted (26)
  • translation missing: en.field_position set to 27

#13 Updated by Sage Weil about 10 years ago

  • Target version changed from v0.31 to v0.32

#14 Updated by Sage Weil about 10 years ago

  • translation missing: en.field_position deleted (31)
  • translation missing: en.field_position set to 730

#15 Updated by Sage Weil about 10 years ago

  • translation missing: en.field_position deleted (730)
  • translation missing: en.field_position set to 1
  • translation missing: en.field_position changed from 1 to 748

#16 Updated by Sage Weil about 10 years ago

  • Target version changed from v0.32 to v0.33
  • translation missing: en.field_position deleted (748)
  • translation missing: en.field_position set to 1

#17 Updated by Sage Weil about 10 years ago

  • Target version changed from v0.33 to v0.34
  • translation missing: en.field_position deleted (38)
  • translation missing: en.field_position set to 2

#18 Updated by Sage Weil almost 10 years ago

  • translation missing: en.field_position deleted (11)
  • translation missing: en.field_position set to 39

#19 Updated by Sage Weil almost 10 years ago

  • Target version changed from v0.34 to 12
  • translation missing: en.field_position deleted (39)
  • translation missing: en.field_position set to 1

#20 Updated by Sage Weil almost 10 years ago

  • Target version changed from 12 to v0.38

#21 Updated by Sage Weil almost 10 years ago

  • Target version changed from v0.38 to v0.36
  • translation missing: en.field_position deleted (54)
  • translation missing: en.field_position set to 1
  • translation missing: en.field_position changed from 1 to 851

#22 Updated by Sage Weil almost 10 years ago

  • Target version changed from v0.36 to v0.37
  • translation missing: en.field_position deleted (871)
  • translation missing: en.field_position set to 11

#23 Updated by Sage Weil almost 10 years ago

  • Target version deleted (v0.37)

#24 Updated by Sage Weil over 9 years ago

  • Status changed from New to Resolved
  • Target version set to v0.39

yay thrashing

Also available in: Atom PDF