Project

General

Profile

Actions

Feature #1007

closed

qa: osd failure and cluster recovery test(s)

Added by Sage Weil about 13 years ago. Updated over 12 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
-
Category:
qa
Target version:
% Done:

0%

Source:
Tags:
Backport:
Reviewed:
Affected Versions:
Pull request ID:

Description

We need tests of OSD failures that verify the cluster is able to recover. Eventually this will need to be fleshed out to include a variety of failure scenarios that try to get good coverage on the peering and recovery code. We can start out with some pretty simple tests, though:

- restart an osd once, or every few minutes.  verify get back to all active+clean.  maybe within some time bound?
- stop an osd, mark it out. continue operation for a while (dirty lots of objects). then re-add the osd. (this exercises code paths similar to a regular cluster expansion)
- restart multiple (or all) osds simultaneously.

Related issues 1 (0 open1 closed)

Blocked by Ceph - Feature #1212: teuthology: ability to restart daemons while other tasks are runningResolvedSamuel Just06/21/2011

Actions
Actions #1

Updated by Sage Weil about 13 years ago

  • Translation missing: en.field_position set to 8
Actions #2

Updated by Sage Weil about 13 years ago

  • Translation missing: en.field_story_points set to 33
  • Translation missing: en.field_position deleted (9)
  • Translation missing: en.field_position set to 9
Actions #3

Updated by Sage Weil about 13 years ago

  • Translation missing: en.field_story_points changed from 33 to 3
  • Translation missing: en.field_position deleted (9)
  • Translation missing: en.field_position set to 9
Actions #4

Updated by Greg Farnum about 13 years ago

  • Assignee set to Samuel Just
Actions #5

Updated by Anonymous about 13 years ago

  • Assignee changed from Samuel Just to Anonymous
Actions #6

Updated by Sage Weil almost 13 years ago

  • Target version changed from v0.28 to v0.29
Actions #7

Updated by Sage Weil almost 13 years ago

  • Assignee deleted (Anonymous)
Actions #8

Updated by Sage Weil almost 13 years ago

  • Target version changed from v0.29 to v0.30
Actions #9

Updated by Sage Weil almost 13 years ago

  • Subject changed from autotest: osd failure and cluster recovery test(s) to qa: osd failure and cluster recovery test(s)
Actions #10

Updated by Sage Weil almost 13 years ago

  • Translation missing: en.field_position deleted (39)
  • Translation missing: en.field_position set to 3
Actions #11

Updated by Sage Weil almost 13 years ago

  • Target version changed from v0.30 to v0.31
Actions #12

Updated by Sage Weil almost 13 years ago

  • Translation missing: en.field_position deleted (26)
  • Translation missing: en.field_position set to 27
Actions #13

Updated by Sage Weil almost 13 years ago

  • Target version changed from v0.31 to v0.32
Actions #14

Updated by Sage Weil almost 13 years ago

  • Translation missing: en.field_position deleted (31)
  • Translation missing: en.field_position set to 730
Actions #15

Updated by Sage Weil almost 13 years ago

  • Translation missing: en.field_position deleted (730)
  • Translation missing: en.field_position set to 1
  • Translation missing: en.field_position changed from 1 to 748
Actions #16

Updated by Sage Weil almost 13 years ago

  • Target version changed from v0.32 to v0.33
  • Translation missing: en.field_position deleted (748)
  • Translation missing: en.field_position set to 1
Actions #17

Updated by Sage Weil over 12 years ago

  • Target version changed from v0.33 to v0.34
  • Translation missing: en.field_position deleted (38)
  • Translation missing: en.field_position set to 2
Actions #18

Updated by Sage Weil over 12 years ago

  • Translation missing: en.field_position deleted (11)
  • Translation missing: en.field_position set to 39
Actions #19

Updated by Sage Weil over 12 years ago

  • Target version changed from v0.34 to 12
  • Translation missing: en.field_position deleted (39)
  • Translation missing: en.field_position set to 1
Actions #20

Updated by Sage Weil over 12 years ago

  • Target version changed from 12 to v0.38
Actions #21

Updated by Sage Weil over 12 years ago

  • Target version changed from v0.38 to v0.36
  • Translation missing: en.field_position deleted (54)
  • Translation missing: en.field_position set to 1
  • Translation missing: en.field_position changed from 1 to 851
Actions #22

Updated by Sage Weil over 12 years ago

  • Target version changed from v0.36 to v0.37
  • Translation missing: en.field_position deleted (871)
  • Translation missing: en.field_position set to 11
Actions #23

Updated by Sage Weil over 12 years ago

  • Target version deleted (v0.37)
Actions #24

Updated by Sage Weil over 12 years ago

  • Status changed from New to Resolved
  • Target version set to v0.39

yay thrashing

Actions

Also available in: Atom PDF