Bug #13531

closed

Most recently updated OSD marked lost, cluster still stuck on Peering

Added by shawn chen over 8 years ago. Updated about 8 years ago.

Status: Resolved
Priority: Urgent
Assignee:
Category: -
Target version: -
% Done: 0%
Source: Community (dev)
Tags:
Backport: infernalis, hammer
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Steps to reproduce:

Test environment: three OSDs (osd.0, osd.1, osd.2).

Use rados put xxx xxx -p xxx to write data.
Assuming osd.1 is the primary, kill -9 osd.0 and osd.2.

Use rados put to write data again.
kill -9 osd.1.
Restart osd.0 and osd.2.

ceph -s now shows the PG in the down state, and ceph pg xxx query shows the peering state blocked by osd.1.
Then run ceph osd lost 1 --yes-i-really-mean-it.
The peering state is still stuck, still reported as blocked by osd.1.
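
The sequence above can be sketched as a small shell script. This is only an illustration, not the reporter's actual test: the pool name, object names, source file, and PG id are hypothetical placeholders, and kill_osd / start_osd are hypothetical helpers that only print a reminder, because how an OSD daemon is killed or restarted depends on the deployment. The script is a dry run by default and prints each command instead of executing it.

```shell
#!/bin/sh
# Dry run by default: each step is printed, not executed.
# Set RUN to an empty string to actually run the rados/ceph commands.
RUN=${RUN-echo}
POOL=${POOL-testpool}    # hypothetical pool name
OBJ=${OBJ-obj}           # hypothetical object name prefix
SRC=${SRC-/etc/hosts}    # any small local file to upload
PGID=${PGID-0.0}         # hypothetical PG id; use one whose primary is osd.1

# Hypothetical helpers: they only print a reminder, since killing and
# restarting an OSD daemon is deployment-specific.
kill_osd()  { echo "# kill -9 the ceph-osd process of osd.$1"; }
start_osd() { echo "# restart osd.$1"; }

repro() {
    $RUN rados -p "$POOL" put "$OBJ-1" "$SRC"   # write while all OSDs are up
    kill_osd 0
    kill_osd 2                                  # only osd.1 (assumed primary) remains
    $RUN rados -p "$POOL" put "$OBJ-2" "$SRC"   # write that only osd.1 has seen
    kill_osd 1
    start_osd 0
    start_osd 2
    $RUN ceph -s                                # expect the PG to be reported as down
    $RUN ceph pg "$PGID" query                  # expect peering blocked by osd.1
    $RUN ceph osd lost 1 --yes-i-really-mean-it
    $RUN ceph pg "$PGID" query                  # the report: still blocked by osd.1
}

repro
```

One assumption worth stating: the second write can only complete while osd.1 is the sole OSD up if the pool's min_size permits a single replica, which the report does not mention.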

Actions #2

Updated by Sage Weil over 8 years ago

  • Status changed from New to 12
  • Source changed from other to Community (dev)
  • Backport set to infernalis, hammer
Actions #3

Updated by Samuel Just over 8 years ago

This needs a test before merge, shouldn't be hard.

Actions #4

Updated by Kefu Chai over 8 years ago

  • Assignee changed from shawn chen to Kefu Chai
Actions #5

Updated by Kefu Chai over 8 years ago

Will try to ping Shawn about adding a test.

Actions #6

Updated by Sage Weil over 8 years ago

  • Status changed from 12 to 7
Actions #7

Updated by Sage Weil over 8 years ago

  • Status changed from 7 to 17
Actions #8

Updated by Sage Weil about 8 years ago

fix is merged, but we still need a test.

Actions #9

Updated by Kefu Chai about 8 years ago

  • Status changed from 17 to Resolved
  • Assignee changed from Kefu Chai to shawn chen