Project

General

Profile

Actions

Bug #11197

closed

pg stuck stale after create with activation delay

Added by Samuel Just about 9 years ago. Updated almost 9 years ago.

Status:
Resolved
Priority:
Urgent
Assignee:
Category:
-
Target version:
-
% Done:

0%

Source:
Q/A
Tags:
Backport:
firefly
Regression:
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

I think as soon as the mon gets a status from an osd(peering for instance), the pg moves off of the creating list and we stop sending creates. However, if the primary dies between sending that status and the replicas recording the initial activation, it will not have been written out on any other osd and will end up without any copies, yet not creating. As a bandaid, we can choose not to move it out of creating state until we activate?

(Not a hammer blocker, very very old bug as far as I can tell)

Actions #1

Updated by Samuel Just about 9 years ago

  • Status changed from New to 7

I'm somewhat wrong, the problem is that we don't set creating at all when handling the initial create message. It gets set on the first start_peering_interval after that point.

Actions #2

Updated by Sage Weil about 9 years ago

  • Status changed from 7 to Pending Backport
  • Source changed from other to Q/A
  • Backport set to firefly
Actions #4

Updated by Loïc Dachary almost 9 years ago

29bc9e5 PG: set/clear CREATING in Primary state entry/exit (in firefly),

Actions #5

Updated by Loïc Dachary almost 9 years ago

  • Status changed from Pending Backport to Resolved
Actions

Also available in: Atom PDF