Bug #4371

closed

osd/ReplicatedPG.cc: 4814: FAILED assert(peer_missing.count(fromosd))

Added by Samuel Just about 11 years ago. Updated about 11 years ago.

Status: Resolved
Priority: Urgent
Assignee:
Category: -
Target version: -
% Done: 0%
Source: Development
Tags:
Backport: bobtail
Regression:
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Seems to be caused by check_recovery_sources only being invoked while the pg is active:
1) pg is in GetInfo, acting is [3,1]
2) we find object A on osd 0 in GetInfo
3) osd 0 goes down; there is no new peering interval since it is neither up nor acting
4) we go active and try to pull from osd 0, since missing_loc never got cleaned up.

Testing fix now.
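
For readers who have not seen the OSD code, the sequence in the description can be replayed with a toy model. This is only a sketch and not the Ceph implementation: ToyPG, found_on, handle_osd_down, pull_would_assert and replay_scenario are hypothetical names, and the model assumes that a down OSD's peer_missing entry is dropped while its missing_loc entry is only cleaned when check_recovery_sources runs. Running the cleanup in every state is shown as one plausible remedy; the ticket does not say what the actual fix does.

```cpp
#include <cassert>
#include <map>
#include <set>
#include <string>

// Hypothetical, simplified stand-in for a PG's recovery bookkeeping.
// Field names mirror the ticket; the logic is an illustration only.
struct ToyPG {
  bool active = false;
  // object name -> OSD ids believed to hold a usable copy
  std::map<std::string, std::set<int>> missing_loc;
  // OSD ids for which we still hold per-peer missing state
  std::set<int> peer_missing;

  // Step 2: during GetInfo we learn that `osd` has a copy of `oid`.
  void found_on(const std::string& oid, int osd) {
    missing_loc[oid].insert(osd);
    peer_missing.insert(osd);
  }

  // Drop every down OSD from the set of recovery sources.
  void check_recovery_sources(const std::set<int>& down_osds) {
    for (auto it = missing_loc.begin(); it != missing_loc.end();) {
      for (int osd : down_osds) it->second.erase(osd);
      if (it->second.empty()) it = missing_loc.erase(it);
      else ++it;
    }
  }

  // Step 3: an OSD goes down. Assumption: per-peer state is always
  // dropped, but missing_loc is cleaned only if we are active or the
  // caller asks for cleanup in every state.
  void handle_osd_down(int osd, bool cleanup_in_all_states) {
    peer_missing.erase(osd);
    if (active || cleanup_in_all_states) check_recovery_sources({osd});
  }

  // Step 4: would pulling `oid` pick a source with no peer_missing
  // entry, i.e. trip assert(peer_missing.count(fromosd))?
  bool pull_would_assert(const std::string& oid) const {
    auto it = missing_loc.find(oid);
    if (it == missing_loc.end()) return false;  // no source left to pull from
    for (int osd : it->second)
      if (peer_missing.count(osd) == 0) return true;
    return false;
  }
};

// Replays steps 1-4 and reports whether the pull would hit the assert.
bool replay_scenario(bool cleanup_in_all_states) {
  ToyPG pg;                                   // step 1: still in GetInfo
  pg.found_on("A", 0);                        // step 2
  assert(pg.missing_loc.count("A") == 1);
  pg.handle_osd_down(0, cleanup_in_all_states);  // step 3
  pg.active = true;                           // step 4: go active
  return pg.pull_would_assert("A");
}
```

In this model, replay_scenario(false) reports that the pull would hit the assert, because osd 0 is still listed in missing_loc for object A. replay_scenario(true) does not, because the stale source was removed before the pg went active.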


Related issues 1 (0 open, 1 closed)

Has duplicate Ceph - Bug #4396: osd crashed in ReplicatedPG::do_op in the nightlies | Duplicate | Sage Weil | 03/08/2013

Actions #1

Updated by Samuel Just about 11 years ago

pending merge of wip_omap_snaps

Actions #2

Updated by Samuel Just about 11 years ago

  • Status changed from 12 to Pending Backport
Actions #3

Updated by Samuel Just about 11 years ago

de22b186c497ce151217aecf17a8d35cdbf549bb

Actions #4

Updated by Samuel Just about 11 years ago

  • Status changed from Pending Backport to Resolved
Actions #5

Updated by Samuel Just about 11 years ago

a30903c6adaa023587d3147179d6038ad37ca520
