Project

General

Profile

Bug #1392

osd: PGLog received in Initial state

Added by Sam Lang over 12 years ago. Updated over 12 years ago.

Status:
Resolved
Priority:
High
Assignee:
-
Category:
OSD
Target version:
% Done:

0%

Source:
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

OSD crash on single OSD. Attached is the log and core files.

osd.core.gz (8.72 MB) Sam Lang, 08/11/2011 01:44 PM

osd.12.log.crash (407 KB) Sam Lang, 08/11/2011 01:44 PM

History

#1 Updated by Josh Durgin over 12 years ago

  • Subject changed from osd crash: PG.cc:3891 bad state machine event to osd: PGLog received in Initial state
  • Category set to OSD

#2 Updated by Sage Weil over 12 years ago

  • Target version set to v0.34

#3 Updated by Sage Weil over 12 years ago

  • Priority changed from Normal to High

#4 Updated by Samuel Just over 12 years ago

Sorry for the delay. It looks like the osd received a log message during GetInfo. Would it be possible to reproduce this with OSD debugging over 10?

#5 Updated by Sam Lang over 12 years ago

The crash happened on just that osd (out of 30), and it hasn't happened since, so I don't have a way to easily reproduce it. I guess if it happens again I'll let you know.

#6 Updated by Samuel Just over 12 years ago

Actually, it looks like last_warm_restart isn't set in the Initial state. Thus, between when a PG is created and when it first restarts peering, the pg could erroneously consider stale peering messages...

a88c1790ffb7b708b334a6d9a1deacc300e933be should fix that error. Unfortunately, your error may well have been caused by something else.

Thanks for the help!

#7 Updated by Sage Weil over 12 years ago

  • Status changed from New to Resolved

being optimistic and closing this unless/until we see it again.

Also available in: Atom PDF