Project

General

Profile

Actions

Bug #8495

closed

osd: bad state machine event on backfill request

Added by Dmitry Smirnov almost 10 years ago. Updated almost 10 years ago.

Status:
Duplicate
Priority:
High
Assignee:
-
Category:
OSD
Target version:
-
% Done:

0%

Source:
Community (user)
Tags:
Backport:
Regression:
Severity:
2 - major
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Three OSDs crashed together for no apparent reason during routine backfilling/remapping.
Situation: 12 OSDs on 5 hosts; 1 OSD down/in; 1 OSD up/out.

I've decided to replace one OSD so I've used command 'ceph osd out 3'.
Some hours later 3 OSDs crashed all together.

See attached logs.


Files

ceph-osd.0.log.xz (133 KB) ceph-osd.0.log.xz Dmitry Smirnov, 05/31/2014 02:52 AM
ceph-osd.1.log.xz (137 KB) ceph-osd.1.log.xz Dmitry Smirnov, 05/31/2014 02:52 AM
ceph-osd.6.log.xz (140 KB) ceph-osd.6.log.xz Dmitry Smirnov, 05/31/2014 02:52 AM
crushmap (972 Bytes) crushmap Dmitry Smirnov, 05/31/2014 02:52 AM

Related issues 1 (0 open1 closed)

Related to Ceph - Bug #7922: osd: multi-backfill reservation does not release on rejectResolvedDavid Zafman03/31/2014

Actions
Actions

Also available in: Atom PDF