Project

General

Profile

Actions

Bug #8342

closed

init script may not start all OSDs

Added by Dmitry Smirnov almost 10 years ago. Updated almost 10 years ago.

Status:
Resolved
Priority:
Urgent
Assignee:
Category:
OSD
Target version:
-
% Done:

100%

Source:
Community (user)
Tags:
Backport:
Regression:
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

On server with multiple OSDs init script aborts and do not attempt to start other OSDs after failure to start one of them.

This may happen when HDD died or removed. For example
on boot of server with OSD[1,2,3,4] if OSD.1 is not mounted to /var/lib/ceph/osd/ceph-1 no OSD will be started at all.

To fix the issue I created pull request https://github.com/ceph/ceph/pull/1730
however Sage commented that it may not be the best way to fix the problem.

Actions #1

Updated by Sage Weil almost 10 years ago

  • Assignee set to Sage Weil
  • Priority changed from Normal to Urgent
Actions #2

Updated by Sage Weil almost 10 years ago

  • Status changed from New to Fix Under Review

https://github.com/ceph/ceph/pull/1846

Can you take a look at this patch? I think I understand now what you were seeing, and I think this addresses it.

The important bit is to make sure that the init script eventually still returns an error, even though some of the other OSDs were started.

Actions #3

Updated by Dmitry Smirnov almost 10 years ago

Sage, your fix works perfectly. Thank you.

Actions #4

Updated by Loïc Dachary almost 10 years ago

  • Status changed from Fix Under Review to Resolved
  • % Done changed from 0 to 100
Actions

Also available in: Atom PDF