Bug #8342
closed
init script may not start all OSDs
100%
Description
On a server with multiple OSDs, the init script aborts after failing to start one OSD and does not attempt to start the others.
This may happen when an HDD has died or been removed. For example,
when booting a server with OSD[1,2,3,4], if OSD.1 is not mounted at /var/lib/ceph/osd/ceph-1,
no OSD will be started at all.
To fix the issue I created pull request https://github.com/ceph/ceph/pull/1730;
however, Sage commented that it may not be the best way to fix the problem.
Updated by Sage Weil almost 10 years ago
- Assignee set to Sage Weil
- Priority changed from Normal to Urgent
Updated by Sage Weil almost 10 years ago
- Status changed from New to Fix Under Review
https://github.com/ceph/ceph/pull/1846
Can you take a look at this patch? I think I understand now what you were seeing, and I think this addresses it.
The important bit is to make sure that the init script eventually still returns an error, even though some of the other OSDs were started.
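For illustration only, the behaviour described above (keep starting the remaining OSDs, but still report an error at the end) could be sketched roughly as follows. This is not the code from the pull request: the function names start_osd and start_all_osds and the OSD_ROOT variable are hypothetical, and the directory check is only a stand-in for whatever the real init script does to start a daemon.

```shell
#!/bin/sh
# Hypothetical sketch, not the actual Ceph init script.
# OSD_ROOT is an assumed variable; the default path matches the one in the report.
OSD_ROOT="${OSD_ROOT:-/var/lib/ceph/osd}"

# Stand-in for the real start logic: "starting" fails when the
# OSD data directory is missing (e.g. the disk died or was removed).
start_osd() {
    id="$1"
    [ -d "${OSD_ROOT}/ceph-${id}" ] || return 1
    echo "started osd.${id}"
}

# Try every OSD given as an argument. A failure is remembered but does
# not stop the loop, so the remaining OSDs still get a chance to start.
start_all_osds() {
    rc=0
    for id in "$@"; do
        if ! start_osd "$id"; then
            echo "failed to start osd.${id}" >&2
            rc=1
        fi
    done
    # Non-zero if any OSD failed, even though others were started.
    return "$rc"
}
```

Under these assumptions, calling start_all_osds 1 2 3 4 on a host where only ceph-1 is missing would start OSDs 2, 3 and 4 and still return 1, so the caller can see that something went wrong.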
Updated by Dmitry Smirnov almost 10 years ago
Sage, your fix works perfectly. Thank you.
Updated by Loïc Dachary almost 10 years ago
- Status changed from Fix Under Review to Resolved
- % Done changed from 0 to 100