Bug #13422
pid file that was deleted after ceph restart leads to osd process lose control
Description
When Ceph is restarted, the same OSD can be started twice: the first time by the regular start, the second time by ceph-disk activate-all. If the second startup begins before the first has created the pid file, a second process is launched as well. Only one of the two processes can acquire fsid_lock and survive; the losing process then deletes the pid file as it exits. As a result, the surviving OSD no longer has a pid file and may not stop properly when using 'service ceph stop osd.{id}'.
Proposed resolution:
Lock the pid file using a file lock.
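The proposal above can be illustrated with a minimal Python sketch. It is not the patch that was merged into Ceph; the PidFile class, its method names, and the use of flock() are assumptions made for illustration only. The idea is that a process writes the pid file only after it holds an exclusive lock on it, and removes the file on exit only if it was the lock holder.

```python
import fcntl
import os


class PidFile:
    """Hypothetical helper: owns a pid file only while holding a lock on it."""

    def __init__(self, path):
        self.path = path
        self.fd = None  # set only if this process won the lock

    def acquire(self):
        # Open without truncating, so a losing process never wipes
        # the pid that the winner has already written.
        fd = os.open(self.path, os.O_RDWR | os.O_CREAT, 0o644)
        try:
            # Non-blocking exclusive lock: fails at once if another
            # open file description already holds it.
            fcntl.flock(fd, fcntl.LOCK_EX | fcntl.LOCK_NB)
        except OSError:
            os.close(fd)
            return False
        os.ftruncate(fd, 0)
        os.write(fd, f"{os.getpid()}\n".encode())
        self.fd = fd
        return True

    def release(self):
        # A process that never held the lock must leave the file alone.
        if self.fd is None:
            return
        os.unlink(self.path)
        os.close(self.fd)  # closing the descriptor drops the lock
        self.fd = None
```

With this approach, a duplicate start whose acquire() returns False can exit without touching the pid file of the running daemon. A production version would likely also need to check, after taking the lock, that the path still refers to the locked file, since another process could have unlinked it in between.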
Updated by huang jun over 8 years ago
We have also hit this many times, even with https://github.com/ceph/ceph/commit/487c22a8a4b3ba099f9c19125c720e99e7c8d0db applied.
Updated by shun song over 8 years ago
How about not removing the pid file when the process exits? The pid file is important for judging whether an osd/mon is running. At the same time, to avoid competing writes to the pid file when one osd or monitor is started multiple times, a lock is recommended.
Updated by Nathan Cutler over 8 years ago
master PR: https://github.com/ceph/ceph/pull/6763
Updated by Nathan Cutler over 8 years ago
- Status changed from New to Fix Under Review
Updated by Kefu Chai over 8 years ago
- Has duplicate Bug #13238: duplicate start of ceph-osd daemon added
Updated by Shinobu Kinjo over 8 years ago
Will the patch be (able to be) backported to 0.80.8?
Updated by Kefu Chai about 8 years ago
"Will the patch be (able to be) backported to 0.80.8?"
@Shinobu 0.80.8 is a firefly release, and since firefly is retired (see http://docs.ceph.com/docs/master/releases/), we are not going to backport to this release.
And yes, it can be backported to 0.80.8, but this will not happen upstream.
Updated by Kefu Chai about 8 years ago
- Status changed from Fix Under Review to Pending Backport
Updated by Loïc Dachary about 8 years ago
- Copied to Backport #14582: infernalis: pid file that was deleted after ceph restart leads to osd process lose control added
Updated by Loïc Dachary about 8 years ago
- Copied to Backport #14583: hammer: pid file that was deleted after ceph restart leads to osd process lose control added
Updated by Loïc Dachary about 8 years ago
- Status changed from Pending Backport to In Progress
- Assignee changed from Kefu Chai to Loïc Dachary
Updated by Loïc Dachary about 8 years ago
- Has duplicate Bug #14575: daemons leave pid files behind in /home/ubuntu/cephtest added
Updated by Loïc Dachary about 8 years ago
Updated by Kefu Chai about 8 years ago
- Status changed from In Progress to Fix Under Review
Updated by Loïc Dachary about 8 years ago
- Status changed from Resolved to Pending Backport
Updated by Loïc Dachary about 8 years ago
Updated by Loïc Dachary over 7 years ago
- Status changed from Pending Backport to Resolved