Bug #11398
closed
weird, objectstoretool()
uses kill_daemons()
to get the exclusive access to the underlying FileStore
. and kill_daemons()
waited until the process disappeared. the fsid flock is relinquished when ceph-osd returns.
kill_daemons: 177: for try in 0 1 1 1 2 3
kill_daemons: 178: kill -TERM 12506
kill_daemons: 179: send_signal=0
kill_daemons: 180: sleep 0
kill_daemons: 177: for try in 0 1 1 1 2 3
kill_daemons: 178: kill -0 12506
kill_daemons: 179: send_signal=0
kill_daemons: 180: sleep 1
kill_daemons: 177: for try in 0 1 1 1 2 3
kill_daemons: 178: kill -0 12506
kill_daemons: 179: send_signal=0
kill_daemons: 180: sleep 1
kill_daemons: 177: for try in 0 1 1 1 2 3
kill_daemons: 178: kill -0 12506
kill_daemons: 179: send_signal=0
kill_daemons: 180: sleep 1
kill_daemons: 177: for try in 0 1 1 1 2 3
kill_daemons: 178: kill -0 12506
kill_daemons: 178: break
- Status changed from 12 to Can't reproduce
Did not occur in the past few weeks.
- Status changed from Can't reproduce to 12
- Regression set to No
kill_daemons: 177: for try in 0 1 1 1 2 3
kill_daemons: 178: kill -0 28608
kill_daemons: 179: send_signal=0
kill_daemons: 180: sleep 2
kill_daemons: 177: for try in 0 1 1 1 2 3
kill_daemons: 178: kill -0 28608
kill_daemons: 179: send_signal=0
kill_daemons: 180: sleep 3
objectstore_tool: 682: ceph-objectstore-tool --data-path testdir/osd-scrub-repair/3 --journal-path testdir/osd-scrub-repair/3/journal SOMETHING list-attrs
OSD has the store locked
the daemon is not actually killed, the kill_daemons function gave up after trying for 3+2+1+1+1 seconds which can happen sometime when the machine is extra slow. It's usually not a big deal and the kill_daemons is actually documented to have that behavior. But when we are to run ceph-objectstore-tool there
must not be any remaining daemons. A variant of kill_daemons must be implemented that guarantees it will either kill all daemons or fail trying.
- Status changed from 12 to Fix Under Review
- Backport set to hammer
- Status changed from Fix Under Review to Pending Backport
- Status changed from Pending Backport to Resolved
Also available in: Atom
PDF