Bug #11798
closed
upstart: configuration is too generous on restarts
% Done:
0%
Source:
Development
Tags:
Backport:
hammer, firefly
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):
Description
See https://bugzilla.redhat.com/show_bug.cgi?id=1210871 for the investigation that prompted this.
Our current upstart scripts are probably too generous about restarting processes. At the moment each daemon is configured to keep restarting as long as it doesn't exceed 5 crashes in 30 seconds. The restart process on some of them can exceed 6 seconds (at least some of the time), in which case 5 crashes cannot fit into a 30-second window and the limit is never reached. Any daemon that is crashing that frequently is probably stuck on a disk state issue, so restarting it again is unlikely to help.
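To make the arithmetic concrete, here is a minimal sketch of the limit described above. It is a simplified model, not upstart's actual accounting: it assumes crashes are evenly spaced, that a counting window opens at the first start, and that the job is stopped only once the respawn count exceeds the limit inside that window. The function name and parameters are hypothetical; the values 5 and 30 come from the description.

```python
def respawn_limit_trips(limit, interval, cycle_seconds, max_crashes=1000):
    """Return True if a daemon crashing every `cycle_seconds` is eventually stopped.

    Simplified model (an assumption, not upstart's real implementation):
    respawns inside the current window increment a counter, the job is
    stopped once the counter exceeds `limit`, and a respawn that arrives
    after the window has expired opens a fresh window with a zero count.
    """
    window_start = 0.0
    count = 0
    now = 0.0
    for _ in range(max_crashes):
        now += cycle_seconds  # one crash-and-restart cycle elapses
        if now - window_start > interval:
            # Window expired: start counting again from this respawn.
            window_start = now
            count = 0
        else:
            count += 1
            if count > limit:
                return True  # limit exceeded, job would be stopped
    return False  # the daemon keeps being restarted indefinitely


if __name__ == "__main__":
    for cycle in (1.0, 5.0, 7.0):
        print(cycle, respawn_limit_trips(5, 30, cycle))
```

Under these assumptions a daemon that crashes every second is stopped quickly, while one whose crash-and-restart cycle takes 7 seconds is restarted forever, which is the behaviour this ticket is concerned about.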
We need to run some tests to figure out more reasonable values and change them.