Bug #41218

mgr/volumes: retry spawning purge threads on failure

Added by Venky Shankar about 1 year ago. Updated 12 months ago.

Status: Resolved
Priority: High
Assignee:
Category: -
Target version:
% Done: 0%
Source:
Tags:
Backport: nautilus
Regression: No
Severity: 2 - major
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS): mgr/volumes
Labels (FS):
Pull request ID:
Crash signature:

Description

Seen here: http://qa-proxy.ceph.com/teuthology/pdonnell-2019-08-07_15:57:31-fs-wip-pdonnell-testing-20190807.132723-distro-basic-smithi/4193689/teuthology.log

There were no logs available for further debugging :(

Patrick saw another instance of this error with no specific cause related to mgr/volumes. There may be a memory leak in the manager, and this failure might just be a side effect of a bigger issue.

For now, we should probably retry when a purge thread fails to spawn. The retries must not be endless, though: there needs to be an upper cap, after which we log the error to the cluster log and/or update the status in `ceph status`.
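
A rough sketch of the bounded retry idea, for illustration only. This is not the code from the actual fix. The names `spawn_with_retry`, `max_retries`, `delay` and `thread_factory` are made up here, and plain Python `logging` stands in for the cluster log / `ceph status` reporting. It relies on `threading.Thread.start()` raising `RuntimeError` when the thread cannot be started.

```python
import logging
import threading
import time

log = logging.getLogger(__name__)


def spawn_with_retry(target, name, max_retries=5, delay=1.0,
                     thread_factory=threading.Thread):
    """Try to start a thread, giving up after max_retries attempts.

    All names and defaults here are hypothetical. Returns the started
    thread, or None if every attempt failed.
    """
    for attempt in range(1, max_retries + 1):
        try:
            thread = thread_factory(target=target, name=name)
            thread.daemon = True
            thread.start()
            return thread
        except RuntimeError as e:
            # Thread.start() raises RuntimeError when no new thread
            # can be started.
            log.warning("failed to spawn %s (attempt %d/%d): %s",
                        name, attempt, max_retries, e)
            if attempt < max_retries:
                time.sleep(delay)
    # Upper cap reached: report the failure instead of retrying forever.
    # A real mgr module would surface this in the cluster log and/or
    # `ceph status`; plain logging is used here as a stand-in.
    log.error("giving up spawning %s after %d attempts", name, max_retries)
    return None
```

The caller has to handle the `None` return, for example by running with fewer purge threads or by raising a health warning as discussed in Bug #41219.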


Related issues

Related to fs - Bug #41219: mgr/volumes: send purge thread (and other) health warnings to `ceph status` Resolved 08/13/2019
Copied to fs - Backport #41889: nautilus: mgr/volumes: retry spawning purge threads on failure Resolved

History

#1 Updated by Venky Shankar about 1 year ago

  • Related to Bug #41219: mgr/volumes: send purge thread (and other) health warnings to `ceph status` added

#2 Updated by Ramana Raja about 1 year ago

  • Backport changed from luminous, mimic to nautilus

#3 Updated by Venky Shankar about 1 year ago

thanks -- those were cached in my browser :P

#4 Updated by Venky Shankar about 1 year ago

  • Pull request ID set to 29735

#5 Updated by Venky Shankar about 1 year ago

  • Status changed from New to Fix Under Review

#6 Updated by Patrick Donnelly about 1 year ago

  • Status changed from Fix Under Review to Pending Backport

#7 Updated by Nathan Cutler about 1 year ago

  • Copied to Backport #41889: nautilus: mgr/volumes: retry spawning purge threads on failure added

#8 Updated by Nathan Cutler 12 months ago

  • Status changed from Pending Backport to Resolved

While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are in status "Resolved" or "Rejected".
