Project

General

Profile

Bug #41218

mgr/volumes: retry spawning purge threads on failure

Added by Venky Shankar 2 months ago. Updated 16 days ago.

Status:
Resolved
Priority:
High
Assignee:
Category:
-
Target version:
Start date:
08/13/2019
Due date:
% Done:

0%

Source:
Tags:
Backport:
nautilus
Regression:
No
Severity:
2 - major
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS):
mgr/volumes
Labels (FS):
Pull request ID:
Crash signature:

Description

seen here: http://qa-proxy.ceph.com/teuthology/pdonnell-2019-08-07_15:57:31-fs-wip-pdonnell-testing-20190807.132723-distro-basic-smithi/4193689/teuthology.log

there were no logs available for further debugging :(

Patrick saw another instance of this error with no specific reason related to mgr/volumes. There could be a possibility of a memory leak in manager and this might just be a side effect of a bigger issue.

For now, we should probably retry when a purge thread fails to spawn. But this should not be endless and there needs to be an upper cap for this, after which we log the error to cluster log and/or update status in `ceph status`.


Related issues

Related to fs - Bug #41219: mgr/volumes: send purge thread (and other) health warnings to `ceph status` Resolved 08/13/2019
Copied to fs - Backport #41889: nautilus: mgr/volumes: retry spawning purge threads on failure Resolved

History

#1 Updated by Venky Shankar 2 months ago

  • Related to Bug #41219: mgr/volumes: send purge thread (and other) health warnings to `ceph status` added

#2 Updated by Ramana Raja 2 months ago

  • Backport changed from luminous, mimic to nautilus

#3 Updated by Venky Shankar 2 months ago

thanks -- those were cache in my browser :P

#4 Updated by Venky Shankar 2 months ago

  • Pull request ID set to 29735

#5 Updated by Venky Shankar 2 months ago

  • Status changed from New to Need Review

#6 Updated by Patrick Donnelly about 1 month ago

  • Status changed from Need Review to Pending Backport

#7 Updated by Nathan Cutler about 1 month ago

  • Copied to Backport #41889: nautilus: mgr/volumes: retry spawning purge threads on failure added

#8 Updated by Nathan Cutler 16 days ago

  • Status changed from Pending Backport to Resolved

While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are in status "Resolved" or "Rejected".

Also available in: Atom PDF