Project

General

Profile

Bug #45266

follower monitors can grow beyond memory target

Added by Josh Durgin over 3 years ago. Updated over 3 years ago.

Status:
Resolved
Priority:
High
Category:
Performance/Resource Usage
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
octopus, nautilus
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Monitor
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

The leader monitor periordically tells tcmalloc to release memory back to the OS, but follower monitors do not. This can result in follower monitors using more memory than their memory target, and potentially getting oom killed.

A workaround is to reset the mon_memory_target config option, which will cause all monitors to ask tcmalloc to release its free memory.

Alternately, mon_memory_autotune can be disabled.


Related issues

Copied to RADOS - Backport #45391: nautilus: follower monitors can grow beyond memory target Resolved
Copied to RADOS - Backport #45392: octopus: follower monitors can grow beyond memory target Resolved

History

#1 Updated by Sridhar Seshasayee over 3 years ago

  • Assignee set to Sridhar Seshasayee

Taking ownership of this.

-Sridhar

#2 Updated by Josh Durgin over 3 years ago

  • Pull request ID set to 34837

#3 Updated by Neha Ojha over 3 years ago

  • Status changed from New to Fix Under Review

#4 Updated by Kefu Chai over 3 years ago

  • Status changed from Fix Under Review to Pending Backport

#5 Updated by Sridhar Seshasayee over 3 years ago

  • Copied to Backport #45391: nautilus: follower monitors can grow beyond memory target added

#6 Updated by Sridhar Seshasayee over 3 years ago

  • Copied to Backport #45392: octopus: follower monitors can grow beyond memory target added

#7 Updated by Nathan Cutler over 3 years ago

  • Status changed from Pending Backport to Resolved

While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are in status "Resolved" or "Rejected".

Also available in: Atom PDF