Project

General

Profile

Actions

Bug #64082

open

When I set debug_mgr 30/30, the memory of ceph-mgr will keep rising

Added by Bo Zhang 4 months ago. Updated 3 months ago.

Status:
New
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
Regression:
No
Severity:
1 - critical
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

This is an inevitable question,I did the following operations.
1.Collects heap information
ceph tell mgr.HOSTNAME heap start_profiler (Not supported by default, I added it myself)
ceph tell mgr.HOSTNAME heap dump
2.Processes heap files
pprof --pdf /home/zhangbo/ceph-mgr /var/log/ceph/mgr.node149.profile.0002.heap >test.heap.pdf
See the attachment for the test.heap.pdf
3.Compare the data from two samplings, with an interval of about 2 hours.
pprof --pdf --base /var/log/ceph/mgr.node149.profile.0004.heap /home/zhangbo/ceph-mgr /var/log/ceph/mgr.node149.profile.0050.heap > test.compare.pdf
See the attachment for the test.compare.pdf

I tested the above on 14.2.8, and 17.2.5 also has this problem,And more serious.
The memory of ceph-mgr continues to increase and there is no downward trend. I noticed that the memory of ceph-mgr has increased to 20G or even higher.If the mgr service has not been restarted for several years, the memory may also gradually increase (uses the default log level 1/5).

The ceph-mon has the same problem, but the log volume of mon is less than that of mgr (debug_mon = debug_mgr =30/30). The phenomenon is that the memory of ceph-mon increases more slowly and takes up less space.

This bug has the potential to OOM the whole system.


Files

test.heap.pdf (15.5 KB) test.heap.pdf Bo Zhang, 01/18/2024 08:01 AM
test.compare.pdf (13.3 KB) test.compare.pdf Bo Zhang, 01/18/2024 08:02 AM
mon.heap.pdf (18.3 KB) mon.heap.pdf Bo Zhang, 01/18/2024 08:13 AM

Related issues 1 (1 open0 closed)

Is duplicate of mgr - Bug #64078: When I set debug_mgr 30/30, the memory of ceph-mgr will keep risingNew

Actions
Actions #1

Updated by Bo Zhang 4 months ago

To view the mon's memory information, please see the mon.heap.pdf in the attachment.

There is a problem with the ceph log module.

Actions #2

Updated by Bo Zhang 4 months ago

ceph 17.2.5 behaves more severely than 14.2.8, and the memory grows several times faster

Actions #3

Updated by Igor Fedotov 4 months ago

  • Is duplicate of Bug #64078: When I set debug_mgr 30/30, the memory of ceph-mgr will keep rising added
Actions #4

Updated by Igor Fedotov 4 months ago

  • Project changed from bluestore to mgr
Actions #5

Updated by Bo Zhang 3 months ago

If I don’t add logs to m_recent, the memory will not increase significantly.Reducing the value of DEFAULT_MAX_RECENT will also be useful.

Actions

Also available in: Atom PDF