Bug #22543

OSDs cannot start after shutdown, killed by OOM killer during PGs load

Added by Volodymyr Blokhin over 3 years ago. Updated over 3 years ago.

Status:
Can't reproduce
Priority:
High
Assignee:
-
Target version:
% Done:

0%

Source:
Community (user)
Tags:
Backport:
Regression:
No
Severity:
1 - critical
Reviewed:
Affected Versions:
ceph-qa-suite:
ceph-disk
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Hello,

After a shutdown, none of the OSDs can start. During the load_pgs stage the ceph-osd process consumes all available virtual memory (RAM + swap), so the OOM killer terminates it.

root@osd001:~# dpkg -l | grep -i ceph
ii ceph-base 12.2.2-1xenial amd64 common ceph daemon libraries and management tools
ii ceph-common 12.2.2-1xenial amd64 common utilities to mount and interact with a ceph storage cluster
ii ceph-fuse 12.2.2-1xenial amd64 FUSE-based client for the Ceph distributed file system
ii ceph-mds 12.2.2-1xenial amd64 metadata server for the ceph distributed file system
ii ceph-osd 12.2.2-1xenial amd64 OSD server for the ceph storage system
ii libcephfs2 12.2.2-1xenial amd64 Ceph distributed file system client library
ii python-cephfs 12.2.2-1xenial amd64 Python 2 libraries for the Ceph libcephfs library
ii python-rados 12.2.2-1xenial amd64 Python 2 libraries for the Ceph librados library
ii python-rbd 12.2.2-1xenial amd64 Python 2 libraries for the Ceph librbd library
ii python-rgw 12.2.2-1xenial amd64 Python 2 libraries for the Ceph librgw library
root@osd001:~# apt-cache policy ceph-osd
ceph-osd:
Installed: 12.2.2-1xenial
Candidate: 12.2.2-1xenial
Version table:
*** 12.2.2-1xenial 1100
1100 https://download.ceph.com/debian-luminous xenial/main amd64 Packages
100 /var/lib/dpkg/status

OSDs_lsblk.txt View - osd hdds list and bluestore partition size (1.63 KB) Volodymyr Blokhin, 12/26/2017 05:56 PM

osd2_perf_dump.txt View - ceph daemon osd.2 perf dump (21.5 KB) Volodymyr Blokhin, 12/26/2017 05:56 PM

osd2_ceph_conf.txt View - cat /etc/ceph/ceph.conf on osd node (399 Bytes) Volodymyr Blokhin, 12/26/2017 05:56 PM

osd2_config_show.txt View - ceph daemon osd.2 config show (55.1 KB) Volodymyr Blokhin, 12/26/2017 05:56 PM

cmn01_ceph_conf.txt View - cat /etc/ceph/ceph.conf on monitor node (598 Bytes) Volodymyr Blokhin, 12/26/2017 05:56 PM

OSD_RAM_usage.png View - grafana mem usage monitoring from one of osd nodes (44.3 KB) Volodymyr Blokhin, 12/26/2017 05:56 PM

ceph_status.txt View - ceph status output (645 Bytes) Volodymyr Blokhin, 12/26/2017 05:56 PM

ceph_osd2_dump_mempools.txt View - ceph daemon osd.2 dump_mempools (1.6 KB) Volodymyr Blokhin, 12/26/2017 05:56 PM

ceph_osd_tree.txt View - ceph osd tree (499 Bytes) Volodymyr Blokhin, 12/26/2017 05:56 PM

ceph_osd_dump.txt View - ceph osd dump (2.62 KB) Volodymyr Blokhin, 12/26/2017 05:56 PM

up_and_fail_cycle_osd2_log.txt View - /var/log/ceph/ceph-osd.N.log (305 KB) Volodymyr Blokhin, 12/26/2017 06:02 PM

History

#2 Updated by Sage Weil over 3 years ago

  • Status changed from New to Need More Info
  • Priority changed from Normal to High

The mempool dump shows 58GB (!) of pg logs. Can you restart the osd with 'debug bluestore = 20' so we can see if it is reading real, valid log entries?

Thanks!
sage
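For reference, the logging Sage asks for has to be set before the OSD's next start attempt, since the daemon dies during load. A minimal sketch of a ceph.conf fragment on the OSD node (section placement and the systemd unit name are assumptions based on a typical Luminous deployment; osd.2 is just the example daemon from the attachments):

```ini
; /etc/ceph/ceph.conf on the OSD node (sketch)
[osd]
    debug bluestore = 20
; then restart the affected daemon, e.g.:
;   systemctl restart ceph-osd@2
```

With this in place, the BlueStore log entries land in /var/log/ceph/ceph-osd.2.log on the next start attempt.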

#3 Updated by Volodymyr Blokhin over 3 years ago

Sage,

Unfortunately we could not wait that long and re-deployed the Ceph cluster on 12/30/2017.
We managed to start ceph-osd (PG load finished) by adding 100 GB of swap to each OSD node.
But the PGs never came online (we waited 36 hours) and we had to re-deploy the cluster.

Sage Weil wrote:

The mempool dump shows 58GB (!) of pg logs. Can you restart the osd with 'debug bluestore = 20' so we can see if it is reading real, valid log entries?

Thanks!
sage

#4 Updated by Sage Weil over 3 years ago

  • Status changed from Need More Info to Can't reproduce
