Bug #41618

closed

14.2.1->14.2.2 ceph-mgr hard segfault. devicehealth?

Added by Harry Coin over 4 years ago. Updated over 4 years ago.

Status: Duplicate
Priority: Normal
Assignee: -
Category: ceph-mgr
Target version:
% Done: 0%
Source:
Tags: ceph-mgr segfault
Backport:
Regression: Yes
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

ceph-mgr segfaults hard at load after the upgrade from 14.2.1 to 14.2.2. gdb reports:
"Thread 33 "devicehealth" received signal SIGSEGV, Segmentation fault."
I suspect it is related to one of two recent changes:
a pg_num change from 4 to 16 on a metadata volume,
or
turning on disk health checking.

Either way, this is in a KVM VM running Linux 5.2.0-15-generic #16-Ubuntu SMP Fri Aug 23 20:16:23 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux, with the VM CPU set to Conroe/Core 2 Duo, on the latest Ubuntu eoan with Ceph 14.2.2.
See attached gdb log, with debug symbols and backtraces.

100% reproducible. Kills ceph-mgr.
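
For anyone triaging the attached gdb logs, the crashing thread can be pulled out mechanically. The following is a minimal sketch, assuming the logs contain gdb's usual signal report line in the form quoted above; the function name `find_signal_reports` and the regular expression are illustrative and not part of Ceph or gdb.

```python
import re

# Matches gdb's signal report line, for example:
#   Thread 33 "devicehealth" received signal SIGSEGV, Segmentation fault.
# The pattern is an assumption based on the single line quoted in this report.
SIGNAL_LINE = re.compile(
    r'Thread (?P<num>\d+) "(?P<name>[^"]*)" received signal '
    r'(?P<signal>SIG\w+), (?P<desc>.+?)\.?$'
)


def find_signal_reports(log_text):
    """Return (thread number, thread name, signal, description) per report line."""
    reports = []
    for line in log_text.splitlines():
        match = SIGNAL_LINE.search(line.strip())
        if match:
            reports.append(
                (int(match["num"]), match["name"], match["signal"], match["desc"])
            )
    return reports


# The line quoted in the description above, used as sample input.
sample = 'Thread 33 "devicehealth" received signal SIGSEGV, Segmentation fault.'
print(find_signal_reports(sample))
```

To run it against an attached log, read the file's text and pass it to `find_signal_reports`; lines that do not match the pattern are ignored.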


Files

ceph-mgr-crashlog.txt (114 KB), Harry Coin, 09/03/2019 03:58 PM
ceph-mgr-crashlog.txt (237 KB), Harry Coin, 09/03/2019 05:36 PM
log.txt (67 KB), Harry Coin, 09/04/2019 12:20 AM

Related issues 1 (0 open, 1 closed)

Is duplicate of RADOS - Bug #42082: pybind/rados: set_omap() crash on py3 (Resolved, Sage Weil, 09/27/2019)
