Project

General

Profile

Actions

Bug #22056

closed

segv in OSDMap::calc_pg_upmaps from balancer

Added by Sage Weil over 6 years ago. Updated over 5 years ago.

Status:
Resolved
Priority:
High
Assignee:
Category:
-
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
mimic, luminous
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

(gdb) bt
#0  0x00007fb53adfc269 in raise (sig=sig@entry=11) at ../sysdeps/unix/sysv/linux/pt-raise.c:35
#1  0x00005637937d2ece in reraise_fatal (signum=11) at /build/ceph-12.2.1-648-g3f7eadb/src/global/signal_handler.cc:74
#2  handle_fatal_signal (signum=11) at /build/ceph-12.2.1-648-g3f7eadb/src/global/signal_handler.cc:138
#3  <signal handler called>
#4  0x00005637938fff8f in OSDMap::_apply_upmap (this=this@entry=0x7fb52bf36d10, pi=..., raw_pg=..., raw=raw@entry=0x7fb52bf368a0) at /build/ceph-12.2.1-648-g3f7eadb/src/osd/OSDMap.cc:1981
#5  0x0000563793904987 in OSDMap::_pg_to_up_acting_osds (this=0x7fb52bf36d10, pg=..., up=up@entry=0x7fb52bf36bd0, up_primary=up_primary@entry=0x0, acting=acting@entry=0x0, acting_primary=acting_primary@entry=0x0, raw_pg_to_pg=true)
    at /build/ceph-12.2.1-648-g3f7eadb/src/osd/OSDMap.cc:2164
#6  0x0000563793915ed5 in OSDMap::pg_to_up_acting_osds (acting_primary=0x0, acting=0x0, up_primary=0x0, up=0x7fb52bf36bd0, pg=..., this=0x7fb52bf36d10) at /build/ceph-12.2.1-648-g3f7eadb/src/osd/OSDMap.h:1128
#7  OSDMap::calc_pg_upmaps (this=<optimized out>, cct=<optimized out>, max_deviation_ratio=<optimized out>, max=<optimized out>, only_pools_orig=std::set with 0 elements, pending_inc=<optimized out>) at /build/ceph-12.2.1-648-g3f7eadb/src/osd/OSDMap.cc:3859
#8  0x00005637936ba022 in osdmap_calc_pg_upmaps (self=0x7fb52a6fc200, args=<optimized out>) at /build/ceph-12.2.1-648-g3f7eadb/src/mgr/PyOSDMap.cc:135
#9  0x00007fb53bee5751 in PyEval_EvalFrameEx () from /usr/lib/x86_64-linux-gnu/libpython2.7.so.1.0
#10 0x00007fb53c00e01c in PyEval_EvalCodeEx () from /usr/lib/x86_64-linux-gnu/libpython2.7.so.1.0
#11 0x00007fb53bee3cfd in PyEval_EvalFrameEx () from /usr/lib/x86_64-linux-gnu/libpython2.7.so.1.0
#12 0x00007fb53bee3e24 in PyEval_EvalFrameEx () from /usr/lib/x86_64-linux-gnu/libpython2.7.so.1.0
#13 0x00007fb53bee3e24 in PyEval_EvalFrameEx () from /usr/lib/x86_64-linux-gnu/libpython2.7.so.1.0
#14 0x00007fb53c00e01c in PyEval_EvalCodeEx () from /usr/lib/x86_64-linux-gnu/libpython2.7.so.1.0
#15 0x00007fb53bf642e0 in ?? () from /usr/lib/x86_64-linux-gnu/libpython2.7.so.1.0
#16 0x00007fb53bf371e3 in PyObject_Call () from /usr/lib/x86_64-linux-gnu/libpython2.7.so.1.0
#17 0x00007fb53bfab31c in ?? () from /usr/lib/x86_64-linux-gnu/libpython2.7.so.1.0
#18 0x00007fb53bf371e3 in PyObject_Call () from /usr/lib/x86_64-linux-gnu/libpython2.7.so.1.0

the log has
    -9> 2017-11-06 15:18:49.455111 7fb52bf39700  4 mgr[balancer] Mode upmap, max misplaced 0.050000
    -8> 2017-11-06 15:18:49.455175 7fb52bf39700 20 mgr[balancer] unknown 0.000000 degraded 0.000000 inactive 0.000000 misplaced 0
    -7> 2017-11-06 15:18:49.455221 7fb52bf39700  4 mgr[balancer] do_upmap
    -6> 2017-11-06 15:18:49.455234 7fb52bf39700  4 mgr get_config get_configkey: mgr/balancer/upmap_max_iterations
    -5> 2017-11-06 15:18:49.455237 7fb52bf39700  4 ceph_config_get upmap_max_iterations not found 
    -4> 2017-11-06 15:18:49.455242 7fb52bf39700  4 mgr get_config get_configkey: mgr/balancer/upmap_max_deviation
    -3> 2017-11-06 15:18:49.455243 7fb52bf39700  4 ceph_config_get upmap_max_deviation not found 
    -2> 2017-11-06 15:18:49.455300 7fb52bf39700  4 mgr[balancer] pools ['unique_pool_0', 'rbd']
    -1> 2017-11-06 15:18:49.455315 7fb52bf39700 10 osdmap_calc_pg_upmaps osdmap 0x56379f69d100 inc 0x56379f8e9000 max_deviation 0.0
1 max_iterations 10
     0> 2017-11-06 15:18:49.457405 7fb52bf39700 -1 *** Caught signal (Segmentation fault) **
 in thread 7fb52bf39700 thread_name:balancer

but unfortunately debug osd was 1/5 on the mgr, so i can't see what calc_pg_upmaps was doing.

/a/sage-2017-11-06_04:25:20-rados-wip-sage2-testing-2017-11-05-0042-distro-basic-smithi/1820262


Related issues 2 (0 open2 closed)

Copied to Ceph - Backport #26933: mimic: segv in OSDMap::calc_pg_upmaps from balancerResolvedNathan CutlerActions
Copied to Ceph - Backport #26934: luminous: segv in OSDMap::calc_pg_upmaps from balancerResolvedVictor DenisovActions
Actions

Also available in: Atom PDF