Bug #41171
closed
mimic: ceph-mgr 13.2.6 crashing on ubuntu 18.04 lts: ActivePyModule.cc: 54: FAILED assert(pClassInstance != nullptr)
Added by Jared Baker almost 5 years ago.
Updated almost 5 years ago.
Description
Recently upgraded from 13.2.5 to 13.2.6 and began to experience instability with ceph-mgr.
Our setup:
3 controllers (mons) running ceph-mon, radosgw, ceph-mgr
39 OSD nodes with 36 OSDs each (1404 total)
ceph-mgr is configured to run on each controller on tcp/6802 and is load-balanced with haproxy
During troubleshooting I have pared the system down to just one running ceph-mgr, and it still crashes occasionally.
I have attached several files of output that should be useful.
- Subject changed from ceph-mgr 13.2.6 crashing on ubuntu 18.04 lts to mimic: ceph-mgr 13.2.6 crashing on ubuntu 18.04 lts: ActivePyModule.cc: 54: FAILED assert(pClassInstance != nullptr)
- Target version deleted (v13.2.6)
- Affected Versions v13.2.6 added
From the logs:
2019-08-08 10:51:49.389 7fb03e113700 -1 received signal: Terminated from /sbin/init (PID: 1) UID: 0
2019-08-08 10:51:50.433 7fb03e113700 -1 mgr handle_signal *** Got signal Terminated ***
2019-08-08 10:51:52.297 7fb026169700 -1 /build/ceph-13.2.6/src/mgr/ActivePyModule.cc: In function 'void ActivePyModule::notify(const string&, const string&)' thread 7fb026169700 time 2019-08-08 10:51:52.302522
/build/ceph-13.2.6/src/mgr/ActivePyModule.cc: 54: FAILED assert(pClassInstance != nullptr)
ceph version 13.2.6 (7b695f835b03642f85998b2ae7b6dd093d9fbce4) mimic (stable)
1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x14e) [0x7fb0484e5b5e]
2: (()+0x2c4cb7) [0x7fb0484e5cb7]
3: (ActivePyModule::notify(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&)+0x234) [0x561af448b3b4]
4: (FunctionContext::finish(int)+0x2c) [0x561af4447d5c]
5: (Context::complete(int)+0x9) [0x561af44439a9]
6: (Finisher::finisher_thread_entry()+0x135) [0x7fb0484e40a5]
7: (()+0x76db) [0x7fb04781c6db]
8: (clone()+0x3f) [0x7fb046a0288f]
Looks like the mgr is not able to properly shut down.
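One possible reading of the trace, offered as an assumption rather than a confirmed diagnosis: a notify callback was still queued on the finisher thread when the SIGTERM-driven shutdown tore down the module instance, so the callback ran against a null pClassInstance. The sketch below is a toy Python model of that ordering only. The Finisher and ActiveModule classes, the run() helper, and the "osd_map" notification are hypothetical stand-ins, not Ceph code.

```python
from collections import deque


class Finisher:
    """Hypothetical stand-in for a queue of deferred callbacks."""

    def __init__(self):
        self._queue = deque()

    def queue(self, callback):
        self._queue.append(callback)

    def drain(self):
        # Run every queued callback in FIFO order.
        while self._queue:
            self._queue.popleft()()


class ActiveModule:
    """Hypothetical stand-in for the wrapper around a loaded mgr module."""

    def __init__(self):
        self.instance = object()  # plays the role of pClassInstance

    def shutdown(self):
        # Teardown drops the module instance.
        self.instance = None

    def notify(self, notify_type, notify_id):
        # Mirrors the failing check in the trace.
        if self.instance is None:
            raise AssertionError("FAILED assert(pClassInstance != nullptr)")
        return (notify_type, notify_id)


def run(drain_before_shutdown):
    """Queue one notify, shut down, then let the finisher run."""
    finisher = Finisher()
    module = ActiveModule()
    finisher.queue(lambda: module.notify("osd_map", ""))
    if drain_before_shutdown:
        finisher.drain()  # pending callbacks finish while the instance exists
    module.shutdown()
    try:
        finisher.drain()  # anything still queued now sees a dead instance
    except AssertionError:
        return "assert"
    return "clean"
```

In this model, run(False) hits the assertion and run(True) exits cleanly, which is the difference between tearing down with callbacks still pending and draining them first. Whether the real mgr follows this path is for the duplicate ticket to establish.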
- Is duplicate of Bug #38573: mgr/ActivePyModule.cc: 54: FAILED ceph_assert(pClassInstance != nullptr) added
- Status changed from New to Duplicate