Project

General

Profile

Actions

Bug #38621

closed

py modules fail to cleanly shut down

Added by Alfonso Martínez about 5 years ago. Updated over 4 years ago.

Status:
Won't Fix
Priority:
High
Assignee:
-
Category:
ceph-mgr
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

When running backend api tests locally on master:

src/pybind/mgr/dashboard/run-backend-api-tests.sh

2019-03-07 09:53:12,042.042 INFO:__main__:Running ['./bin/ceph', 'mgr', 'fail', 'y']
2019-03-07 09:53:14,143.143 INFO:__main__:Running ['ps', 'ww', '-u0']
2019-03-07 09:53:14,157.157 INFO:__main__:Found ps line for daemon: 4214 ? Ssl 0:03 ./bin/./ceph-mgr -i x
2019-03-07 09:53:14,157.157 INFO:__main__:Running ['ps', 'ww', '-u0']
2019-03-07 09:53:14,181.181 INFO:__main__:Found ps line for daemon: 4214 ? Ssl 0:03 ./bin/./ceph-mgr -i x
2019-03-07 09:53:14,182.182 INFO:__main__:Killing PID 4214 for mgr.x
2019-03-07 09:53:14,182.182 INFO:__main__:Running ['ps', 'ww', '-u0']
2019-03-07 09:53:14,199.199 INFO:__main__:Found ps line for daemon: 4214 ? Ssl 0:03 ./bin/./ceph-mgr -i x
2019-03-07 09:53:15,200.200 INFO:__main__:Running ['ps', 'ww', '-u0']
2019-03-07 09:53:15,210.210 INFO:__main__:Found ps line for daemon: 4214 ? Ssl 0:03 ./bin/./ceph-mgr -i x
2019-03-07 09:53:16,212.212 INFO:__main__:Running ['ps', 'ww', '-u0']
2019-03-07 09:53:16,240.240 INFO:__main__:Found ps line for daemon: 4214 ? Ssl 0:03 ./bin/./ceph-mgr -i x
2019-03-07 09:53:17,243.243 INFO:__main__:Running ['ps', 'ww', '-u0']
2019-03-07 09:53:17,276.276 INFO:__main__:Found ps line for daemon: 4214 ? Ssl 0:03 ./bin/./ceph-mgr -i x
2019-03-07 09:53:18,280.280 INFO:__main__:Running ['ps', 'ww', '-u0']
2019-03-07 09:53:18,312.312 INFO:__main__:Found ps line for daemon: 4214 ? Ssl 0:03 ./bin/./ceph-mgr -i x
2019-03-07 09:53:19,315.315 INFO:__main__:Running ['ps', 'ww', '-u0']
2019-03-07 09:53:19,350.350 INFO:__main__:Found ps line for daemon: 4214 ? Ssl 0:03 ./bin/./ceph-mgr -i x
2019-03-07 09:53:20,352.352 INFO:__main__:Running ['ps', 'ww', '-u0']
2019-03-07 09:53:20,385.385 INFO:__main__:Found ps line for daemon: 4214 ? Ssl 0:03 ./bin/./ceph-mgr -i x

[...]

2019-03-07 09:58:25,918.918 INFO:__main__:======================================================================
2019-03-07 09:58:25,919.919 INFO:__main__:ERROR: test_standby (tasks.mgr.test_dashboard.TestDashboard)
2019-03-07 09:58:25,919.919 INFO:__main__:----------------------------------------------------------------------
2019-03-07 09:58:25,920.920 INFO:__main__:Traceback (most recent call last):
2019-03-07 09:58:25,920.920 INFO:__main__: File "/ceph/qa/tasks/mgr/test_dashboard.py", line 18, in setUp
2019-03-07 09:58:25,921.921 INFO:__main__: self._assign_ports("dashboard", "server_port")
2019-03-07 09:58:25,921.921 INFO:__main__: File "/ceph/qa/tasks/mgr/mgr_test_case.py", line 182, in assign_ports
2019-03-07 09:58:25,922.922 INFO:
_main__: cls.mgr_cluster.mgr_stop(mgr_id)
2019-03-07 09:58:25,922.922 INFO:__main__: File "/ceph/qa/tasks/mgr/mgr_test_case.py", line 30, in mgr_stop
2019-03-07 09:58:25,923.923 INFO:__main__: self.mgr_daemons[mgr_id].stop()
2019-03-07 09:58:25,924.924 INFO:__main__: File "../qa/tasks/vstart_runner.py", line 373, in stop
2019-03-07 09:58:25,924.924 INFO:__main__: self.daemon_type, self.daemon_id))
2019-03-07 09:58:25,925.925 INFO:__main__:MaxWhileTries: Timed out waiting for daemon mgr.x

In mgr.x.log there is this error:

0> 2019-03-07 09:52:42.753 7f014909e700 -1 ** Caught signal (Aborted) *
in thread 7f014909e700 thread_name:mgrsb-fin
ceph version Development (no_version) nautilus (rc)
1: (()+0x9cc178) [0x55c47e5dd178]
2: (()+0x13030) [0x7f014eff2030]
3: (gsignal()+0x10f) [0x7f014e6a953f]
4: (abort()+0x127) [0x7f014e693895]
5: (()+0x7012e) [0x7f014f7f412e]
6: (PyGrammar_LabelRepr()+0) [0x7f014f7f418e]
7: (()+0x722ea) [0x7f014f7f62ea]
8: (Py_FinalizeEx()+0x13e) [0x7f014f9bf89e]
9: (PyModuleRegistry::shutdown()+0xa3) [0x55c47e43d263]
10: (()+0x7f3b02) [0x55c47e404b02]
11: (()+0x7f6932) [0x55c47e407932]
12: (boost::function1<void, int>::operator()(int) const+0x6c) [0x55c47e312a7c]
13: (FunctionContext::finish(int)+0x24) [0x55c47e30f936]
14: (Context::complete(int)+0x27) [0x55c47e30f7fd]
15: (Finisher::finisher_thread_entry()+0x38b) [0x7f0150e0a36d]
16: (Finisher::FinisherThread::entry()+0x1c) [0x55c47e3f1a30]
17: (Thread::entry_wrapper()+0x78) [0x7f0150e7659e]
18: (Thread::_entry_func(void*)+0x18) [0x7f0150e7651c]
19: (()+0x858e) [0x7f014efe758e]
20: (clone()+0x43) [0x7f014e76e6a3]
NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

Files

test-standby-api-tests.log (323 KB) test-standby-api-tests.log Alfonso Martínez, 03/07/2019 10:18 AM
test-standby-mgr-x.log (175 KB) test-standby-mgr-x.log Alfonso Martínez, 03/07/2019 10:22 AM
Actions

Also available in: Atom PDF