Project

General

Profile

Actions

Bug #17944

closed

misc.py: stop_daemons_of_type did not stop them all (sometimes osds, sometimes mons)

Added by Sage Weil over 7 years ago. Updated over 2 years ago.

Status:
Can't reproduce
Priority:
Urgent
Assignee:
-
Category:
-
% Done:

0%

Source:
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Crash signature (v1):
Crash signature (v2):

Description

2016-11-17T18:25:07.367 INFO:teuthology.misc:Shutting down osd daemons...
2016-11-17T18:25:07.367 DEBUG:tasks.ceph.osd.1:waiting for process to exit
2016-11-17T18:25:07.368 INFO:teuthology.orchestra.run:waiting for 300
2016-11-17T18:25:07.405 INFO:tasks.ceph.osd.1.smithi002.stderr:2016-11-17 18:25:07.374718 304f7700 -1 received  signal: Terminated from  PID: 23671 task name: /usr/bin/python /bin/daemon-helper term valgrind --trace-children=no --child-silent-after-fork=yes --num-callers=50 --suppressions=/home/ubuntu/cephtest/valgrind.supp --xml=yes --xml-file=/var/log/ceph/valgrind/osd.1.log --time-stamp=yes --tool=memcheck ceph-osd -f --cluster ceph -i 1  UID: 0
2016-11-17T18:25:07.405 INFO:tasks.ceph.osd.1.smithi002.stderr:2016-11-17 18:25:07.379027 304f7700 -1 osd.1 15 *** Got signal Terminated ***
2016-11-17T18:25:08.203 INFO:tasks.ceph.osd.1.smithi002.stderr:2016-11-17 18:25:08.190705 304f7700 -1 osd.1 15 shutdown
2016-11-17T18:25:13.368 INFO:tasks.ceph.osd.1:Stopped
2016-11-17T18:25:13.369 DEBUG:tasks.ceph.osd.0:waiting for process to exit
2016-11-17T18:25:13.369 INFO:teuthology.orchestra.run:waiting for 300
2016-11-17T18:25:13.409 INFO:tasks.ceph.osd.0.smithi002.stderr:2016-11-17 18:25:13.374977 30cf8700 -1 received  signal: Terminated from  PID: 23660 task name: /usr/bin/python /bin/daemon-helper term valgrind --trace-children=no --child-silent-after-fork=yes --num-callers=50 --suppressions=/home/ubuntu/cephtest/valgrind.supp --xml=yes --xml-file=/var/log/ceph/valgrind/osd.0.log --time-stamp=yes --tool=memcheck ceph-osd -f --cluster ceph -i 0  UID: 0
2016-11-17T18:25:13.409 INFO:tasks.ceph.osd.0.smithi002.stderr:2016-11-17 18:25:13.380205 30cf8700 -1 osd.0 18 *** Got signal Terminated ***
2016-11-17T18:25:14.206 INFO:tasks.ceph.osd.0.smithi002.stderr:2016-11-17 18:25:14.057377 30cf8700 -1 osd.0 18 shutdown
2016-11-17T18:25:19.369 INFO:tasks.ceph.osd.0:Stopped
2016-11-17T18:25:19.370 DEBUG:tasks.ceph.osd.3:waiting for process to exit
2016-11-17T18:25:19.370 INFO:teuthology.orchestra.run:waiting for 300
2016-11-17T18:25:19.398 INFO:tasks.ceph.osd.3.smithi031.stderr:2016-11-17 18:25:19.378217 3100a700 -1 received  signal: Terminated from  PID: 14913 task name: /usr/bin/python /usr/bin/daemon-helper term valgrind --trace-children=no --child-silent-after-fork=yes --num-callers=50 --suppressions=/home/ubuntu/cephtest/valgrind.supp --xml=yes --xml-file=/var/log/ceph/valgrind/osd.3.log --time-stamp=yes --tool=memcheck ceph-osd -f --cluster ceph -i 3  UID: 0
2016-11-17T18:25:19.398 INFO:tasks.ceph.osd.3.smithi031.stderr:2016-11-17 18:25:19.383355 3100a700 -1 osd.3 21 *** Got signal Terminated ***
2016-11-17T18:25:20.631 INFO:tasks.ceph.osd.3.smithi031.stderr:2016-11-17 18:25:20.581761 3100a700 -1 osd.3 21 shutdown
2016-11-17T18:30:13.404 INFO:teuthology.misc:Shutting down mgr daemons...

on a job with osd's 0 through 5.

The subsequent unmount then fails because a daemon is still running.

/a/sage-2016-11-17_17:11:43-rados-wip-sage-testing---basic-smithi/556861

not sure if something is corrupting the ctx.daemons list? or iter_daemons_of_type is buggy?


Related issues 1 (0 open1 closed)

Has duplicate teuthology - Bug #43292: teuthology does not stop all daemonsCan't reproduce

Actions
Actions

Also available in: Atom PDF