Bug #21144

closed

daemon-helper: command crashed with signal 1

Added by Sage Weil over 6 years ago. Updated over 6 years ago.

Status: Resolved
Priority: High
Assignee: -
Category: -
Target version: -
% Done: 0%
Source:
Tags:
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

2017-08-26T06:25:01.864 INFO:tasks.ceph.osd.13:Started
2017-08-26T06:25:02.207 INFO:tasks.ceph.osd.3:Restarting daemon
2017-08-26T06:25:02.295 INFO:teuthology.orchestra.run.smithi187:Running: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage daemon-helper kill ceph-osd -f --cluster ceph -i 3'
2017-08-26T06:25:02.411 INFO:tasks.ceph.osd.1.smithi203.stderr:2017-08-26 06:25:01.398401 7fedbc107e00 -1 osd.1 0 mon_cmd_maybe_osd_create fail: 'you must complete the upgrade and 'ceph osd require-osd-release luminous' before using crush device classes': (1) Operation not permitted
2017-08-26T06:25:02.454 INFO:tasks.ceph.osd.1.smithi203.stderr:daemon-helper: command crashed with signal 1
2017-08-26T06:25:02.464 INFO:tasks.ceph.mon.b.smithi203.stderr:2017-08-26 06:25:01.608299 7f791656b700 -1 received  signal: Hangup from  PID: 20113 task name: killall -q -1 ceph-mon ceph-mgr ceph-mds ceph-osd ceph-fuse radosgw  UID: 0
2017-08-26T06:25:02.536 INFO:tasks.ceph.osd.5.smithi203.stderr:daemon-helper: command crashed with signal 1
2017-08-26T06:25:02.542 INFO:tasks.ceph.osd.9.smithi203.stderr:daemon-helper: command crashed with signal 1
2017-08-26T06:25:02.631 INFO:tasks.ceph.mon.c.smithi084.stderr:2017-08-26 06:25:01.853378 7f93c97be700 -1 received  signal: Hangup from  PID: 14662 task name: killall -q -1 ceph-mon ceph-mgr ceph-mds ceph-osd ceph-fuse radosgw  UID: 0
2017-08-26T06:25:02.647 INFO:tasks.ceph.osd.2.smithi084.stderr:2017-08-26 06:25:01.853453 7fe4e70d8700 -1 received  signal: Hangup from  PID: 14662 task name: killall -q -1 ceph-mon ceph-mgr ceph-mds ceph-osd ceph-fuse radosgw  UID: 0
2017-08-26T06:25:02.652 INFO:tasks.ceph.osd.6.smithi084.stderr:2017-08-26 06:25:01.853563 7f162a5d0700 -1 received  signal: Hangup from  PID: 14662 task name: killall -q -1 ceph-mon ceph-mgr ceph-mds ceph-osd ceph-fuse radosgw  UID: 0
2017-08-26T06:25:02.714 INFO:tasks.ceph.osd.10.smithi084.stderr:2017-08-26 06:25:01.853570 7efddf5ca700 -1 received  signal: Hangup from  PID: 14662 task name: killall -q -1 ceph-mon ceph-mgr ceph-mds ceph-osd ceph-fuse radosgw  UID: 0
2017-08-26T06:25:02.758 INFO:tasks.ceph.osd.14.smithi084.stderr:2017-08-26 06:25:01.853616 7f9376087700 -1 received  signal: Hangup from  PID: 14662 task name:  UID: 0
2017-08-26T06:25:02.804 INFO:tasks.ceph.osd.13.smithi203.stderr:daemon-helper: command crashed with signal 1
2017-08-26T06:25:02.820 INFO:tasks.ceph.osd.13.smithi203.stdout:starting osd.13 at - osd_data /var/lib/ceph/osd/ceph-13 /var/lib/ceph/osd/ceph-13/journal

/a/sage-2017-08-25_18:16:29-rados-wip-sage-testing-luminous-20170825a-distro-basic-smithi/1563147
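
For anyone triaging a similar log: signal 1 is SIGHUP on Linux, which matches the "received  signal: Hangup" lines and the `killall -q -1 ...` task name logged by the other daemons. The sketch below is a hypothetical helper, not part of teuthology or Ceph. It assumes log lines shaped like the excerpt above, and the names `CRASH_RE` and `crashed_daemons` are made up for illustration. It pulls out which daemons reported a daemon-helper crash and translates the signal number into its name.

```python
import re
import signal

# Assumed line shape, taken from the excerpt above:
#   <ts> INFO:tasks.ceph.<daemon>.<host>.stderr:daemon-helper: command crashed with signal <N>
CRASH_RE = re.compile(
    r"INFO:tasks\.ceph\.(?P<daemon>[a-z]+\.\w+)\.(?P<host>\w+)\.stderr:"
    r"daemon-helper: command crashed with signal (?P<signum>\d+)"
)


def crashed_daemons(log_lines):
    """Return a (daemon, host, signal name) tuple for each crash line found."""
    results = []
    for line in log_lines:
        match = CRASH_RE.search(line)
        if match is None:
            continue
        signum = int(match.group("signum"))
        try:
            # Map the number to a symbolic name, e.g. 1 -> SIGHUP on POSIX.
            name = signal.Signals(signum).name
        except ValueError:
            # Signal number unknown on this platform: keep the raw number.
            name = f"signal {signum}"
        results.append((match.group("daemon"), match.group("host"), name))
    return results


# Three lines copied from the log above; the middle one is not a crash line.
sample = [
    "2017-08-26T06:25:02.454 INFO:tasks.ceph.osd.1.smithi203.stderr:"
    "daemon-helper: command crashed with signal 1",
    "2017-08-26T06:25:02.464 INFO:tasks.ceph.mon.b.smithi203.stderr:"
    "2017-08-26 06:25:01.608299 7f791656b700 -1 received  signal: Hangup from  "
    "PID: 20113 task name: killall -q -1 ceph-mon ceph-mgr ceph-mds ceph-osd "
    "ceph-fuse radosgw  UID: 0",
    "2017-08-26T06:25:02.804 INFO:tasks.ceph.osd.13.smithi203.stderr:"
    "daemon-helper: command crashed with signal 1",
]

for daemon, host, sig in crashed_daemons(sample):
    print(f"{daemon} on {host} exited on {sig}")
```

This only classifies the crash lines. It does not explain why the OSDs on smithi203 died on the hangup while the daemons on smithi084 merely logged it.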
#1

Updated by Hey Pas over 6 years ago

Hello,

I ran into this during a Jewel -> Luminous upgrade using the Docker ceph/daemon container image (going from image e0219efd1c22, 5 months old, to image 220e2a0b5985, 2 days old).

It required a full cluster stop (stopping all old OSDs); only then did the MONs finally accept that it was okay to flip the upgrade bit.

All in all, it could have been a much worse experience, but it was spooky/scary as hell anyhow.

#2

Updated by Sage Weil over 6 years ago

  • Status changed from 12 to Resolved

I missed before that this was an upgrade test; it's working now.
