Project

General

Profile

Actions

Bug #48994

closed

'...kill ceph-mds -f --cluster ceph -i a' failure in upgrade:octopus-x:parallel-no-cephadm-pacific

Added by Yuri Weinstein over 3 years ago. Updated about 3 years ago.

Status:
Duplicate
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
Q/A
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

This is on https://github.com/ceph/ceph/pull/39066/
Run: https://pulpito.ceph.com/teuthology-2021-01-25_20:01:21-upgrade:octopus-x:parallel-no-cephadm-pacific-distro-basic-gibba/
Jobs: all
Logs: http://qa-proxy.ceph.com/teuthology/teuthology-2021-01-25_20:01:21-upgrade:octopus-x:parallel-no-cephadm-pacific-distro-basic-gibba/5828460/teuthology.log

failure_reason: 'Command failed on smithi201 with status 1: ''sudo adjust-ulimits
  ceph-coverage /home/ubuntu/cephtest/archive/coverage daemon-helper kill ceph-mds
  -f --cluster ceph -i a'''
 ceph version 15.2.8-228-gc676cbb9 (c676cbb9be59cfb21bd2ba9250035305ff2c9719) octopus (stable)
 1: (ceph::__ceph_abort(char const*, int, char const*, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&)+0xe1) [0x7f508fc778be]
 2: (DecodeConfigPayloadVisitor const::result_type boost::variant<OSDConfigPayload, UnknownConfigPayload>::apply_visitor<DecodeConfigPayloadVisitor const>(DecodeConfigPayloadVisitor const&) &+0xee) [0x7f508feee6fe]
 3: (MMgrConfigure::decode_payload()+0x1521) [0x7f508fef3d31]
 4: (decode_message(ceph::common::CephContext*, int, ceph_msg_header&, ceph_msg_footer&, ceph::buffer::v15_2_0::list&, ceph::buffer::v15_2_0::list&, ceph::buffer::v15_2_0::list&, boost::intrusive_ptr<Connection>)+0x15cd) [0x7f508fe71f4d]
 5: (ProtocolV2::handle_message()+0x406) [0x7f508ff38576]
 6: (ProtocolV2::handle_read_frame_dispatch()+0x160) [0x7f508ff4aaa0]
 7: (ProtocolV2::_handle_read_frame_epilogue_main()+0x69) [0x7f508ff4abf9]
 8: (ProtocolV2::handle_read_frame_epilogue_main(std::unique_ptr<ceph::buffer::v15_2_0::ptr_node, ceph::buffer::v15_2_0::ptr_node::disposer>&&, int)+0x61) [0x7f508ff4c471]
 9: (ProtocolV2::run_continuation(Ct<ProtocolV2>&)+0x34) [0x7f508ff347b4]
 10: (AsyncConnection::process()+0x5fc) [0x7f508feff41c]
 11: (EventCenter::process_events(unsigned int, std::chrono::duration<unsigned long, std::ratio<1l, 1000000000l> >*)+0x7dd) [0x7f508ff555ed]
 12: (()+0x553288) [0x7f508ff5d288]
 13: (()+0xbd6df) [0x7f508f0ab6df]
 14: (()+0x76db) [0x7f508f5826db]
 15: (clone()+0x3f) [0x7f508e76871f]

Per Josh "the mds is crashing due to an incompatibility with a message from the mgr" might an issue "with octopus mds + pacific mgr"


Related issues 1 (0 open1 closed)

Is duplicate of RADOS - Bug #49069: mds crashes on v15.2.8 -> master upgrade decoding MMgrConfigureResolved

Actions
Actions #1

Updated by Josh Durgin over 3 years ago

  • Description updated (diff)
Actions #2

Updated by Yuri Weinstein over 3 years ago

  • Project changed from Orchestrator to Ceph
Actions #3

Updated by Josh Durgin about 3 years ago

  • Is duplicate of Bug #49069: mds crashes on v15.2.8 -> master upgrade decoding MMgrConfigure added
Actions #4

Updated by Josh Durgin about 3 years ago

  • Status changed from New to Duplicate
Actions

Also available in: Atom PDF