Bug #19939

OSD crash in MOSDRepOpReply::decode_payload

Added by John Spray almost 7 years ago. Updated almost 7 years ago.

Status: Resolved
Priority: Urgent
Assignee:
Category: -
Target version: -
% Done: 0%
Source:
Tags:
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS): OSD
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Seen on the kcephfs suite, running against a test branch based on Monday's master.

2017-05-16T10:15:48.415 INFO:tasks.ceph.osd.1.smithi158.stderr:/build/ceph-12.0.2-1210-gb130515/src/osd/osd_types.h: In function 'static void osd_reqid_t::_denc_finish(ceph::buffer::ptr::iterator&, __u8*, __u8*, char**, uint32_t*)' thread 7fa92256f700 time 2017-05-16 10:15:48.416215
2017-05-16T10:15:48.416 INFO:tasks.ceph.osd.1.smithi158.stderr:/build/ceph-12.0.2-1210-gb130515/src/osd/osd_types.h: 117: FAILED assert(pos <= end)
2017-05-16T10:15:48.416 INFO:tasks.ceph.osd.0.smithi158.stderr:/build/ceph-12.0.2-1210-gb130515/src/osd/osd_types.h: In function 'static void osd_reqid_t::_denc_finish(ceph::buffer::ptr::iterator&, __u8*, __u8*, char**, uint32_t*)' thread 7f36a8f2f700 time 2017-05-16 10:15:48.416393
2017-05-16T10:15:48.416 INFO:tasks.ceph.osd.0.smithi158.stderr:/build/ceph-12.0.2-1210-gb130515/src/osd/osd_types.h: 117: FAILED assert(pos <= end)
2017-05-16T10:15:48.417 INFO:tasks.ceph.osd.1.smithi158.stderr: ceph version 12.0.2-1210-gb130515 (b13051530ef8dca97f2a6fc0558a898dd02eb8e1)
2017-05-16T10:15:48.417 INFO:tasks.ceph.osd.1.smithi158.stderr: 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x10e) [0x55c29afc534e]
2017-05-16T10:15:48.418 INFO:tasks.ceph.osd.1.smithi158.stderr: 2: (std::enable_if<denc_traits<osd_reqid_t, void>::supported&&(!denc_traits<osd_reqid_t, void>::featured), void>::type decode<osd_reqid_t, denc_traits<osd_reqid_t, void> >(osd_reqid_t&, ceph::buffer::list::iterator&)+0x198) [0x55c29ab7f0b8]
2017-05-16T10:15:48.418 INFO:tasks.ceph.osd.1.smithi158.stderr: 3: (MOSDRepOpReply::decode_payload()+0x8e) [0x55c29ad3e9ee]
2017-05-16T10:15:48.418 INFO:tasks.ceph.osd.1.smithi158.stderr: 4: (decode_message(CephContext*, int, ceph_msg_header&, ceph_msg_footer&, ceph::buffer::list&, ceph::buffer::list&, ceph::buffer::list&, Connection*)+0xe37) [0x55c29aff5a77]
2017-05-16T10:15:48.418 INFO:tasks.ceph.osd.1.smithi158.stderr: 5: (AsyncConnection::process()+0x1a64) [0x55c29b1c5ac4]
2017-05-16T10:15:48.418 INFO:tasks.ceph.osd.1.smithi158.stderr: 6: (EventCenter::process_events(int)+0x301) [0x55c29b04e811]
2017-05-16T10:15:48.418 INFO:tasks.ceph.osd.1.smithi158.stderr: 7: (()+0xa5129a) [0x55c29b05129a]
2017-05-16T10:15:48.419 INFO:tasks.ceph.osd.1.smithi158.stderr: 8: (()+0xb1a60) [0x7fa9251a3a60]
2017-05-16T10:15:48.419 INFO:tasks.ceph.osd.1.smithi158.stderr: 9: (()+0x8184) [0x7fa92581b184]
2017-05-16T10:15:48.419 INFO:tasks.ceph.osd.1.smithi158.stderr: 10: (clone()+0x6d) [0x7fa92490b37d]
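
The excerpt above interleaves stderr from osd.0 and osd.1. Both hit the same assert within a millisecond of each other, but only osd.1's backtrace is shown. A minimal Python sketch for splitting such a log by daemon and pulling out the assert lines and backtrace frames might look like the following. The line-prefix pattern is an assumption inferred only from the lines in this report, not from any teuthology documentation, and the helper names are hypothetical.

```python
import re
from collections import defaultdict

# Assumed prefix layout, inferred from the lines quoted in this report:
#   <timestamp> INFO:tasks.ceph.<daemon>.<host>.stderr:<message>
LINE_RE = re.compile(
    r"^(?P<ts>\S+) INFO:tasks\.ceph\.(?P<daemon>osd\.\d+)"
    r"\.(?P<host>[^.]+)\.stderr:(?P<msg>.*)$"
)
# Backtrace frames look like " 3: (symbol+0xOFFSET) [0xADDRESS]".
FRAME_RE = re.compile(r"^\s*(?P<idx>\d+): \((?P<sym>.*)\) \[0x[0-9a-f]+\]$")


def split_by_daemon(lines):
    """Group the message part of each log line by daemon name (e.g. 'osd.1')."""
    per_daemon = defaultdict(list)
    for line in lines:
        m = LINE_RE.match(line.rstrip("\n"))
        if m:
            per_daemon[m.group("daemon")].append(m.group("msg"))
    return dict(per_daemon)


def failed_asserts(messages):
    """Return the messages that report a failed assert."""
    return [msg for msg in messages if "FAILED assert" in msg]


def frames(messages):
    """Return (frame index, symbol+offset) pairs for backtrace lines."""
    out = []
    for msg in messages:
        m = FRAME_RE.match(msg)
        if m:
            out.append((int(m.group("idx")), m.group("sym")))
    return out


# Three lines copied from the excerpt above.
SAMPLE = [
    "2017-05-16T10:15:48.416 INFO:tasks.ceph.osd.1.smithi158.stderr:"
    "/build/ceph-12.0.2-1210-gb130515/src/osd/osd_types.h: 117: "
    "FAILED assert(pos <= end)",
    "2017-05-16T10:15:48.416 INFO:tasks.ceph.osd.0.smithi158.stderr:"
    "/build/ceph-12.0.2-1210-gb130515/src/osd/osd_types.h: 117: "
    "FAILED assert(pos <= end)",
    "2017-05-16T10:15:48.418 INFO:tasks.ceph.osd.1.smithi158.stderr: 3: "
    "(MOSDRepOpReply::decode_payload()+0x8e) [0x55c29ad3e9ee]",
]

if __name__ == "__main__":
    for daemon, msgs in sorted(split_by_daemon(SAMPLE).items()):
        print(daemon, failed_asserts(msgs), frames(msgs))
```

Running this over the full teuthology.log for the job would show whether osd.0 logged a matching backtrace through MOSDRepOpReply::decode_payload as well.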

I assume that since it's a MOSDRepOpReply this is the OSD's fault, and not something bogus from the client.

http://pulpito.ceph.com/jspray-2017-05-16_09:43:25-kcephfs-wip-jcsp-testing-20170515b-testing-basic-smithi/1184359

kcephfs/thrash/{clusters/fixed-3-cephfs.yaml conf.yaml objectstore/filestore-xfs.yaml thrashers/mon.yaml workloads/kclient_workunit_suites_iozone.yaml}
