Project

General

Profile

Actions

Bug #20793

closed

osd: segv in CopyFromFinisher::execute in ec cache tiering test

Added by Sage Weil almost 7 years ago. Updated over 6 years ago.

Status:
Resolved
Priority:
Immediate
Assignee:
Jason Dillaman
Category:
-
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

2017-07-27T08:41:34.994 INFO:tasks.ceph.osd.4.smithi093.stderr:*** Caught signal (Segmentation fault) **
2017-07-27T08:41:34.994 INFO:tasks.ceph.osd.4.smithi093.stderr: in thread 7fcd0eb87700 thread_name:tp_osd_tp
2017-07-27T08:41:34.994 INFO:tasks.ceph.osd.4.smithi093.stderr: ceph version 12.1.1-650-gd08e995 (d08e9953805141b8862ece9c66e6306d51b94a95) luminous (rc)
2017-07-27T08:41:34.996 INFO:tasks.ceph.osd.4.smithi093.stderr: 1: (()+0xa938a4) [0x55833029d8a4]
2017-07-27T08:41:34.996 INFO:tasks.ceph.osd.4.smithi093.stderr: 2: (()+0x11390) [0x7fcd296d4390]
2017-07-27T08:41:34.996 INFO:tasks.ceph.osd.4.smithi093.stderr: 3: (CopyFromFinisher::execute()+0xc) [0x55832fefd91c]
2017-07-27T08:41:34.996 INFO:tasks.ceph.osd.4.smithi093.stderr: 4: (PrimaryLogPG::do_osd_ops(PrimaryLogPG::OpContext*, std::vector<OSDOp, std::allocator<OSDOp> >&)+0x2f4d) [0x55832fe9f0dd]
2017-07-27T08:41:34.996 INFO:tasks.ceph.osd.4.smithi093.stderr: 5: (PrimaryLogPG::prepare_transaction(PrimaryLogPG::OpContext*)+0xbf) [0x55832feae95f]
2017-07-27T08:41:34.996 INFO:tasks.ceph.osd.4.smithi093.stderr: 6: (PrimaryLogPG::execute_ctx(PrimaryLogPG::OpContext*)+0x2db) [0x55832feaf1db]
2017-07-27T08:41:35.075 INFO:tasks.ceph.osd.4.smithi093.stderr: 7: (PrimaryLogPG::do_op(boost::intrusive_ptr<OpRequest>&)+0x309d) [0x55832feb3ccd]
2017-07-27T08:41:35.076 INFO:tasks.ceph.osd.4.smithi093.stderr: 8: (PrimaryLogPG::do_request(boost::intrusive_ptr<OpRequest>&, ThreadPool::TPHandle&)+0xe73) [0x55832fe6d9e3]
2017-07-27T08:41:35.076 INFO:tasks.ceph.osd.4.smithi093.stderr: 9: (OSD::dequeue_op(boost::intrusive_ptr<PG>, boost::intrusive_ptr<OpRequest>, ThreadPool::TPHandle&)+0x3a9) [0x55832fcf5549]
2017-07-27T08:41:35.076 INFO:tasks.ceph.osd.4.smithi093.stderr: 10: (PGQueueable::RunVis::operator()(boost::intrusive_ptr<OpRequest> const&)+0x57) [0x55832ff87ce7]
2017-07-27T08:41:35.076 INFO:tasks.ceph.osd.4.smithi093.stderr: 11: (OSD::ShardedOpWQ::_process(unsigned int, ceph::heartbeat_handle_d*)+0x130e) [0x55832fd1cc0e]
2017-07-27T08:41:35.076 INFO:tasks.ceph.osd.4.smithi093.stderr: 12: (ShardedThreadPool::shardedthreadpool_worker(unsigned int)+0x933) [0x5583302e5183]
2017-07-27T08:41:35.076 INFO:tasks.ceph.osd.4.smithi093.stderr: 13: (ShardedThreadPool::WorkThreadSharded::entry()+0x10) [0x5583302e83c0]
2017-07-27T08:41:35.076 INFO:tasks.ceph.osd.4.smithi093.stderr: 14: (()+0x76ba) [0x7fcd296ca6ba]
2017-07-27T08:41:35.076 INFO:tasks.ceph.osd.4.smithi093.stderr: 15: (clone()+0x6d) [0x7fcd2874182d]
2017-07-27T08:41:35.077 INFO:tasks.ceph.osd.4.smithi093.stderr:2017-07-27 08:41:34.956188 7fcd0eb87700 -1 *** Caught signal (Segmentation fault) **
2017-07-27T08:41:35.077 INFO:tasks.ceph.osd.4.smithi093.stderr: in thread 7fcd0eb87700 thread_name:tp_osd_tp

/a/sage-2017-07-26_19:43:32-rados-wip-sage-testing2-distro-basic-smithi/1448280
Actions #1

Updated by Sage Weil almost 7 years ago

  • Project changed from Ceph to RADOS

similar:

2017-07-27T09:08:08.095 INFO:tasks.ceph.osd.4.smithi178.stderr: ceph version 12.1.1-650-gd08e995 (d08e9953805141b8862ece9c66e6306d51b94a95) luminous (rc)
2017-07-27T09:08:08.095 INFO:tasks.ceph.osd.4.smithi178.stderr: 1: (()+0xa40a79) [0x7f4f59f96a79]
2017-07-27T09:08:08.095 INFO:tasks.ceph.osd.4.smithi178.stderr: 2: (()+0x10330) [0x7f4f57a5d330]
2017-07-27T09:08:08.095 INFO:tasks.ceph.osd.4.smithi178.stderr: 3: (PrimaryLogPG::finish_copyfrom(CopyFromCallback*)+0x1c) [0x7f4f59beebec]
2017-07-27T09:08:08.096 INFO:tasks.ceph.osd.4.smithi178.stderr: 4: (CopyFromFinisher::execute()+0x18) [0x7f4f59c5fbc8]
2017-07-27T09:08:08.096 INFO:tasks.ceph.osd.4.smithi178.stderr: 5: (PrimaryLogPG::do_osd_ops(PrimaryLogPG::OpContext*, std::vector<OSDOp, std::allocator<OSDOp> >&)+0x46d2) [0x7f4f59c0c092]
2017-07-27T09:08:08.096 INFO:tasks.ceph.osd.4.smithi178.stderr: 6: (PrimaryLogPG::prepare_transaction(PrimaryLogPG::OpContext*)+0x8f) [0x7f4f59c18fdf]
2017-07-27T09:08:08.096 INFO:tasks.ceph.osd.4.smithi178.stderr: 7: (PrimaryLogPG::execute_ctx(PrimaryLogPG::OpContext*)+0x723) [0x7f4f59c19d33]
2017-07-27T09:08:08.096 INFO:tasks.ceph.osd.4.smithi178.stderr: 8: (PrimaryLogPG::do_op(boost::intrusive_ptr<OpRequest>&)+0x3134) [0x7f4f59c1e5b4]
2017-07-27T09:08:08.096 INFO:tasks.ceph.osd.4.smithi178.stderr: 9: (PrimaryLogPG::do_request(boost::intrusive_ptr<OpRequest>&, ThreadPool::TPHandle&)+0xe46) [0x7f4f59bdce46]
2017-07-27T09:08:08.096 INFO:tasks.ceph.osd.4.smithi178.stderr: 10: (OSD::dequeue_op(boost::intrusive_ptr<PG>, boost::intrusive_ptr<OpRequest>, ThreadPool::TPHandle&)+0x3e6) [0x7f4f59a7f736]
2017-07-27T09:08:08.096 INFO:tasks.ceph.osd.4.smithi178.stderr: 11: (PGQueueable::RunVis::operator()(boost::intrusive_ptr<OpRequest> const&)+0x47) [0x7f4f59cd5cc7]
2017-07-27T09:08:08.096 INFO:tasks.ceph.osd.4.smithi178.stderr: 12: (OSD::ShardedOpWQ::_process(unsigned int, ceph::heartbeat_handle_d*)+0xff5) [0x7f4f59aa9d15]
2017-07-27T09:08:08.097 INFO:tasks.ceph.osd.4.smithi178.stderr: 13: (ShardedThreadPool::shardedthreadpool_worker(unsigned int)+0x907) [0x7f4f59fd86c7]
2017-07-27T09:08:08.097 INFO:tasks.ceph.osd.4.smithi178.stderr: 14: (ShardedThreadPool::WorkThreadSharded::entry()+0x10) [0x7f4f59fda840]
2017-07-27T09:08:08.097 INFO:tasks.ceph.osd.4.smithi178.stderr: 15: (()+0x8184) [0x7f4f57a55184]
2017-07-27T09:08:08.097 INFO:tasks.ceph.osd.4.smithi178.stderr: 16: (clone()+0x6d) [0x7f4f56b4537d]

/a/sage-2017-07-26_19:43:32-rados-wip-sage-testing2-distro-basic-smithi/1448326

Actions #2

Updated by Sage Weil almost 7 years ago

/a/sage-2017-07-26_19:43:32-rados-wip-sage-testing2-distro-basic-smithi/1448238
/a/sage-2017-07-26_19:43:32-rados-wip-sage-testing2-distro-basic-smithi/1448371

Actions #3

Updated by Jason Dillaman over 6 years ago

Perhaps fixed under tracker # 20783 since it didn't repeat under a single run locally nor under teuthology. Going to run it a few more times.

http://pulpito.ceph.com/jdillaman-2017-07-27_11:55:05-rados-wip-20783-distro-basic-smithi/

Actions #4

Updated by Jason Dillaman over 6 years ago

  • Status changed from 12 to Fix Under Review
  • Assignee set to Jason Dillaman

Appears to be resolved under tracker ticket #20783 [1]

PR: https://github.com/ceph/ceph/pull/16617

[1] http://pulpito.ceph.com/jdillaman-2017-07-27_13:08:07-rados-wip-20783-distro-basic-smithi/ (one test failure -- no crashes, scrub timed-out?)

Actions #5

Updated by Sage Weil over 6 years ago

  • Status changed from Fix Under Review to Resolved
Actions

Also available in: Atom PDF