Project

General

Profile

Bug #14293

Bug #13901: "FAILED assert(0 == "hit suicide timeout")" in upgrade:hammer-hammer-distro-basic-openstack

osd hit suicide timeout in hammer

Added by Tamilarasi muthamizhan over 3 years ago. Updated over 3 years ago.

Status:
Duplicate
Priority:
High
Assignee:
-
Category:
-
Target version:
-
Start date:
01/07/2016
Due date:
% Done:

0%

Source:
Q/A
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
rados
Pull request ID:

Description

noticed this core dump in the latest rados nightly run on hammer branch
although, looks similar to bug # 13827, not sure and so filing a new ticket.

logs: teuthology.ovh.sepia.ceph.com:/a/teuthology-2016-01-06_20:55:01-rados-hammer-distro-basic-openstack/61060

[pre]
2016-01-06T21:58:19.399 INFO:tasks.ceph.osd.4.target082171.stderr: ceph version 0.94.5-178-g9739d4d (9739d4de49f8167866eda556b2f1581c068ec8a7)
2016-01-06T21:58:19.400 INFO:tasks.ceph.osd.4.target082171.stderr: 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x8b) [0xbc5c7b]
2016-01-06T21:58:19.400 INFO:tasks.ceph.osd.4.target082171.stderr: 2: (ceph::HeartbeatMap::_check(ceph::heartbeat_handle_d*, char const*, long)+0x2a9) [0xb02039]
2016-01-06T21:58:19.400 INFO:tasks.ceph.osd.4.target082171.stderr: 3: (ceph::HeartbeatMap::is_healthy()+0xd6) [0xb028c6]
2016-01-06T21:58:19.401 INFO:tasks.ceph.osd.4.target082171.stderr: 4: (OSD::handle_osd_ping(MOSDPing*)+0x723) [0x6a3453]
2016-01-06T21:58:19.401 INFO:tasks.ceph.osd.4.target082171.stderr: 5: (OSD::heartbeat_dispatch(Message*)+0x2fb) [0x6a46cb]
2016-01-06T21:58:19.401 INFO:tasks.ceph.osd.4.target082171.stderr: 6: (DispatchQueue::entry()+0x649) [0xc7ae59]
2016-01-06T21:58:19.401 INFO:tasks.ceph.osd.4.target082171.stderr: 7: (DispatchQueue::DispatchThread::entry()+0xd) [0xba4f5d]
2016-01-06T21:58:19.401 INFO:tasks.ceph.osd.4.target082171.stderr: 8: (()+0x8182) [0x7fdaeaded182]
2016-01-06T21:58:19.402 INFO:tasks.ceph.osd.4.target082171.stderr: 9: (clone()+0x6d) [0x7fdae935847d]
2016-01-06T21:58:19.402 INFO:tasks.ceph.osd.4.target082171.stderr: NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
[/pre]

the core file is @ tetuhology.ovh.sepia.ceph.com:
/a/teuthology-2016-01-06_20:55:01-rados-hammer-distro-basic-openstack/61060/remote/target082171/coredump/1452117743.12641.core

History

#1 Updated by Tamilarasi muthamizhan over 3 years ago

another one:logs: teuthology.ovh.sepia.ceph.com:/a/teuthology-2016-01-06_20:55:01-rados-hammer-distro-basic-openstack/60998

2016-01-06T23:09:08.335 INFO:tasks.ceph.osd.4.target084171.stderr:common/HeartbeatMap.cc: 79: FAILED assert(0 == "hit suicide timeout")
2016-01-06T23:09:08.335 INFO:tasks.ceph.osd.4.target084171.stderr:common/HeartbeatMap.cc: In function 'bool ceph::HeartbeatMap::_check(ceph::heartbeat_handle_d*, const char*, time_t)' thread 7f44aceca700 time 2016-01-06 23:09:07.921852
2016-01-06T23:09:08.335 INFO:tasks.ceph.osd.4.target084171.stderr:common/HeartbeatMap.cc: 79: FAILED assert(0 == "hit suicide timeout")
2016-01-06T23:09:08.335 INFO:tasks.ceph.osd.4.target084171.stderr: ceph version 0.94.5-178-g9739d4d (9739d4de49f8167866eda556b2f1581c068ec8a7)
2016-01-06T23:09:08.335 INFO:tasks.ceph.osd.4.target084171.stderr: 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x8b) [0xbc5c7b]
2016-01-06T23:09:08.335 INFO:tasks.ceph.osd.4.target084171.stderr: 2: (ceph::HeartbeatMap::_check(ceph::heartbeat_handle_d*, char const*, long)+0x2a9) [0xb02039]
2016-01-06T23:09:08.335 INFO:tasks.ceph.osd.4.target084171.stderr: 3: (ceph::HeartbeatMap::reset_timeout(ceph::heartbeat_handle_d*, long, long)+0x72) [0xb02342]
2016-01-06T23:09:08.335 INFO:tasks.ceph.osd.4.target084171.stderr: 4: (FileStore::_do_transaction(ObjectStore::Transaction&, unsigned long, int, ThreadPool::TPHandle*)+0x3e2) [0x923fd2]
2016-01-06T23:09:08.336 INFO:tasks.ceph.osd.4.target084171.stderr: 5: (FileStore::_do_transactions(std::list<ObjectStore::Transaction*, std::allocator<ObjectStore::Transaction*> >&, unsigned long, ThreadPool::TPHandle*)+0x64) [0x92acd4]
2016-01-06T23:09:08.336 INFO:tasks.ceph.osd.4.target084171.stderr: 6: (FileStore::_do_op(FileStore::OpSequencer*, ThreadPool::TPHandle&)+0x180) [0x92ae70]
2016-01-06T23:09:08.336 INFO:tasks.ceph.osd.4.target084171.stderr: 7: (ThreadPool::worker(ThreadPool::WorkThread*)+0xa56) [0xbb6866]
2016-01-06T23:09:08.336 INFO:tasks.ceph.osd.4.target084171.stderr: 8: (ThreadPool::WorkThread::entry()+0x10) [0xbb7910]
2016-01-06T23:09:08.336 INFO:tasks.ceph.osd.4.target084171.stderr: 9: (()+0x8182) [0x7f44bae01182]
2016-01-06T23:09:08.336 INFO:tasks.ceph.osd.4.target084171.stderr: 10: (clone()+0x6d) [0x7f44b936c47d]
2016-01-06T23:09:08.336 INFO:tasks.ceph.osd.4.target084171.stderr: NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

#2 Updated by Zack Cerza over 3 years ago

  • Assignee set to Tamilarasi muthamizhan

Not teuthology!

#3 Updated by Yuri Weinstein over 3 years ago

  • Assignee deleted (Tamilarasi muthamizhan)

see #13901

#4 Updated by Tamilarasi muthamizhan over 3 years ago

  • Status changed from New to Duplicate
  • Release deleted (hammer)
  • Release set to giant

#5 Updated by Tamilarasi muthamizhan over 3 years ago

  • Project changed from teuthology to ceph-qa-suite
  • Parent task set to #13901
  • Release deleted (giant)
  • Release set to hammer

Also available in: Atom PDF