Project

General

Profile

Bug #14295

failed to recover before timeout expired

Added by Tamilarasi muthamizhan over 3 years ago. Updated over 3 years ago.

Status:
Closed
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
Start date:
01/07/2016
Due date:
% Done:

0%

Source:
Q/A
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
rados
Pull request ID:

Description

logs: @teuthology.ovh.sepia.ceph.com: /a/teuthology-2016-01-06_20:55:01-rados-hammer-distro-basic-openstack/61129

2016-01-06 22:00:04.321982 7f2d5d9f8700 -1 common/HeartbeatMap.cc: In function '
bool ceph::HeartbeatMap::_check(ceph::heartbeat_handle_d*, const char*, time_t)'
 thread 7f2d5d9f8700 time 2016-01-06 22:00:03.657060
common/HeartbeatMap.cc: 79: FAILED assert(0 == "hit suicide timeout")

 ceph version 0.94.5-178-g9739d4d (9739d4de49f8167866eda556b2f1581c068ec8a7)
 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x8b) 
[0xbc5c7b]
 2: (ceph::HeartbeatMap::_check(ceph::heartbeat_handle_d*, char const*, long)+0x
2a9) [0xb02039]
 3: (ceph::HeartbeatMap::is_healthy()+0xd6) [0xb028c6]
 4: (ceph::HeartbeatMap::check_touch_file()+0x17) [0xb02fa7]
 5: (CephContextServiceThread::entry()+0x14b) [0xbd5d6b]
 6: (()+0x8182) [0x7f2d6259d182]
 7: (clone()+0x6d) [0x7f2d60b0847d]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

History

#1 Updated by Tamilarasi muthamizhan over 3 years ago

another one with same log: @teuthology.ovh.sepia.ceph.com: /a/teuthology-2016-01-06_20:55:01-rados-hammer-distro-basic-openstack/61100

#2 Updated by Yuri Weinstein over 3 years ago

General assessment on these errors was that openstack/ovh is slow running tests, but we observe more of those now.

#3 Updated by Samuel Just over 3 years ago

  • Status changed from New to Closed

Also available in: Atom PDF