Bug #21294

closed

ceph_manager: bad AssertionError: failed to recover before timeout expired

Added by Sage Weil over 6 years ago. Updated over 6 years ago.

Status: Resolved
Priority: Urgent
Assignee: -
Category: -
Target version: -
% Done: 0%
Source:
Tags:
Backport: luminous
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

The PGs are all active+clean and have been that way for many minutes, yet wait_for_recovery still hit its timeout. Not sure how ceph_manager.py got it wrong.

2017-09-07T04:28:23.145 INFO:tasks.thrashosds.thrasher:Traceback (most recent call last):
  File "/home/teuthworker/src/github.com_ceph_ceph-c_wip-sage-testing2-2017-09-06-1800/qa/tasks/ceph_manager.py", line 909, in wrapper
    return func(self)
  File "/home/teuthworker/src/github.com_ceph_ceph-c_wip-sage-testing2-2017-09-06-1800/qa/tasks/ceph_manager.py", line 1026, in do_thrash
    timeout=self.config.get('timeout')
  File "/home/teuthworker/src/github.com_ceph_ceph-c_wip-sage-testing2-2017-09-06-1800/qa/tasks/ceph_manager.py", line 2225, in wait_for_recovery
    'failed to recover before timeout expired'
AssertionError: failed to recover before timeout expired

/a/sage-2017-09-07_00:58:30-rados-wip-sage-testing2-2017-09-06-1800-distro-basic-smithi/1602705
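
For readers unfamiliar with this kind of check, here is a minimal sketch of a poll-until-timeout loop that fails with the same assertion message as the traceback above. It is not the code from ceph_manager.py: the helper name `wait_for`, its parameters, and the default polling interval are all assumptions made for illustration.

```python
import time


def wait_for(predicate, timeout=None, interval=3.0,
             clock=time.monotonic, sleep=time.sleep):
    """Poll predicate() until it returns True.

    Raises AssertionError once `timeout` seconds have elapsed without
    success; waits forever when timeout is None.
    Hypothetical helper for illustration, not taken from ceph_manager.py.
    """
    start = clock()
    while not predicate():
        if timeout is not None:
            # Same message as in the traceback; the check itself is a sketch.
            assert clock() - start < timeout, \
                'failed to recover before timeout expired'
        sleep(interval)
```

A caller would pass a predicate such as "every PG reports active+clean". If that predicate is wrong about what counts as recovered, the loop times out even on a healthy cluster, which is the kind of failure reported here.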


Related issues 1 (0 open, 1 closed)

Copied to Ceph - Backport #21548: luminous: ceph_manager: bad AssertionError: failed to recover before timeout expired (Resolved, Nathan Cutler)
Actions #1

Updated by Sage Weil over 6 years ago

  • Priority changed from High to Urgent

/a/sage-2017-09-13_13:31:57-rados-wip-sage-testing-2017-09-12-1750-distro-basic-smithi/1627905

Actions #2

Updated by huang jun over 6 years ago

There is a PG in the "active+undersized+degraded+forced_backfill" state in run /a/sage-2017-09-13_13:31:57-rados-wip-sage-testing-2017-09-12-1750-distro-basic-smithi/1627905
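
To make the observation concrete, a small sketch of splitting a PG state string into its individual flags and checking whether it counts as active+clean. The function names are hypothetical, and the "both `active` and `clean` flags present" rule is an assumption for illustration; it is not necessarily the test that ceph_manager.py applies.

```python
def pg_state_flags(state):
    """Split a PG state string such as 'active+clean' into a set of flags."""
    return set(state.split('+'))


def is_active_clean(state):
    """Assumed rule: a PG is healthy when it has both 'active' and 'clean'."""
    return {'active', 'clean'} <= pg_state_flags(state)


# The state reported in this comment lacks the 'clean' flag.
reported = 'active+undersized+degraded+forced_backfill'
print(sorted(pg_state_flags(reported)))
print(is_active_clean(reported))
```

Under that assumed rule, the PG from run 1627905 would not count as recovered, so a timeout in that run would be expected rather than spurious.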

Actions #3

Updated by huang jun over 6 years ago

Actions #4

Updated by Josh Durgin over 6 years ago

  • Status changed from 12 to 7
Actions #5

Updated by Sage Weil over 6 years ago

  • Status changed from 7 to Pending Backport
  • Backport set to luminous
Actions #6

Updated by Nathan Cutler over 6 years ago

  • Copied to Backport #21548: luminous: ceph_manager: bad AssertionError: failed to recover before timeout expired added
Actions #7

Updated by Nathan Cutler over 6 years ago

  • Status changed from Pending Backport to Resolved