Bug #21294

ceph_manager: bad AssertionError: failed to recover before timeout expired

Added by Sage Weil almost 2 years ago. Updated over 1 year ago.

Status: Resolved
Priority: Urgent
Assignee: -
Category: -
Target version: -
Start date: 09/07/2017
Due date:
% Done: 0%
Source:
Tags:
Backport: luminous
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:

Description

The PGs are all active+clean and have been that way for many minutes. Not sure how ceph_manager.py concluded that recovery had not finished.

2017-09-07T04:28:23.145 INFO:tasks.thrashosds.thrasher:Traceback (most recent call last):
  File "/home/teuthworker/src/github.com_ceph_ceph-c_wip-sage-testing2-2017-09-06-1800/qa/tasks/ceph_manager.py", line 909, in wrapper
    return func(self)
  File "/home/teuthworker/src/github.com_ceph_ceph-c_wip-sage-testing2-2017-09-06-1800/qa/tasks/ceph_manager.py", line 1026, in do_thrash
    timeout=self.config.get('timeout')
  File "/home/teuthworker/src/github.com_ceph_ceph-c_wip-sage-testing2-2017-09-06-1800/qa/tasks/ceph_manager.py", line 2225, in wait_for_recovery
    'failed to recover before timeout expired'
AssertionError: failed to recover before timeout expired

/a/sage-2017-09-07_00:58:30-rados-wip-sage-testing2-2017-09-06-1800-distro-basic-smithi/1602705
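
For readers unfamiliar with this failure mode, the traceback above ends in a wait loop that gives up once a timeout has passed. The following is only a minimal sketch of that general pattern, not the code in ceph_manager.py: the function name `wait_for_condition` and its `is_done`, `interval`, `clock`, and `sleep` parameters are hypothetical, and only the assertion message is taken from the traceback.

```python
import time


def wait_for_condition(is_done, timeout, interval=1.0,
                       clock=time.monotonic, sleep=time.sleep):
    """Poll is_done() until it returns True or `timeout` seconds elapse.

    Hypothetical helper for illustration only. `clock` and `sleep` are
    injectable so the loop can be exercised without real waiting.
    Returns the elapsed time on success.
    """
    start = clock()
    while not is_done():
        # Give up once the allowed time has been used up.
        if clock() - start >= timeout:
            raise AssertionError('failed to recover before timeout expired')
        sleep(interval)
    return clock() - start
```

A loop like this can only be as accurate as the `is_done` check it polls: if that check keeps returning False while the cluster is in fact healthy, the assertion fires even though nothing is wrong, which is the kind of mismatch this report describes.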


Related issues

Copied to Ceph - Backport #21548: luminous: ceph_manager: bad AssertionError: failed to recover before timeout expired (Resolved)

History

#1 Updated by Sage Weil almost 2 years ago

  • Priority changed from High to Urgent

/a/sage-2017-09-13_13:31:57-rados-wip-sage-testing-2017-09-12-1750-distro-basic-smithi/1627905

#2 Updated by huang jun almost 2 years ago

There is a PG in the "active+undersized+degraded+forced_backfill" state in run /a/sage-2017-09-13_13:31:57-rados-wip-sage-testing-2017-09-12-1750-distro-basic-smithi/1627905
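
To illustrate why such a PG would keep a recovery wait from finishing, here is a small sketch that treats a PG state as a set of flags joined by "+". It assumes recovery counts as complete only when every PG carries both the active and clean flags; that rule, the helper names `pg_flags` and `unrecovered`, and the PG ids in the usage note are assumptions for illustration, not the actual check in ceph_manager.py.

```python
def pg_flags(state):
    """Split a PG state string such as 'active+clean' into a set of flags."""
    return set(state.split('+'))


def unrecovered(pg_states):
    """Return the sorted PG ids whose state lacks 'active' or 'clean'.

    `pg_states` is a hypothetical mapping of PG id -> state string.
    """
    required = {'active', 'clean'}
    return sorted(pgid for pgid, state in pg_states.items()
                  if not required <= pg_flags(state))
```

With made-up PG ids, `unrecovered({'1.0': 'active+clean', '1.1': 'active+undersized+degraded+forced_backfill'})` returns `['1.1']`, so under this assumed rule a single PG in that state is enough to hold the wait open until the timeout.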

#4 Updated by Josh Durgin almost 2 years ago

  • Status changed from Verified to Testing

#5 Updated by Sage Weil almost 2 years ago

  • Status changed from Testing to Pending Backport
  • Backport set to luminous

#6 Updated by Nathan Cutler almost 2 years ago

  • Copied to Backport #21548: luminous: ceph_manager: bad AssertionError: failed to recover before timeout expired added

#7 Updated by Nathan Cutler over 1 year ago

  • Status changed from Pending Backport to Resolved
