Bug #50823

open

qa: RuntimeError: timeout waiting for cluster to stabilize

Added by Patrick Donnelly almost 3 years ago. Updated almost 2 years ago.

Status: New
Priority: High
Assignee: -
Category: -
Target version: -
% Done: 0%
Source: Q/A
Tags:
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS): MDS, kceph
Labels (FS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

2021-05-14T22:13:31.164 ERROR:tasks.mds_thrash.fs.[cephfs]:exception:
Traceback (most recent call last):
  File "/home/teuthworker/src/github.com_batrick_ceph_e78e41c7f45263bfc3d22dafa953b7e485aac84d/qa/tasks/mds_thrash.py", line 124, in _run
    self.do_thrash()
  File "/home/teuthworker/src/github.com_batrick_ceph_e78e41c7f45263bfc3d22dafa953b7e485aac84d/qa/tasks/mds_thrash.py", line 312, in do_thrash
    status = self.wait_for_stable(rank, gid)
  File "/home/teuthworker/src/github.com_batrick_ceph_e78e41c7f45263bfc3d22dafa953b7e485aac84d/qa/tasks/mds_thrash.py", line 217, in wait_for_stable
    raise RuntimeError('timeout waiting for cluster to stabilize')
RuntimeError: timeout waiting for cluster to stabilize
2021-05-14T22:13:33.470 INFO:tasks.daemonwatchdog.daemon_watchdog:thrasher.fs.[cephfs] failed

From: /ceph/teuthology-archive/pdonnell-2021-05-14_21:45:42-fs-master-distro-basic-smithi/6115762/teuthology.log

Might be part of a group of issues found with the stock kernel.
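
For readers unfamiliar with the failure mode in the traceback above, here is a minimal sketch of a poll-until-stable loop that gives up with the same RuntimeError once a deadline passes. It is not the code in qa/tasks/mds_thrash.py; the is_stable callable and the timeout, interval, clock, and sleep parameters are assumptions made for illustration.

```python
import time


def wait_for_stable(is_stable, timeout=300.0, interval=2.0,
                    clock=time.monotonic, sleep=time.sleep):
    """Poll is_stable() until it returns a truthy status or the timeout elapses.

    Hypothetical helper for illustration only; the real wait_for_stable in
    qa/tasks/mds_thrash.py takes different arguments (rank, gid).
    """
    deadline = clock() + timeout
    while True:
        status = is_stable()
        if status:
            # The cluster reported a stable state; hand the status back.
            return status
        if clock() >= deadline:
            # Same message as in the traceback of this report.
            raise RuntimeError('timeout waiting for cluster to stabilize')
        sleep(interval)
```

A caller would pass a function that inspects the cluster state and returns a status object once the expected MDS ranks are active; if that never happens within the timeout, the exception surfaces as in the log above.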


Related issues 2 (1 open, 1 closed)

Related to CephFS - Bug #50821: qa: untar_snap_rm failure during mds thrashing (New, Xiubo Li)
Related to CephFS - Bug #50824: qa: snaptest-git-ceph bus error (Won't Fix, Xiubo Li)

Actions #1

Updated by Patrick Donnelly almost 3 years ago

  • Related to Bug #50821: qa: untar_snap_rm failure during mds thrashing added
Actions #2

Updated by Patrick Donnelly almost 3 years ago

  • Related to Bug #50824: qa: snaptest-git-ceph bus error added
Actions #3

Updated by Jos Collin almost 3 years ago

The MDSThrasher timed out for some reason and set the thrasher exception, which caused the daemonwatchdog to bark.
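
The interaction described in this comment can be sketched roughly as follows: a thrasher thread records any exception it hits, and a watchdog-style check reports the thrashers that have one set. This is an illustrative sketch only; the Thrasher class and failed_thrashers function are hypothetical names, not the teuthology implementation.

```python
import threading


class Thrasher(threading.Thread):
    """Hypothetical thrasher: runs a task and records any exception it raises."""

    def __init__(self, name, task):
        super().__init__(name=name, daemon=True)
        self.task = task
        self.exception = None

    def run(self):
        try:
            self.task()
        except Exception as e:
            # Store the exception instead of letting the thread die silently,
            # so that a separate watchdog can notice the failure.
            self.exception = e


def failed_thrashers(thrashers):
    """Return the names of thrashers that recorded an exception.

    A watchdog would "bark" (report failure) when this list is non-empty.
    """
    return [t.name for t in thrashers if t.exception is not None]
```

In this sketch, a timeout raised inside the thrasher's task ends up in its exception attribute, and the next watchdog pass reports that thrasher as failed, which matches the "thrasher.fs.[cephfs] failed" line in the log.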

Actions #4

Updated by Patrick Donnelly almost 2 years ago

  • Target version deleted (v17.0.0)