Bug #46675

closed

nautilus: fs/upgrade test: Crash: 'wait_until_healthy' reached maximum tries (150) after waiting for 900 seconds

Added by Ramana Raja almost 4 years ago. Updated over 3 years ago.

Status: Duplicate
Priority: Normal
Assignee:
Category: -
Target version: -
% Done: 0%
Source: Q/A
Tags:
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite: fs
Component(FS): MDS
Labels (FS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

The following failure appeared in Yuri's nautilus backport testing, in the fs/upgrade tests of the fs suite:
https://pulpito.ceph.com/yuriw-2020-07-20_15:26:34-fs-wip-yuri3-testing-2020-07-17-1802-nautilus-distro-basic-smithi/5244211/
https://pulpito.ceph.com/yuriw-2020-07-20_15:26:34-fs-wip-yuri3-testing-2020-07-17-1802-nautilus-distro-basic-smithi/5244231/

2020-07-20T20:19:22.073 DEBUG:teuthology.misc:Ceph health: HEALTH_WARN 1 filesystem is degraded
2020-07-20T20:19:22.186 INFO:tasks.ceph.mds.b.smithi141.stderr:2020-07-20 20:19:22.187 7fc4ebfca700 -1 mds.0.openfiles _load_finish got (2) No such file or directory
2020-07-20T20:19:26.338 INFO:tasks.ceph.mds.b.smithi141.stderr:*** Caught signal (Segmentation fault) **
2020-07-20T20:19:26.339 INFO:tasks.ceph.mds.b.smithi141.stderr: in thread 7fc4ea7c7700 thread_name:md_log_replay
2020-07-20T20:19:26.339 INFO:tasks.ceph.mds.b.smithi141.stderr: ceph version 14.2.10-110-g15a3f7c (15a3f7cb9151a44eaf5129b83ccb0deb5ee0916d) nautilus (stable)
2020-07-20T20:19:26.339 INFO:tasks.ceph.mds.b.smithi141.stderr: 1: (()+0x128a0) [0x7fc4fa7848a0]
2020-07-20T20:19:26.340 INFO:tasks.ceph.mds.b.smithi141.stderr: 2: (MDCache::finish_uncommitted_slave(metareqid_t, bool)+0x21e) [0x55a546b5aede]
2020-07-20T20:19:26.340 INFO:tasks.ceph.mds.b.smithi141.stderr: 3: (ESlaveUpdate::replay(MDSRank*)+0xf9) [0x55a546d58af9]
2020-07-20T20:19:26.340 INFO:tasks.ceph.mds.b.smithi141.stderr: 4: (MDLog::_replay_thread()+0x8b2) [0x55a546cf6592]
2020-07-20T20:19:26.340 INFO:tasks.ceph.mds.b.smithi141.stderr: 5: (MDLog::ReplayThread::entry()+0xd) [0x55a546a5883d]
2020-07-20T20:19:26.341 INFO:tasks.ceph.mds.b.smithi141.stderr: 6: (()+0x76db) [0x7fc4fa7796db]
2020-07-20T20:19:26.341 INFO:tasks.ceph.mds.b.smithi141.stderr: 7: (clone()+0x3f) [0x7fc4f995fa3f]
2020-07-20T20:19:26.341 INFO:tasks.ceph.mds.b.smithi141.stderr:2020-07-20 20:19:26.339 7fc4ea7c7700 -1 *** Caught signal (Segmentation fault) **
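
For context on the error in the title: the MDS segfaults during journal replay, so the cluster stays at HEALTH_WARN and the health wait eventually gives up. The sketch below is not teuthology's code; it is a minimal bounded polling loop showing how 150 tries could add up to 900 seconds. The 6-second interval is an assumption inferred from 900 / 150, and `get_health`, `MaxTriesReached` and the `sleep` parameter are hypothetical names used only for illustration.

```python
import time
from typing import Callable


class MaxTriesReached(Exception):
    """Raised when the health check never succeeds within the allowed tries."""


def wait_until_healthy(
    get_health: Callable[[], str],
    tries: int = 150,          # from the bug title
    interval: float = 6.0,     # assumed: 900 seconds / 150 tries
    sleep: Callable[[float], None] = time.sleep,
) -> int:
    """Poll get_health() until it reports HEALTH_OK.

    Returns the number of polls used, or raises MaxTriesReached once
    `tries` polls have all come back unhealthy.
    """
    for attempt in range(1, tries + 1):
        if get_health().startswith("HEALTH_OK"):
            return attempt
        # Still degraded (e.g. the MDS crashed and cannot finish replay).
        sleep(interval)
    raise MaxTriesReached(
        f"'wait_until_healthy' reached maximum tries ({tries}) "
        f"after waiting for {int(tries * interval)} seconds"
    )
```

If `get_health` keeps returning "HEALTH_WARN 1 filesystem is degraded", as in the log above, the loop exhausts all tries and raises. Passing a no-op `sleep` lets the loop be exercised without actually waiting.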

Related issues 1 (0 open, 1 closed)

Is duplicate of CephFS - Bug #46831: nautilus: mds: SIGSEGV in MDCache::finish_uncommitted_slave (Resolved, Zheng Yan)

Actions #1

Updated by Ramana Raja almost 4 years ago

  • ceph-qa-suite fs added
  • Component(FS) MDS added
Actions #2

Updated by Patrick Donnelly over 3 years ago

  • Status changed from New to Need More Info
  • Assignee set to Ramana Raja
Actions #4

Updated by Patrick Donnelly over 3 years ago

  • Is duplicate of Bug #46831: nautilus: mds: SIGSEGV in MDCache::finish_uncommitted_slave added
Actions #5

Updated by Patrick Donnelly over 3 years ago

  • Status changed from Need More Info to Duplicate

Sorry I missed this tracker ticket when searching. I'll mark this as duplicate since the other already has the fix linked up.
