Bug #46675
closed
nautilus: fs/upgrade test: Crash: 'wait_until_healthy' reached maximum tries (150) after waiting for 900 seconds
% Done: 0%
Source: Q/A
Tags:
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite: fs
Component(FS): MDS
Labels (FS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):
Description
The following failure was seen in the fs/upgrade tests of the fs suite during Yuri's nautilus backport testing:
https://pulpito.ceph.com/yuriw-2020-07-20_15:26:34-fs-wip-yuri3-testing-2020-07-17-1802-nautilus-distro-basic-smithi/5244211/
https://pulpito.ceph.com/yuriw-2020-07-20_15:26:34-fs-wip-yuri3-testing-2020-07-17-1802-nautilus-distro-basic-smithi/5244231/
020-07-20T20:19:22.073 DEBUG:teuthology.misc:Ceph health: HEALTH_WARN 1 filesystem is degraded
2020-07-20T20:19:22.186 INFO:tasks.ceph.mds.b.smithi141.stderr:2020-07-20 20:19:22.187 7fc4ebfca700 -1 mds.0.openfiles _load_finish got (2) No such file or directory
2020-07-20T20:19:26.338 INFO:tasks.ceph.mds.b.smithi141.stderr:*** Caught signal (Segmentation fault) **
2020-07-20T20:19:26.339 INFO:tasks.ceph.mds.b.smithi141.stderr: in thread 7fc4ea7c7700 thread_name:md_log_replay
2020-07-20T20:19:26.339 INFO:tasks.ceph.mds.b.smithi141.stderr: ceph version 14.2.10-110-g15a3f7c (15a3f7cb9151a44eaf5129b83ccb0deb5ee0916d) nautilus (stable)
2020-07-20T20:19:26.339 INFO:tasks.ceph.mds.b.smithi141.stderr: 1: (()+0x128a0) [0x7fc4fa7848a0]
2020-07-20T20:19:26.340 INFO:tasks.ceph.mds.b.smithi141.stderr: 2: (MDCache::finish_uncommitted_slave(metareqid_t, bool)+0x21e) [0x55a546b5aede]
2020-07-20T20:19:26.340 INFO:tasks.ceph.mds.b.smithi141.stderr: 3: (ESlaveUpdate::replay(MDSRank*)+0xf9) [0x55a546d58af9]
2020-07-20T20:19:26.340 INFO:tasks.ceph.mds.b.smithi141.stderr: 4: (MDLog::_replay_thread()+0x8b2) [0x55a546cf6592]
2020-07-20T20:19:26.340 INFO:tasks.ceph.mds.b.smithi141.stderr: 5: (MDLog::ReplayThread::entry()+0xd) [0x55a546a5883d]
2020-07-20T20:19:26.341 INFO:tasks.ceph.mds.b.smithi141.stderr: 6: (()+0x76db) [0x7fc4fa7796db]
2020-07-20T20:19:26.341 INFO:tasks.ceph.mds.b.smithi141.stderr: 7: (clone()+0x3f) [0x7fc4f995fa3f]
2020-07-20T20:19:26.341 INFO:tasks.ceph.mds.b.smithi141.stderr:2020-07-20 20:19:26.339 7fc4ea7c7700 -1 *** Caught signal (Segmentation fault) **
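When checking the tracker for an existing report of a crash like this, the symbolized frames of the backtrace are probably the most useful search key. The following is a minimal sketch, assuming the backtrace line format shown in the log above; `FRAME_RE`, `extract_frames`, and `SAMPLE` are hypothetical names for illustration and are not part of teuthology or Ceph.

```python
import re

# Assumed frame format, taken from the log above, e.g.
# " 2: (MDCache::finish_uncommitted_slave(metareqid_t, bool)+0x21e) [0x55a546b5aede]"
FRAME_RE = re.compile(r"\s(\d+): \((.*?)\+0x[0-9a-f]+\) \[0x[0-9a-f]+\]")


def extract_frames(log_text):
    """Return (frame_number, symbol) pairs for symbolized frames only."""
    frames = []
    for match in FRAME_RE.finditer(log_text):
        symbol = match.group(2)
        if symbol == "()":
            # Unsymbolized frame such as "(()+0x128a0)"; useless as a search key.
            continue
        frames.append((int(match.group(1)), symbol))
    return frames


# Three frames copied from the log in this report.
SAMPLE = (
    "INFO:tasks.ceph.mds.b.smithi141.stderr: 1: (()+0x128a0) [0x7fc4fa7848a0]\n"
    "INFO:tasks.ceph.mds.b.smithi141.stderr: 2: "
    "(MDCache::finish_uncommitted_slave(metareqid_t, bool)+0x21e) [0x55a546b5aede]\n"
    "INFO:tasks.ceph.mds.b.smithi141.stderr: 3: "
    "(ESlaveUpdate::replay(MDSRank*)+0xf9) [0x55a546d58af9]\n"
)

if __name__ == "__main__":
    for number, symbol in extract_frames(SAMPLE):
        print(number, symbol)
```

The first symbolized frame here is `MDCache::finish_uncommitted_slave(metareqid_t, bool)`, which is the function name to search the tracker for.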
Updated by Ramana Raja almost 4 years ago
- ceph-qa-suite fs added
- Component(FS) MDS added
Updated by Patrick Donnelly almost 4 years ago
- Status changed from New to Need More Info
- Assignee set to Ramana Raja
Updated by Ramana Raja almost 4 years ago
This failure wasn't seen in v14.2.10 release testing:
https://tracker.ceph.com/issues/46039#note-2
https://pulpito.ceph.com/yuriw-2020-06-19_18:38:18-fs-nautilus-distro-basic-smithi/
Updated by Patrick Donnelly over 3 years ago
- Is duplicate of Bug #46831: nautilus: mds: SIGSEGV in MDCache::finish_uncommitted_slave added
Updated by Patrick Donnelly over 3 years ago
- Status changed from Need More Info to Duplicate
Sorry, I missed this tracker ticket when searching. I'll mark this one as a duplicate since the other already has the fix linked.