Bug #65094

open

mds: STATE_STARTING does not add the root ino for the root rank, and a failure during STATE_STARTING is not handled correctly

Added by ethan wu about 2 months ago. Updated about 2 months ago.

Status:
Fix Under Review
Priority:
Normal
Assignee:
Category:
Correctness/Safety
Target version:
% Done:
0%

Source:
Community (dev)
Tags:
Backport:
quincy,reef,squid
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS):
MDS
Labels (FS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

The root rank does not add the root inode to its subtree auth when it enters STATE_STARTING.
It also does not handle STATE_STARTING correctly when the MDS fails or is stopped while in STATE_STARTING.

This will cause rank damage or a ceph_assert failure when an MDS failover/switchover happens later.

The following are the related logs:

a.
-1 log_channel(cluster) log [ERR] : No subtrees found for root MDS rank!

b.
-15> 2024-03-24T18:06:19.461+0800 7f1542cbf700 0 mds.0.journal EMetaBlob.replay missing dir ino 0x10000000000
-14> 2024-03-24T18:06:19.461+0800 7f1542cbf700 -1 log_channel(cluster) log [ERR] : failure replaying journal (EMetaBlob)

c.
-6> 2024-03-24T19:39:59.593+0800 7f5903f02700 -1 log_channel(cluster) log [ERR] : replayed ESubtreeMap at 4209845 subtree root 0x1 not in cache
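
For triage, the three signatures above can be searched for in an MDS log such as the attached one. The following is only a rough sketch, not part of the fix: the labels a/b/c mirror the ones used in this report, the regular expressions are derived from the excerpts above, and the function names `classify_lines` and `classify_file` are hypothetical helpers introduced for illustration.

```python
import re

# Hypothetical signature table. Keys mirror the a/b/c labels in this report;
# patterns are derived from the log excerpts quoted above.
SIGNATURES = {
    "a": re.compile(r"No subtrees found for root MDS rank"),
    "b": re.compile(
        r"EMetaBlob\.replay missing dir ino"
        r"|failure replaying journal \(EMetaBlob\)"
    ),
    "c": re.compile(
        r"replayed ESubtreeMap at \d+ subtree root 0x[0-9a-fA-F]+ not in cache"
    ),
}


def classify_lines(lines):
    """Return a dict mapping each label to the list of matching log lines."""
    hits = {label: [] for label in SIGNATURES}
    for line in lines:
        for label, pattern in SIGNATURES.items():
            if pattern.search(line):
                hits[label].append(line.rstrip("\n"))
    return hits


def classify_file(path):
    """Scan a log file on disk; undecodable bytes are replaced, not fatal."""
    with open(path, errors="replace") as f:
        return classify_lines(f)
```

Calling `classify_file("mds.b.log")` on a local copy of the attachment would return the matching lines grouped by label; an empty list for a label only means that signature was not seen in that particular log.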


Files

mds.b.log (354 KB), ethan wu, 03/25/2024 12:56 PM