Bug #61947

closed

mds: enforce a limit on the size of a session in the sessionmap

Added by Patrick Donnelly 10 months ago. Updated 2 months ago.

Status: Resolved
Priority: Normal
Assignee:
Category: Correctness/Safety
Target version:
% Done: 0%
Source:
Tags: backport_processed
Backport: reef,quincy,pacific
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS): MDS
Labels (FS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

If a session's "completed_requests" vector grows too large, the encoded session can become so big that the OSD rejects sessionmap object updates with "Message too long" (error 90). The MDS does not handle that write error and forces the file system read-only:

2023-07-10 13:53:30.529 7f8fed08b700  0 log_channel(cluster) log [WRN] : client.744507717 does not advance its oldest_client_tid (3221389957), 5905929 completed requests recorded in session
2023-07-10 13:53:30.529 7f8fed08b700  0 log_channel(cluster) log [WRN] : client.744507717 does not advance its oldest_client_tid (3221389957), 5905929 completed requests recorded in session
2023-07-10 13:53:30.530 7f8fed08b700  0 log_channel(cluster) log [WRN] : client.744507717 does not advance its oldest_client_tid (3221389957), 5905929 completed requests recorded in session
2023-07-10 13:53:30.534 7f8fed08b700  0 log_channel(cluster) log [WRN] : client.744507717 does not advance its oldest_client_tid (3221389957), 5905929 completed requests recorded in session
2023-07-10 13:53:30.534 7f8fed08b700  0 log_channel(cluster) log [WRN] : client.744507717 does not advance its oldest_client_tid (3221389957), 5905929 completed requests recorded in session
2023-07-10 13:53:30.534 7f8fed08b700  0 log_channel(cluster) log [WRN] : client.744507717 does not advance its oldest_client_tid (3221389957), 5905929 completed requests recorded in session
2023-07-10 13:53:35.635 7f8fe687e700 -1 mds.0.2679609 unhandled write error (90) Message too long, force readonly...
2023-07-10 13:53:35.635 7f8fe687e700  1 mds.0.cache force file system read-only
2023-07-10 13:53:35.635 7f8fe687e700  0 log_channel(cluster) log [WRN] : force file system read-only

Proposed fix: if a session's encoded size exceeds a configurable limit (maybe 16MB), evict the session.
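
The proposed check can be sketched as a toy model. This is not MDS code: the `Session` class, `should_evict`, the 16-bytes-per-entry cost, and the `SESSION_SIZE_LIMIT` name are all assumptions made for illustration. Only the "maybe 16MB" figure and the idea of evicting an oversized session come from the description above.

```python
from dataclasses import dataclass, field

# Assumed cost of one completed-request entry (an 8-byte tid plus an
# 8-byte inode number). Illustrative only, not the real MDS encoding.
ENTRY_BYTES = 16

# Hypothetical name for the configurable limit; 16 MiB mirrors the
# "maybe 16MB" suggested in the description.
SESSION_SIZE_LIMIT = 16 * 1024 * 1024


@dataclass
class Session:
    client_id: int
    completed_requests: list = field(default_factory=list)

    def add_completed_request(self, tid: int) -> None:
        """Record a completed request until the client acknowledges it."""
        self.completed_requests.append(tid)

    def trim_completed_requests(self, oldest_client_tid: int) -> None:
        """Drop entries below the oldest tid the client still cares about."""
        self.completed_requests = [
            tid for tid in self.completed_requests if tid >= oldest_client_tid
        ]

    def encoded_size(self) -> int:
        """Rough size estimate; the fixed per-entry cost is an assumption."""
        return len(self.completed_requests) * ENTRY_BYTES


def should_evict(session: Session, limit: int = SESSION_SIZE_LIMIT) -> bool:
    """Return True when the session's estimated encoded size exceeds the limit."""
    return session.encoded_size() > limit
```

Under these assumptions, a client that never advances its `oldest_client_tid` is never trimmed, so its estimated size grows without bound until `should_evict` returns `True`. A client that does advance is trimmed back under the limit and kept.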


Subtasks 1 (1 open, 0 closed)

Bug #62257: mds: blocklist clients that are not advancing `oldest_client_tid` (New, Venky Shankar)

Related issues 4 (1 open, 3 closed)

Related to CephFS - Bug #63364: MDS_CLIENT_OLDEST_TID: 15 clients failing to advance oldest client/flush tid (Pending Backport, Xiubo Li)
Copied to CephFS - Backport #62583: reef: mds: enforce a limit on the size of a session in the sessionmap (Resolved, Venky Shankar)
Copied to CephFS - Backport #62584: pacific: mds: enforce a limit on the size of a session in the sessionmap (Resolved, Venky Shankar)
Copied to CephFS - Backport #62585: quincy: mds: enforce a limit on the size of a session in the sessionmap (Resolved, Venky Shankar)
