Bug #53862

closed

mds: remove the duplicated or incorrect respond when the pool is full

Added by Xiubo Li over 2 years ago. Updated about 2 years ago.

Status: Resolved
Priority: High
Assignee:
Category: -
Target version:
% Done: 0%
Source:
Tags:
Backport: pacific,octopus
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS): MDS
Labels (FS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

When the metadata pool is full and the MDS forwards a request, the client may receive a stale reply from the old MDS:

Jan 09 01:27:13 gpu027 kernel: libceph: osd9 up
Jan 09 01:27:13 gpu027 kernel: libceph: osd10 up
Jan 09 01:28:55 gpu027 kernel: libceph: osd9 down
Jan 09 01:28:55 gpu027 kernel: libceph: osd10 down
Jan 09 01:32:35 gpu027 kernel: libceph: osd6 weight 0x0 (out)
Jan 09 01:32:35 gpu027 kernel: libceph: osd16 weight 0x0 (out)
Jan 09 01:34:18 gpu027 kernel: libceph: osd1 weight 0x0 (out)
Jan 09 01:39:20 gpu027 kernel: libceph: osd9 weight 0x0 (out)
Jan 09 01:39:20 gpu027 kernel: libceph: osd10 weight 0x0 (out)
Jan 09 01:53:07 gpu027 kernel: ceph: mdsc_handle_reply got 30408991 on session mds1 not mds0
Jan 09 01:53:14 gpu027 kernel: ceph: mdsc_handle_reply got 30409829 on session mds1 not mds0
Jan 09 01:53:15 gpu027 kernel: ceph: mdsc_handle_reply got 30409925 on session mds1 not mds0
Jan 09 01:53:28 gpu027 kernel: ceph: mdsc_handle_reply got 30411416 on session mds1 not mds0
Jan 09 02:05:07 gpu027 kernel: ceph: mdsc_handle_reply got 30417742 on session mds0 not mds1
Jan 09 02:48:52 gpu027 kernel: ceph: mdsc_handle_reply got 30449177 on session mds1 not mds0
Jan 09 02:49:17 gpu027 kernel: ceph: mdsc_handle_reply got 30452750 on session mds1 not mds0
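The mismatch in these log lines can be illustrated with a small model of the client-side bookkeeping. This is only a sketch and not the kernel client code: the `Client` class and its `send`, `handle_forward`, and `handle_reply` methods are hypothetical names, and the model assumes that the client rebinds a request to the new rank when it is forwarded and then rejects any reply for that tid arriving on a different session.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Client:
    """Toy model of a client tracking which MDS rank owns each request."""

    # tid -> rank of the MDS session the request is currently bound to
    pending: Dict[int, int] = field(default_factory=dict)
    log: List[str] = field(default_factory=list)

    def send(self, tid: int, mds: int) -> None:
        # Record the rank the request was sent to.
        self.pending[tid] = mds

    def handle_forward(self, tid: int, new_mds: int) -> None:
        # The old MDS forwarded the request, so rebind it to the new rank.
        if tid in self.pending:
            self.pending[tid] = new_mds

    def handle_reply(self, tid: int, from_mds: int) -> bool:
        expected = self.pending.get(tid)
        if expected is None:
            return False  # unknown or already completed tid
        if expected != from_mds:
            # Reply arrived on a session the request is no longer bound to.
            self.log.append(
                f"mdsc_handle_reply got {tid} on session mds{from_mds} not mds{expected}"
            )
            return False
        del self.pending[tid]
        return True


# Hypothetical sequence: mds1 forwards the request to mds0 but also replies.
client = Client()
client.send(30408991, mds=1)
client.handle_forward(30408991, new_mds=0)
client.handle_reply(30408991, from_mds=1)  # stale reply from the old MDS
print(client.log[0])
```

In this model the stale reply is dropped and the request stays pending until the rank it was forwarded to answers, which is why an extra response from the old MDS shows up on the client only as a logged mismatch.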


Related issues 2 (0 open, 2 closed)

Copied to CephFS - Backport #53908: pacific: mds: remove the duplicated or incorrect respond when the pool is full (Resolved, Xiubo Li)
Copied to CephFS - Backport #53909: octopus: mds: remove the duplicated or incorrect respond when the pool is full (Rejected, Xiubo Li)
Actions #1

Updated by Xiubo Li over 2 years ago

The same issue was seen and reported at https://www.spinics.net/lists/dev-ceph/msg03943.html.

Actions #2

Updated by Xiubo Li over 2 years ago

  • Status changed from Fix Under Review to Pending Backport
  • Backport changed from pacific to pacific,octopus
Actions #3

Updated by Backport Bot over 2 years ago

  • Copied to Backport #53908: pacific: mds: remove the duplicated or incorrect respond when the pool is full added
Actions #4

Updated by Backport Bot over 2 years ago

  • Copied to Backport #53909: octopus: mds: remove the duplicated or incorrect respond when the pool is full added
Actions #5

Updated by Loïc Dachary about 2 years ago

  • Status changed from Pending Backport to Resolved

While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are in status "Resolved" or "Rejected".
