Bug #48805: mds: "cluster [WRN] Scrub error on inode 0x1000000039d (/client.0/tmp/blogbench-1.0/src/blogtest_in) see mds.a log and `damage ls` output for details" - CephFS - Ceph

closed

Added by Patrick Donnelly over 3 years ago. Updated about 2 years ago.

Status: Resolved
Priority: Urgent
Assignee: Milind Changire
Target version: Ceph - v17.0.0
Source: Q/A
Backport: pacific,octopus,nautilus
Severity: 3 - minor
Component(FS): MDS
Labels (FS): qa-failure, scrub, task(medium)
Pull request ID: 40520

Description

2021-01-03T05:03:38.865 INFO:teuthology.orchestra.run.smithi191.stdout:2021-01-03T05:00:26.188079+0000 mds.a (mds.0) 124 : cluster [WRN] Scrub error on inode 0x1000000039d (/client.0/tmp/blogbench-1.0/src/blogtest_in) see mds.a log and `damage ls` output for details

From: /ceph/teuthology-archive/teuthology-2021-01-03_03:15:02-fs-master-distro-basic-smithi/5751921/teuthology.log

Thought we had an issue for this already but I could not find it. Milind is working on this.

Related issues 4 (1 open — 3 closed)

Related to CephFS - Bug #50250: mds: "log [WRN] : Scrub error on inode 0x10000004506 (/client.0/tmp/clients/client3/~dmtmp/COREL) see mds.a log and `damage ls` output for details" ("freshly-calculated rstats don't match existing ones") [New; Milind Changire]

Copied to CephFS - Backport #50251: nautilus: mds: "cluster [WRN] Scrub error on inode 0x1000000039d (/client.0/tmp/blogbench-1.0/src/blogtest_in) see mds.a log and `damage ls` output for details" [Rejected]

Copied to CephFS - Backport #50252: octopus: mds: "cluster [WRN] Scrub error on inode 0x1000000039d (/client.0/tmp/blogbench-1.0/src/blogtest_in) see mds.a log and `damage ls` output for details" [Rejected; Milind Changire]

Copied to CephFS - Backport #50253: pacific: mds: "cluster [WRN] Scrub error on inode 0x1000000039d (/client.0/tmp/blogbench-1.0/src/blogtest_in) see mds.a log and `damage ls` output for details" [Resolved; Patrick Donnelly]

Updated by Patrick Donnelly over 3 years ago

Labels (FS) qa-failure, scrub, task(medium) added

Updated by Patrick Donnelly over 3 years ago

Target version changed from v16.0.0 to v17.0.0
Backport changed from octopus,nautilus to pacific,octopus,nautilus

Updated by Milind Changire about 3 years ago

I'm unable to comment on the exact teuthology run mentioned in the description.
However, testing so far has turned up two types of "scrub error on inode" issues:

1. Backtrace validation for dirs
   So far, this has mostly been seen only when validating unlinked (stray) entries.
   For stray entries, the on-disk backtrace version differs from the in-memory version.
2. Raw stats validation for dirs
   a. The in-memory and on-disk dirstat differ.
      (This needs more investigation; no leads on this one yet.)
   b. The in-memory and on-disk rstat differ.
      Here, the rctime and (file+dir) counts remain the same; only the in-memory rstat version changes.
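
To illustrate the rstat case above, here is a minimal Python sketch of how a mismatch where only the version differs could be told apart from one where the recorded counts or rctime differ. It is not MDS code: the `RStat` record, its field names, and `classify_rstat_mismatch` are hypothetical simplifications made up for this example.

```python
from dataclasses import dataclass, fields


@dataclass(frozen=True)
class RStat:
    """Hypothetical, simplified stand-in for a recursive stat record."""
    version: int
    rctime: float
    rfiles: int
    rsubdirs: int


def classify_rstat_mismatch(on_disk: RStat, in_memory: RStat) -> str:
    """Return 'match', 'version-only', or 'content' for two rstat records."""
    # Collect the names of every field whose value differs between the two.
    differing = {
        f.name
        for f in fields(RStat)
        if getattr(on_disk, f.name) != getattr(in_memory, f.name)
    }
    if not differing:
        return "match"
    if differing == {"version"}:
        # rctime and the file/dir counts agree; only the version moved.
        return "version-only"
    return "content"
```

Under these assumptions, the rstat case described in this comment would be reported as "version-only", while a record whose counts or rctime changed would be reported as "content".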

Updated by Milind Changire about 3 years ago

Status changed from In Progress to Fix Under Review
Pull request ID set to 40520

Updated by Patrick Donnelly about 3 years ago

Related to Bug #50250: mds: "log [WRN] : Scrub error on inode 0x10000004506 (/client.0/tmp/clients/client3/~dmtmp/COREL) see mds.a log and `damage ls` output for details" ("freshly-calculated rstats don't match existing ones") added

Updated by Patrick Donnelly about 3 years ago

Status changed from Fix Under Review to Pending Backport

Updated by Backport Bot about 3 years ago

Copied to Backport #50251: nautilus: mds: "cluster [WRN] Scrub error on inode 0x1000000039d (/client.0/tmp/blogbench-1.0/src/blogtest_in) see mds.a log and `damage ls` output for details" added

Updated by Backport Bot about 3 years ago

Copied to Backport #50252: octopus: mds: "cluster [WRN] Scrub error on inode 0x1000000039d (/client.0/tmp/blogbench-1.0/src/blogtest_in) see mds.a log and `damage ls` output for details" added

Updated by Backport Bot about 3 years ago

Copied to Backport #50253: pacific: mds: "cluster [WRN] Scrub error on inode 0x1000000039d (/client.0/tmp/blogbench-1.0/src/blogtest_in) see mds.a log and `damage ls` output for details" added
