Project

General

Profile

Bug #10317

pg repair doesn't fix missing attr

Added by Luis Periquito over 9 years ago. Updated almost 9 years ago.

Status:
Can't reproduce
Priority:
Urgent
Assignee:
David Zafman
Category:
-
Target version:
-
% Done:

0%

Source:
other
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

I have a size 4 replicated pool and got theses messages in the primary OSD for a PG:
2014-12-10 09:38:09.348110 7f8f618be700 0 log [ERR] : 9.180 deep-scrub 0 missing, 1 inconsistent objects
2014-12-10 09:38:09.348116 7f8f618be700 0 log [ERR] : 9.180 deep-scrub 1 errors
2014-12-10 10:13:15.922065 7f8f618be700 0 log [INF] : 9.180 repair ok, 0 fixed
2014-12-10 10:55:27.556358 7f8f618be700 0 log [ERR] : 9.180 shard 6: soid 370cbf80/29145.4_xxx/head//9 missing attr _, missing attr _user.rgw.acl, missing attr _user.rgw.content_type, missing attr _user.rgw.etag, missing attr _user.rgw.idtag, missing attr _user.rgw.manifest, missing attr _user.rgw.x-amz-meta-md5sum, missing attr _user.rgw.x-amz-meta-stat, missing attr snapset
2014-12-10 10:56:50.597952 7f8f618be700 0 log [ERR] : 9.180 deep-scrub 0 missing, 1 inconsistent objects
2014-12-10 10:56:50.597957 7f8f618be700 0 log [ERR] : 9.180 deep-scrub 1 errors

The PG dump for this PG is:
9.180 25614 0 0 0 23306482348 3001 3001 active+clean+inconsistent 2014-12-10 17:29:01.937929 40242'1108124 40242:23305321 [16,10,27,6] 16 [16,10,27,6]16 40242'1071363 2014-12-10 17:29:01.937881 40242'1071363 2014-12-10 17:29:01.937881

As the error was pointing to a secondary PG I executed a pg repair, which maintaned the error (verified with a deep-scrub). This was caused by a disk that would come to fail in the days afterwards on OSD.6.

I'm running the current latest firefly (0.80.7).

History

#1 Updated by Samuel Just over 9 years ago

  • Priority changed from Normal to Urgent

#2 Updated by Loïc Dachary about 9 years ago

  • Assignee set to Loïc Dachary

#3 Updated by Loïc Dachary almost 9 years ago

  • Assignee deleted (Loïc Dachary)
  • Regression set to No

#4 Updated by David Zafman almost 9 years ago

  • Status changed from New to Can't reproduce
  • Assignee set to David Zafman

Can not reproduce on latest code in master branch nor in firefly "ceph version 0.80.9-255-gd0f9c5f (d0f9c5f47024f53b4eccea2e0fde9b7844746362)"

This is the commit which will probably be the next firefly point release.

I tried both a replicated pool and an erasure coded pool.

Also available in: Atom PDF