Bug #10317
pg repair doesn't fix missing attr
Description
I have a size 4 replicated pool and got these messages on the primary OSD for a PG:
2014-12-10 09:38:09.348110 7f8f618be700 0 log [ERR] : 9.180 deep-scrub 0 missing, 1 inconsistent objects
2014-12-10 09:38:09.348116 7f8f618be700 0 log [ERR] : 9.180 deep-scrub 1 errors
2014-12-10 10:13:15.922065 7f8f618be700 0 log [INF] : 9.180 repair ok, 0 fixed
2014-12-10 10:55:27.556358 7f8f618be700 0 log [ERR] : 9.180 shard 6: soid 370cbf80/29145.4_xxx/head//9 missing attr _, missing attr _user.rgw.acl, missing attr _user.rgw.content_type, missing attr _user.rgw.etag, missing attr _user.rgw.idtag, missing attr _user.rgw.manifest, missing attr _user.rgw.x-amz-meta-md5sum, missing attr _user.rgw.x-amz-meta-stat, missing attr snapset
2014-12-10 10:56:50.597952 7f8f618be700 0 log [ERR] : 9.180 deep-scrub 0 missing, 1 inconsistent objects
2014-12-10 10:56:50.597957 7f8f618be700 0 log [ERR] : 9.180 deep-scrub 1 errors
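To make the shard error easier to read, the following is a minimal sketch that pulls the PG id, shard number, object id and missing attribute names out of such a line. The regular expression and the function name `parse_scrub_error` are hypothetical and only derived from the single log line above; other Ceph versions may format this message differently.

```python
import re

# The shard error line, copied verbatim from the log above.
LINE = (
    "2014-12-10 10:55:27.556358 7f8f618be700 0 log [ERR] : 9.180 shard 6: "
    "soid 370cbf80/29145.4_xxx/head//9 missing attr _, "
    "missing attr _user.rgw.acl, missing attr _user.rgw.content_type, "
    "missing attr _user.rgw.etag, missing attr _user.rgw.idtag, "
    "missing attr _user.rgw.manifest, missing attr _user.rgw.x-amz-meta-md5sum, "
    "missing attr _user.rgw.x-amz-meta-stat, missing attr snapset"
)

# Hypothetical pattern inferred from that one line: "<pgid> shard <n>: soid <object> <details>".
PATTERN = re.compile(
    r"(?P<pgid>\d+\.[0-9a-f]+) shard (?P<shard>\d+): soid (?P<soid>\S+) (?P<rest>.*)"
)


def parse_scrub_error(line):
    """Return pgid, shard, soid and the list of missing attrs, or None if no match."""
    match = PATTERN.search(line)
    if match is None:
        return None
    # Each problem is reported as "missing attr <name>", separated by commas.
    attrs = re.findall(r"missing attr ([^,\s]+)", match.group("rest"))
    return {
        "pgid": match.group("pgid"),
        "shard": int(match.group("shard")),
        "soid": match.group("soid"),
        "missing_attrs": attrs,
    }


result = parse_scrub_error(LINE)
print(result["pgid"], result["shard"], len(result["missing_attrs"]))
```

Here every xattr of the object, including `_` and `snapset`, is reported missing on shard 6, which suggests the copy on that OSD lost all of its attributes rather than a single one.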
The PG dump for this PG is:
9.180 25614 0 0 0 23306482348 3001 3001 active+clean+inconsistent 2014-12-10 17:29:01.937929 40242'1108124 40242:23305321 [16,10,27,6] 16 [16,10,27,6]16 40242'1071363 2014-12-10 17:29:01.937881 40242'1071363 2014-12-10 17:29:01.937881
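As a rough illustration of how to read that dump line, the sketch below extracts the PG state and the two bracketed OSD sets and checks whether OSD 6 is a non-primary member. It assumes that the first bracketed list is the up set and the second the acting set, and that the first OSD in the acting set is the primary; the helper name `summarize_pg` is hypothetical.

```python
import re

# The PG dump line, copied verbatim from above.
DUMP = (
    "9.180 25614 0 0 0 23306482348 3001 3001 active+clean+inconsistent "
    "2014-12-10 17:29:01.937929 40242'1108124 40242:23305321 "
    "[16,10,27,6] 16 [16,10,27,6]16 40242'1071363 2014-12-10 17:29:01.937881 "
    "40242'1071363 2014-12-10 17:29:01.937881"
)


def summarize_pg(line):
    """Extract pgid, state flags and OSD sets from one pg dump line."""
    pgid = line.split()[0]
    # The state is the first token made of lowercase words joined by '+'.
    state = re.search(r"[a-z_]+(?:\+[a-z_]+)*", line).group(0)
    # Assumption: first bracketed list is the up set, second is the acting set.
    sets = [[int(osd) for osd in s.split(",")]
            for s in re.findall(r"\[([\d,]+)\]", line)]
    up, acting = sets[0], sets[1]
    return {
        "pgid": pgid,
        "flags": state.split("+"),
        "up": up,
        "acting": acting,
        "primary": acting[0],  # assumption: first acting OSD is the primary
    }


info = summarize_pg(DUMP)
is_replica = 6 in info["acting"] and info["primary"] != 6
print(info["pgid"], info["flags"], info["acting"], is_replica)
```

Under those assumptions, OSD 16 is the primary and OSD 6 holds one of the three replica copies, which matches the shard named in the scrub error.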
As the error pointed to a secondary replica, I ran a pg repair, but the error persisted (verified with a deep-scrub). The cause turned out to be a disk on OSD.6 that failed in the following days.
I'm running the latest firefly release (0.80.7).
History
#1 Updated by Samuel Just over 9 years ago
- Priority changed from Normal to Urgent
#2 Updated by Loïc Dachary about 9 years ago
- Assignee set to Loïc Dachary
#3 Updated by Loïc Dachary almost 9 years ago
- Assignee deleted (Loïc Dachary)
- Regression set to No
#4 Updated by David Zafman almost 9 years ago
- Status changed from New to Can't reproduce
- Assignee set to David Zafman
Cannot reproduce on the latest code in the master branch or in firefly "ceph version 0.80.9-255-gd0f9c5f (d0f9c5f47024f53b4eccea2e0fde9b7844746362)"
This is the commit which will probably be the next firefly point release.
I tried both a replicated pool and an erasure coded pool.