Bug #1629

pgs stuck degraded (only mapped to 1 osd)

Added by Josh Durgin almost 9 years ago. Updated almost 9 years ago.

Status: Can't reproduce
Priority: Normal
Assignee: -
Category: -
% Done: 0%
Regression: No
Severity: 3 - minor

Description

From teuthology:~teuthworker/archive/nightly_coverage_2011-10-18/636/teuthology.log:

2011-10-18T16:52:13.551 INFO:teuthology.task.thrashosds.ceph_manager:2011-10-18 16:45:50.873497    pg v4880: 144 pgs: 140 active+clean, 4 active+clean+degraded; 143 MB data, 63977 MB used, 3075 GB / 3305 GB avail; 4/106 degraded (3.774%)
2011-10-18 16:45:50.874296   mds e5: 1/1/1 up {0=0=up:active}
2011-10-18 16:45:50.874347   osd e1381: 8 osds: 6 up, 6 in
2011-10-18 16:45:50.874429   log 2011-10-18 03:43:09.626049 mon.0 10.3.14.168:6791/0 349 : [INF] osd.6 out (down for 302.502960)
2011-10-18 16:45:50.874516   mon e1: 3 mons at {0=10.3.14.168:6791/0,1=10.3.14.184:6789/0,2=10.3.14.148:6790/0}

I saved the pg dump, osd dump, osdmap, and crushmap in vit:~joshd/thrash_stuck_active5. The degraded PGs all have up and acting sets of [6]:

0.1e    0    0    0    0    0    0    216    216    active+clean+degraded    93'229    1274'1749    [6]    [6]    0'0    2011-10-18 01:27:23.335645
1.1d    3    0    3    0    8200    8396089    200    200    active+clean+degraded    1252'3993    1274'6390    [6]    [6]    6'1    2011-10-18 01:27:54.387460
0.6    0    0    0    0    0    0    216    216    active+clean+degraded    93'235    1274'1749    [6]    [6]    0'0    2011-10-18 01:27:06.795813
1.5    1    0    1    0    1    205    400    400    active+clean+degraded    1213'1214    1274'3615    [6]    [6]    0'0    2011-10-18 01:27:30.337044
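To pick such PGs out of a larger pg dump, something like the following sketch might help. It assumes each row is whitespace-separated, starts with the PG id, and that the first two bracketed tokens are the up and acting sets, as in the rows above. The function names and the `min_size=2` threshold are hypothetical, chosen for illustration; this is not part of Ceph or teuthology.

```python
import re

# Rows copied from the pg dump excerpt in this report.
PG_DUMP_ROWS = """\
0.1e 0 0 0 0 0 0 216 216 active+clean+degraded 93'229 1274'1749 [6] [6] 0'0 2011-10-18 01:27:23.335645
1.1d 3 0 3 0 8200 8396089 200 200 active+clean+degraded 1252'3993 1274'6390 [6] [6] 6'1 2011-10-18 01:27:54.387460
0.6 0 0 0 0 0 0 216 216 active+clean+degraded 93'235 1274'1749 [6] [6] 0'0 2011-10-18 01:27:06.795813
1.5 1 0 1 0 1 205 400 400 active+clean+degraded 1213'1214 1274'3615 [6] [6] 0'0 2011-10-18 01:27:30.337044
"""

# Matches an OSD set token such as "[6]", "[1,2]" or "[]".
OSD_SET = re.compile(r"^\[(\d+(?:,\d+)*)?\]$")


def parse_osd_set(token):
    """Turn a token like "[1,2]" into [1, 2]; "[]" becomes []."""
    match = OSD_SET.match(token)
    if match is None:
        raise ValueError("not an OSD set: %r" % token)
    body = match.group(1)
    return [int(osd) for osd in body.split(",")] if body else []


def under_mapped_pgs(dump_text, min_size=2):
    """Return (pgid, up, acting) for rows whose acting set has fewer
    than min_size OSDs. min_size=2 is an assumed threshold, not a
    value taken from this cluster's configuration."""
    found = []
    for line in dump_text.splitlines():
        fields = line.split()
        if not fields:
            continue
        # Assumption: the first two bracketed tokens are up and acting.
        sets = [parse_osd_set(f) for f in fields if OSD_SET.match(f)]
        if len(sets) < 2:
            continue  # header or unrelated line
        up, acting = sets[0], sets[1]
        if len(acting) < min_size:
            found.append((fields[0], up, acting))
    return found


if __name__ == "__main__":
    for pgid, up, acting in under_mapped_pgs(PG_DUMP_ROWS):
        print(pgid, "up", up, "acting", acting)
```

Locating the sets by their bracketed shape rather than by a fixed column index keeps the sketch tolerant of pg dump layouts that add or reorder columns, at the cost of relying on up and acting being the only bracketed fields.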

History

#1 Updated by Sage Weil almost 9 years ago

  • Status changed from New to Can't reproduce

This predates the prior set refactor and the current round of thrashing fixes.
