Bug #23006
closedrepair_test.yaml fails reproducibly in jewel integration testing
0%
Description
Failure reason looks like this: "2018-02-07 09:08:23.888224 osd.3 172.21.15.148:6808/25279 13 : cluster [ERR] 5.0 : soid 5:a0216fbc:::repair_test_obj:head size 256 != size 1 from shard 3" in cluster log
The failure is reproducible in the current 10.2.11 integration branch (wip-jewel-backports), and a similar failure appeared in a 10.2.6 integration run.
See http://tracker.ceph.com/issues/21742#note-14 for the list of PRs included in the 10.2.11 integration branch.
Logs 10.2.11:
- http://pulpito.ceph.com/smithfarm-2018-02-06_21:07:15-rados-wip-jewel-backports-distro-basic-smithi/2160655/
- http://pulpito.ceph.com/smithfarm-2018-02-06_21:07:15-rados-wip-jewel-backports-distro-basic-smithi/2160751
- http://pulpito.ceph.com/smithfarm-2018-02-15_10:48:02-rados-wip-jewel-backports-distro-basic-smithi/2191705/
- http://pulpito.ceph.com/smithfarm-2018-02-15_10:48:02-rados-wip-jewel-backports-distro-basic-smithi/2191704/
The 10.2.6 failure text was ""2017-01-14 09:13:29.649276 osd.3 172.21.15.44:6800/865390 12 : cluster [ERR] 5.0 shard 3: soid 5:a0216fbc:::repair_test_obj:head size 1 != size 223 from auth oi 5:a0216fbc:::repair_test_obj:head(20'1 client.4352.0:1 dirty|data_digest|omap_digest s 223 uv 1 dd 9a3a59aa od ffffffff)" in cluster log"
Log 10.2.6:
- http://pulpito.ceph.com/loic-2017-01-12_15:26:07-rados-wip-jewel-backports-distro-basic-smithi/712073/ (10.2.6 integration testing)