Actions
Bug #13699
closedkrbd: crash under pblio benchmark
Status:
Can't reproduce
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:
0%
Source:
other
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Crash signature (v1):
Crash signature (v2):
Description
Kernel version: unknown, running in aws, with provisioned iops volumes beneath the osds
Running pblio (https://github.com/pblcache/pblcache/wiki/Pblio) with three different rbd devices as ASU1, 2, and 3, and increasing the BSU parameter, caused the client to crash with BSU > 100. No stacktrace is available, the vm simply hung. This was reproduced a few times.
Hypothesis: past 100 BSUs, the test starts to read and write the same blocks, which is usually masked by the page cache/fs on top of rbd. Reproducing this with osds running on tmpfs or memstore may work.
Actions