Bug #21430
closedceph-fuse blocked OSD op threads => OSD restart loop
0%
Description
I can now seemingly easily reproduce a trigger of OSD meltdowns from what seems to be blocked OSD op threads, using a single ceph-fuse write client (only client mounted the cephfs).
Details:
Single MDS
Single MON
5 BlueStore OSD
CephFS EC base pool (no file layout tricks)
Version all around: 12.2.0
Dist: Debian
Write operation: rsync of a maildir folder, files like "VMs/ncis.millnert.se/home/vmail/millnert.se/martin/.Archive.Chalmers.2008-Inbox/cur/1397939074.M5617P1519V000000000000FD06I00000000000E6107_2081.ncis,S=8596431:2,"
After blocking the writes, I appear to sometimes have been able to simply ctrl+c them and then recover. Other times and usually I need to systemctl restart the blocked OSD in question. I never appear to need to break the mount and remount.
I figure since this is a pretty straighforward setup and I'm seemingly able to easily reproduce it we could swiftly get at the core of the issue?
Files