Project

General

Profile

Actions

Bug #17536

closed

Extremely rare Qemu hang with suspicion that RBD might be the issue

Added by Christian Theune over 7 years ago. Updated over 7 years ago.

Status:
Can't reproduce
Priority:
Normal
Assignee:
-
Target version:
-
% Done:

0%

Source:
other
Tags:
Backport:
Regression:
No
Severity:
2 - major
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Hi,

we are experiencing stalled IO on a VM every few weeks (sigh) and I don't have enough space to log everything on high.
The VM's disk is pretty large (around 8TiB) and I've seen that our backup is running rbddiff while it stalled. The diff took longer than usually when this last happened: normally about 13 hours, this time almost 24 hours.

I'd love to provide reasonable logging but would need a little bit of help to adjust the logging properly: turning everything to 20 wouldn't be helpful if we need to let this run for a couple of weeks ... :/

Actions

Also available in: Atom PDF