Bug #4079

closed

osd: journal aio deadlock

Added by Alex Elder about 11 years ago. Updated about 11 years ago.

Status: Resolved
Priority: Urgent
Assignee:
Category: -
Target version: -
% Done: 0%
Source: Development
Tags:
Backport:
Regression:
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

I don't really understand this yet. I have seen it occurring
with the new request code. I thought there could be a
connection with http://tracker.ceph.com/issues/3936 but maybe
that was just wishful thinking.

Starting last week, I found that my normal rbd xfstests run
hits a problem around the time it runs test 074 or 075.
When the problem occurs, the test appears to hang. Given
enough time, however, it does make some forward progress;
tests just take a HUGE amount of time to complete (i.e.,
hours rather than seconds or minutes).

I also notice log messages like those shown below (the
"Mounting Filesystem" and "Ending clean mount" messages
are normal when running these tests).

This is a fairly serious problem, and it is preventing me
from completing my normal test runs (well, within a reasonable
amount of time).

[ 4761.797182] XFS (rbd2): Mounting Filesystem
[ 4765.918310] XFS (rbd2): Ending clean mount
[ 4767.139085] XFS (rbd2): Mounting Filesystem
[ 4771.009442] XFS (rbd2): Ending clean mount
[ 5665.686530] libceph: osd0 10.214.133.23:6800 socket closed (con state OPEN)
[ 6101.597699] XFS (rbd2): Mounting Filesystem
[ 6101.899587] XFS (rbd2): Ending clean mount
[ 6875.996583] XFS (rbd2): Mounting Filesystem
[ 6880.217723] XFS (rbd2): Ending clean mount
[ 7860.758422] libceph: osd0 10.214.133.23:6800 socket closed (con state OPEN)
[ 8759.535196] libceph: osd0 10.214.133.23:6800 socket closed (con state OPEN)
[10927.275859] libceph: osd0 10.214.133.23:6800 socket closed (con state OPEN)
[12131.669510] libceph: osd0 10.214.133.23:6800 socket closed (con state OPEN)
[14428.139118] libceph: osd0 10.214.133.23:6800 socket closed (con state OPEN)
[15326.917246] libceph: osd0 10.214.133.23:6800 socket closed (con state OPEN)
[16954.044979] libceph: osd0 10.214.133.23:6800 socket closed (con state OPEN)
[16954.070327] libceph: osd0 10.214.133.23:6800 socket error on read
[19011.107288] libceph: osd0 10.214.133.23:6800 socket closed (con state OPEN)
[21159.809958] XFS (rbd2): Mounting Filesystem
[21164.052011] XFS (rbd2): Ending clean mount
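
To put a number on the slowdown, the kernel timestamps in a log like the one above can be compared directly. The following is only a rough sketch: the function name `mount_gaps` and the regular expression are my own, and it assumes each line carries a `[seconds.microseconds]` prefix as shown here. It reports the time between each "Ending clean mount" and the next "Mounting Filesystem".

```python
import re

# Assumed line shape: "[ 4761.797182] XFS (rbd2): Mounting Filesystem"
LINE_RE = re.compile(r"^\[\s*(\d+\.\d+)\]\s+(.*)$")

# A few lines copied from the log in this report.
SAMPLE = """\
[ 4765.918310] XFS (rbd2): Ending clean mount
[ 4767.139085] XFS (rbd2): Mounting Filesystem
[ 4771.009442] XFS (rbd2): Ending clean mount
[ 5665.686530] libceph: osd0 10.214.133.23:6800 socket closed (con state OPEN)
[ 6101.597699] XFS (rbd2): Mounting Filesystem
"""


def mount_gaps(log_text):
    """Return the seconds between each clean-mount end and the next mount start."""
    gaps = []
    last_end = None
    for line in log_text.splitlines():
        match = LINE_RE.match(line.strip())
        if not match:
            continue  # skip anything that is not a timestamped kernel line
        timestamp, message = float(match.group(1)), match.group(2)
        if message.endswith("Ending clean mount"):
            last_end = timestamp
        elif message.endswith("Mounting Filesystem") and last_end is not None:
            gaps.append(timestamp - last_end)
            last_end = None
    return gaps


if __name__ == "__main__":
    for gap in mount_gaps(SAMPLE):
        print(f"{gap:.1f} s between mounts")
```

On the sample lines, the first gap is about a second and the second is over twenty minutes, which matches the "hours rather than seconds or minutes" behavior described above. The gaps only show where time was spent, not why.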
