Bug #4079 (Closed)
osd: journal aio deadlock
Added by Alex Elder about 11 years ago. Updated about 11 years ago.
Description
I don't really understand this yet. I have seen it occurring
with the new request code. I thought there could be a
connection with http://tracker.ceph.com/issues/3936 but maybe
that was just wishful thinking.
Starting last week I found that my normal rbd xfstests run
is hitting a problem around the time it runs test 074 or 075.
When this problem occurs, the test seems to be hung. However,
if you give it enough time you see that some forward progress
is made, it just takes a HUGE amount of time before tests
complete (i.e., hours rather than seconds or minutes).
I also notice that log messages like those shown below
show up (the "Mounting Filesystem" and "Ending clean mount"
messages are normal when running these tests).
This is a fairly serious problem, and it is preventing me
from completing my normal test runs (well, within a reasonable
amount of time).
[ 4761.797182] XFS (rbd2): Mounting Filesystem
[ 4765.918310] XFS (rbd2): Ending clean mount
[ 4767.139085] XFS (rbd2): Mounting Filesystem
[ 4771.009442] XFS (rbd2): Ending clean mount
[ 5665.686530] libceph: osd0 10.214.133.23:6800 socket closed (con state OPEN)
[ 6101.597699] XFS (rbd2): Mounting Filesystem
[ 6101.899587] XFS (rbd2): Ending clean mount
[ 6875.996583] XFS (rbd2): Mounting Filesystem
[ 6880.217723] XFS (rbd2): Ending clean mount
[ 7860.758422] libceph: osd0 10.214.133.23:6800 socket closed (con state OPEN)
[ 8759.535196] libceph: osd0 10.214.133.23:6800 socket closed (con state OPEN)
[10927.275859] libceph: osd0 10.214.133.23:6800 socket closed (con state OPEN)
[12131.669510] libceph: osd0 10.214.133.23:6800 socket closed (con state OPEN)
[14428.139118] libceph: osd0 10.214.133.23:6800 socket closed (con state OPEN)
[15326.917246] libceph: osd0 10.214.133.23:6800 socket closed (con state OPEN)
[16954.044979] libceph: osd0 10.214.133.23:6800 socket closed (con state OPEN)
[16954.070327] libceph: osd0 10.214.133.23:6800 socket error on read
[19011.107288] libceph: osd0 10.214.133.23:6800 socket closed (con state OPEN)
[21159.809958] XFS (rbd2): Mounting Filesystem
[21164.052011] XFS (rbd2): Ending clean mount
Updated by Alex Elder about 11 years ago
Below is a chart showing each test number, the time from an
example run with results in line with what's expected, and
the time from a slow run I've seen. Not all tests
are slowed down, and that may give some information about
what is affected by the problem. Variability in the test
time is normal, but it's normally a few seconds off, not
a factor of 100.
Test Time   Slow    Test Time   Slow    Test Time   Slow    Test Time   Slow
---- ----   ----    ---- ----   ----    ---- ----   ----    ---- ----   ----
001    4s   123s    002    0s     0s    003    0s     0s    004    5s   510s
005    1s     0s    006    2s     1s    007    1s     1s    008    5s     3s
009    4s   485s    011    1s     6s    012    2s    16s    013  103s  9642s
014  219s  5381s    015    3s   203s    017   33s  5760s    019    3s   499s
020    3s     3s    021    2s   489s    026   15s   550s    027   15s   560s
028   25s   506s    029    2s   459s    031   51s  7736s    032   47s   855s
033   29s  3279s    034    4s   511s    041   77s  5271s    046   14s   512s
047   25s   507s    048    1s     0s    050   28s  3591s    051    1s     1s
052    3s   496s    053    3s   488s    054   15s  2434s    056   14s   552s
061   13s   549s    063   15s   551s    064   60s  1021s    065   40s   983s
066    4s   499s    067    6s   927s    069    7s   484s    070   27s  4876s
074  329s  3076s    075  363s 39375s    076   41s  3352s    078   70s  1330s
079    6s   495s    084   59s   136s    085    4s   498s    086  154s 15205s
087   52s  4976s    088    0s     0s    089    8s    16s    091  862s 15408s
092    4s   431s    100   18s   303s    103    2s   431s    105    3s   457s
108   13s   728s    110   38s  2217s    116    5s   488s    117   33s  5670s
118    7s   457s    119   10s  1394s    120   19s   472s    121   10s   488s
124    7s   530s    126    0s     0s    129    5s   435s    130   56s   807s
131    1s     1s    132   21s   468s    133 1149s 84964s    134    5s   469s
135    4s   474s    137   16s   489s    138   53s  2348s    139   51s  2390s
140   47s  2349s    141    3s   464s    164    0s     1s    165    1s     0s
166    2s   464s    167  333s  8033s    182   18s   535s    184    0s     0s
187   11s  1773s    188   20s   810s    189    6s   601s    190    4s   439s
192   40s    66s    194    3s   453s    196    6s   474s    199    3s   469s
201   58s  3615s    203    4s   464s    214    1s    36s    215    2s     2s
216   55s  3922s    220    6s   938s    221    1s     1s    222    4s    92s
223   31s   949s    224   39s   451s    225   30s   997s    226   96s  8298s
227  614s 76177s    234   14s   780s    236    1s     1s    237    0s     0s
238    4s   485s    241   73s   242s    243    1s     0s    244   13s  1424s
245    0s     0s    246    0s     1s    247   93s  2689s    248    0s     0s
249    2s    19s    253   11s   978s    257    0s     1s    258    1s    34s
259   51s   560s    261    2s    92s    262    5s   509s    269  419s  5961s
277    5s   498s    278    6s   835s    280    5s   505s    285    1s     0s
286   11s    57s
Updated by Alex Elder about 11 years ago
- Status changed from New to In Progress
I tried running just tests 74 and 75 this morning, and running
just those two (twice each, as well as doing that several times
in a row) did not reproduce this behavior. Test 74 takes about
290+ seconds for me and test 75 takes roughly 230+ seconds.
I just tried running tests 50-80 (which ends up running some
tests we normally don't run from that range) and it looks like
I've hit it.
Tests 55 and 57-60 were not run. Test 62 (normally not run) had
errors. Tests 63-72 ran successfully (including 68, which is
normally not run). 73 produced an output mismatch. Then test
74 took 1524 seconds (rather than 290) and test 75 is still
running.
Looking at the client machine, this is the exact process
running right now:
0 0 19660 18907 20 0 24820 21216 sleep_ D ? 0:03 /var/lib/xfstests/ltp/fsx -d -N 10000 -l 10485760 -S 0 -P /var/lib/xfstests 075.2
That seems to be operating on this file:
/tmp/cephtest/test_dir.96W7qOfsGy/fsx/075.2
And attempting to read that file hangs on the read, or did
initially.
Using dynamic debug was showing no activity in rbd.
I just turned on dynamic debug for libceph and got some
output. I'll go take a look at that.
Updated by Alex Elder about 11 years ago
I haven't done a lot of this, but here's something I find in
the client log after turning on all debug output for the
libceph kernel module.
This message gets sent to osd 1:
[40948.126518] libceph: ----- ffff88020cb9a200 to osd1 42=osd_op len 139+0+32768 -----
Above that we see:
[40948.126510] libceph: calc_object_layout 'rb.0.1008.238e1f29.000000000095' pgid 2.7938354d
[40948.126515] libceph: send_request ffff88020cbcfc00 tid 74425 to osd1 flags 36
So this is a WRITE ONDISK request with 32KB of data to object *00095.
Note transaction id is 74425.
That message gets written successfully in 0.450628 seconds:
[40948.946834] libceph: prepare_write_message ffff88020cb9a200 seq 14941 type 42 len 139+0+32768 0 pgs
[40949.010656] libceph: prepare_write_message front_crc 3377833246 middle_crc 0
[40949.046656] libceph: try_write out_kvec_bytes 202
[40949.080050] libceph: write_partial_kvec ffff880221d7b030 202 left
[40949.115100] libceph: write_partial_kvec ffff880221d7b030 0 left in 0 kvecs ret = 1
[40949.180145] libceph: write_partial_msg_pages ffff880221d7b030 msg ffff88020cb9a200 page 0/0 offset 0
[40949.250134] libceph: write_partial_msg_pages ffff880221d7b030 msg ffff88020cb9a200 done
[40949.321297] libceph: prepare_write_message_footer ffff880221d7b030
[40949.359660] libceph: write_partial_kvec ffff880221d7b030 13 left
[40949.397462] libceph: write_partial_kvec ffff880221d7b030 0 left in 0 kvecs ret = 1
Nothing really happens then for about 3 seconds, until:
[40952.142028] libceph: ceph_sock_data_ready on ffff880221d7b030 state = 5, queueing work
The tag we get is 7, which indicates an incoming message, and then
we learn the message is type 43, CEPH_MSG_OSD_OPREPLY.
[40952.626335] libceph: ===== ffff88020cb9ba00 14941 from osd1 43=osd_opreply len 117+0 (2055398074 0 0) =====
[40952.688569] libceph: handle_reply ffff88020cb9ba00 tid 74425 result 0
It's the same transaction id, 74425 as the write we just sent.
Should I expect a 32KB write request to take 2.744566 seconds
to complete?
Updated by Alex Elder about 11 years ago
And finally test 75 completed, taking 18,723 seconds,
more than 5 hours.
Again, forward progress, absurdly slow.
I could use some help looking at what's going on.
Updated by Alex Elder about 11 years ago
As Sage suggested, I tried with an older kernel. Actually,
I just tried it with the newly updated master branch, which
is based on 3.8-rc5 and includes basically everything in
the testing branch except the new rbd request code.
3ebc21f7 libceph: fix messenger CONFIG_BLOCK dependencies
These tests ran: 50-54 56 61 63-72 74-76 78 79
And the time for tests 74 and 75 (and 76):
074 324s
075 253s
076 31s
I then switched to the testing branch.
07741308 rbd: add barriers near done flag operations
The same tests were run. Results:
074 307s
075 199s
076 30s
I'm going to keep trying. Obviously it doesn't always
happen.
Updated by Alex Elder about 11 years ago
I should also mention, I'm running against the current "next"
branch from the ceph repository. At the moment, that is:
0942e005 OSD: always activate_map in advance_pgs, only send messages if up
And for what it's worth, I'm running with this for teuthology:
0c663ca8 schedule_suite.sh: ensure ceph and kernel branches exist
Updated by Alex Elder about 11 years ago
Is there any chance clock skew could have an impact?
I just got these messages. I don't suppose this is a
problem but I thought I'd mention it.
INFO:teuthology.task.ceph.mon.b.err:2013-02-14 15:34:32.740303 7f73b1fa8700 -1 mon.b@1(peon).paxos(mdsmap active c 1..5) lease_expire from mon.0 10.214.131.10:6789/0 is 0.053851 seconds in the past; mons are laggy or clocks are too skewed
INFO:teuthology.task.ceph.mon.b.err:2013-02-14 15:34:32.740423 7f73b1fa8700 -1 mon.b@1(peon).paxos(monmap active c 1..1) lease_expire from mon.0 10.214.131.10:6789/0 is 0.053820 seconds in the past; mons are laggy or clocks are too skewed
INFO:teuthology.task.ceph.mon.b.err:2013-02-14 15:34:32.740526 7f73b1fa8700 -1 mon.b@1(peon).paxos(auth active c 1..17) lease_expire from mon.0 10.214.131.10:6789/0 is 0.053785 seconds in the past; mons are laggy or clocks are too skewed
Updated by Alex Elder about 11 years ago
I am running xfstests 258 and 259 over rbd, repeating
10 times. The first three iterations took about 1 second
and 53 seconds for the tests, respectively. Then on the
fourth iteration it started taking about 10 seconds and
300+ seconds for them.
So I did some debug tracing on the client.
Here is an example, with extraneous stuff pulled out:
[42111.355376] libceph: send_request ffff880110be8400 tid 48315 to osd4 flags 36
[42111.453278] libceph: ----- ffff88020b8f0c00 to osd4 42=osd_op len 139+0+8192 -----
[42115.456282] libceph: prepare_write_message ffff88020b8f0c00 seq 3186 type 42 len 139+0+8192 0 pgs
This means it took 0.097902 seconds from the osd client
reporting it was going to send the request until the messenger
saw it, and then 4.003004 more seconds between the messenger
queueing the message and beginning to put it on the wire.
Note that after reporting this message (----- ffff88...) there is
a test_and_set_bit() to see if we need to reschedule con_work().
Could this be yet another read/write barrier issue? If so
we seem to have some GREAT hardware for finding these.
I'm going to try an experiment to see if I can prove this
happens.
Updated by Alex Elder about 11 years ago
Well, it absolutely looks like it can happen.
I found cases where different CPUs set and cleared
the WRITE_PENDING bit.
I'm going to add some memory barriers to see if I
can make the delays go away. I seem to be able to
reproduce the slowness (300 seconds rather than 50)
on test 259 fairly reliably.
Updated by Alex Elder about 11 years ago
For a while I was pretty hopeful, because it appeared that
with these barriers in place I was getting slowness less
often. I had been getting it pretty reproducibly after one or
two passes running tests 258 and 259. The first time I ran
with barriers in place it didn't get slow for 10 consecutive
passes. The next one got fairly slow during the second pass.
The last time I ran them I got 8 passes of 1 second/55 seconds
or so, then pass 9 with 1 second/148 seconds, then the 10th
pass took 146 seconds for test 258 (rather than about 1 second)
and 425 seconds for test 259.
So it's possible the barrier thing is improving things, but
it's not at all conclusive. And even if it does, there
is more to it than just that making things slow.
Updated by Alex Elder about 11 years ago
I don't think the barrier improved anything.
All of those state changes should be happening
under protection of a mutex, so there should
be no need for a barrier.
The next thing I found yesterday was that 0-length
requests were handled differently in the new request
code. Previously they were ignored, whereas the new
code ended up submitting them as 0-length object
requests. I believe that should have been
harmless, but just to be sure I made a change to
ignore 0-length requests. Once again, things
look OK for a while, then they get slow.
I have been noticing that when things get slow
there is a "mkfs.xfs" process running, and it
appears to be spending a lot of time in one of
two places. Example stack traces of each are
below. I'm using kdb this morning to try to
learn more detail about what's going on.
root@plana27:~# cat /proc/17552/stack
[<ffffffff81314f31>] blkdev_issue_flush+0x141/0x1a0
[<ffffffff811bd45e>] blkdev_fsync+0x3e/0x50
[<ffffffff811b443d>] do_fsync+0x5d/0x90
[<ffffffff811b4840>] sys_fsync+0x10/0x20
[<ffffffff81666f19>] system_call_fastpath+0x16/0x1b
[<ffffffffffffffff>] 0xffffffffffffffff
root@plana27:~#
root@plana27:~# cat /proc/7240/stack
[<ffffffff811c13b7>] do_blockdev_direct_IO+0x1947/0x2060
[<ffffffff811c1b25>] __blockdev_direct_IO+0x55/0x60
[<ffffffff811bd047>] blkdev_direct_IO+0x57/0x60
[<ffffffff81131b80>] generic_file_direct_write+0xd0/0x190
[<ffffffff81131f1d>] __generic_file_aio_write+0x2dd/0x3b0
[<ffffffff811bd912>] blkdev_aio_write+0x52/0xc0
[<ffffffff81183843>] do_sync_write+0xa3/0xe0
[<ffffffff81183ef3>] vfs_write+0xb3/0x180
[<ffffffff811843ba>] sys_pwrite64+0x9a/0xa0
[<ffffffff81666f19>] system_call_fastpath+0x16/0x1b
[<ffffffffffffffff>] 0xffffffffffffffff
root@plana27:~#
Updated by Alex Elder about 11 years ago
After lots of failed attempts to fix this I have what
I hope to be something helpful.
I've added dout() calls to basically trace the process
of a request as it moves through rbd. I just capture
10 seconds worth of this during a period where I find
mkfs.xfs is stuck waiting for completion of direct I/O.
Here's that process:
16959 ? D 0:00 /sbin/mkfs.xfs -f -l su=32k /dev/rbd3
Here is its kernel stack trace before I started capturing
the rbd tracing info:
[<ffffffff811c13b7>] do_blockdev_direct_IO+0x1947/0x2060
[<ffffffff811c1b25>] __blockdev_direct_IO+0x55/0x60
[<ffffffff811bd047>] blkdev_direct_IO+0x57/0x60
[<ffffffff81131b80>] generic_file_direct_write+0xd0/0x190
[<ffffffff81131f1d>] __generic_file_aio_write+0x2dd/0x3b0
[<ffffffff811bd912>] blkdev_aio_write+0x52/0xc0
[<ffffffff81183843>] do_sync_write+0xa3/0xe0
[<ffffffff81183ef3>] vfs_write+0xb3/0x180
[<ffffffff811843ba>] sys_pwrite64+0x9a/0xa0
[<ffffffff81666f19>] system_call_fastpath+0x16/0x1b
[<ffffffffffffffff>] 0xffffffffffffffff
And here it is after that 10 seconds had passed (it's
the same as before).
[<ffffffff811c13b7>] do_blockdev_direct_IO+0x1947/0x2060
[<ffffffff811c1b25>] __blockdev_direct_IO+0x55/0x60
[<ffffffff811bd047>] blkdev_direct_IO+0x57/0x60
[<ffffffff81131b80>] generic_file_direct_write+0xd0/0x190
[<ffffffff81131f1d>] __generic_file_aio_write+0x2dd/0x3b0
[<ffffffff811bd912>] blkdev_aio_write+0x52/0xc0
[<ffffffff81183843>] do_sync_write+0xa3/0xe0
[<ffffffff81183ef3>] vfs_write+0xb3/0x180
[<ffffffff811843ba>] sys_pwrite64+0x9a/0xa0
[<ffffffff81666f19>] system_call_fastpath+0x16/0x1b
[<ffffffffffffffff>] 0xffffffffffffffff
Here is the tracing information that got dumped to
the log, with some added annotation. It walks through
an entire image request lifetime.
[ 2192.531728] rbd: rbd_osd_write_callback: ffff88020b259d00 result 0 512/512
[ 2192.531734] rbd: rbd_obj_request_complete: ffff88020b259d00 callback ffffffffa09d9e10
[ 2192.531738] rbd: rbd_img_obj_callback: img ffff88020ca36240 obj ffff88020b259d00
[ 2192.531741] rbd: rbd_img_obj_callback: which 0 next_compl 0 req_count 1
[ 2192.531760] rbd: rbd_img_request_complete: ffff88020ca36240
[ 2192.531763] rbd: rbd_img_request_destroy(ffff88020ca36240)
[ 2192.531765] rbd: rbd_img_obj_request_del(ffff88020ca36240, ffff88020b259d00) 0
[ 2192.531768] rbd: rbd_obj_request_destroy(ffff88020b259d00)
(Partial log of activity above, ignore it for now.)
First, a new image request gets created in response to a request
from the block layer. The data structure representing it is
located at ffff88020ca36840.
[ 2192.531845] rbd: rbd_img_request_create(ffff880219805800, 0x1bc01000, 0x1000, true) -> ffff88020ca36840
That request is split into object requests, each of which
goes to one object containing rbd image data. There is
only one in this case. The object request is at address
ffff88020b258700, and is added to the image request.
[ 2192.531851] rbd: rbd_img_request_fill_bio(ffff88020ca36840, ffff88020ca37d40)
[ 2192.531857] rbd: rbd_obj_request_create("rb.0.1021.238e1f29.00000000006f", 0x1000, 0x1000, 1) -> ffff88020b258700
[ 2192.531866] rbd: rbd_img_obj_request_add(ffff88020ca36840, ffff88020b258700) 0
The image request is submitted, and that causes all of its
object requests to be sent to their target osds.
[ 2192.531868] rbd: rbd_img_request_submit(ffff88020ca36840)
[ 2192.531871] rbd: rbd_obj_request_submit(ffff88020ca3e960, ffff88020b258700)
Now we wait until we get notified of completion. Eventually,
net/ceph/osd_client.c:handle_reply() gets called when the
messenger receives a message destined for the osd client.
Once that has done its processing, it calls the request's
r_callback function, which will always be rbd_osd_req_callback()
for rbd requests.
That function fans out to one of several functions depending
on which type of request it was. In this case it was a
write:
[ 2197.581086] rbd: rbd_osd_write_callback: ffff88020b258700 result 0 4096/4096
The osd write request will now be marked done, so the object
request is completed:
[ 2197.581091] rbd: rbd_obj_request_complete: ffff88020b258700 callback ffffffffa09d9e10
Now that the object request is complete, its callback function
is called. For object requests attached to image requests,
this will be rbd_img_obj_callback():
[ 2197.581094] rbd: rbd_img_obj_callback: img ffff88020ca36840 obj ffff88020b258700
And this is object request 0 for that image, which is the
next one we're waiting to complete for this image request,
and there is only one such object request.
[ 2197.581098] rbd: rbd_img_obj_callback: which 0 next_compl 0 req_count 1
Since that was the last (only) object request for this
image, we've completed the entire image request, so its
completion routine is called:
[ 2197.581109] rbd: rbd_img_request_complete: ffff88020ca36840
This image request has no callback routine, so we're done.
We just drop a reference on this image request, which
turns out to be the last reference, so it gets destroyed.
[ 2197.581112] rbd: rbd_img_request_destroy(ffff88020ca36840)
...including destroying all its component object requests.
[ 2197.581115] rbd: rbd_img_obj_request_del(ffff88020ca36840, ffff88020b258700) 0
[ 2197.581118] rbd: rbd_obj_request_destroy(ffff88020b258700)
And that's it.
Now we start again with the next one.
[ 2197.581185] rbd: rbd_img_request_create(ffff880219805800, 0x1bc02000, 0x1000, true) -> ffff88020ca37a40
[ 2197.581191] rbd: rbd_img_request_fill_bio(ffff88020ca37a40, ffff88020ca37d40)
[ 2197.581197] rbd: rbd_obj_request_create("rb.0.1021.238e1f29.00000000006f", 0x2000, 0x1000, 1) -> ffff88020b259700
[ 2197.581206] rbd: rbd_img_obj_request_add(ffff88020ca37a40, ffff88020b259700) 0
[ 2197.581209] rbd: rbd_img_request_submit(ffff88020ca37a40)
[ 2197.581211] rbd: rbd_obj_request_submit(ffff88020ca3e960, ffff88020b259700)
OK, now let's look at the important two lines, which are
consecutive in the tracing output:
[ 2192.531871] rbd: rbd_obj_request_submit(ffff88020ca3e960, ffff88020b258700)
[ 2197.581086] rbd: rbd_osd_write_callback: ffff88020b258700 result 0 4096/4096
The time difference between these is 5.049215 seconds.
The first function consists of this:
return ceph_osdc_start_request(osdc, obj_request->osd_req, false);
And the second one is as described above, called back
immediately after receiving a response from the request
sent to the osd.
This is why I think I can conclude that something's going
on in the osd (or rados) that is causing these long delays.
I will try to do the same thing, but with full debug logging
so we can see all the messenger and osd client activity between
those two lines I highlighted above.
Updated by Alex Elder about 11 years ago
The process:
20767 ? D 0:00 /sbin/mkfs.xfs -f -l su=32k /dev/rbd3
Its "before" kernel stack trace:
[<ffffffff811c13b7>] do_blockdev_direct_IO+0x1947/0x2060
[<ffffffff811c1b25>] __blockdev_direct_IO+0x55/0x60
[<ffffffff811bd047>] blkdev_direct_IO+0x57/0x60
[<ffffffff81131b80>] generic_file_direct_write+0xd0/0x190
[<ffffffff81131f1d>] __generic_file_aio_write+0x2dd/0x3b0
[<ffffffff811bd912>] blkdev_aio_write+0x52/0xc0
[<ffffffff81183843>] do_sync_write+0xa3/0xe0
[<ffffffff81183ef3>] vfs_write+0xb3/0x180
[<ffffffff81184232>] sys_write+0x52/0xa0
[<ffffffff81666f19>] system_call_fastpath+0x16/0x1b
[<ffffffffffffffff>] 0xffffffffffffffff
Its "after" kernel stack trace:
[<ffffffff811c13b7>] do_blockdev_direct_IO+0x1947/0x2060
[<ffffffff811c1b25>] __blockdev_direct_IO+0x55/0x60
[<ffffffff811bd047>] blkdev_direct_IO+0x57/0x60
[<ffffffff81131b80>] generic_file_direct_write+0xd0/0x190
[<ffffffff81131f1d>] __generic_file_aio_write+0x2dd/0x3b0
[<ffffffff811bd912>] blkdev_aio_write+0x52/0xc0
[<ffffffff81183843>] do_sync_write+0xa3/0xe0
[<ffffffff81183ef3>] vfs_write+0xb3/0x180
[<ffffffff81184232>] sys_write+0x52/0xa0
[<ffffffff81666f19>] system_call_fastpath+0x16/0x1b
[<ffffffffffffffff>] 0xffffffffffffffff
And below is my analysis of the very verbose trace info.
We wait about 5 seconds for a response from the osd.
--------------------
New rbd image request: ffff88020b582a80
offset: 0x25808000 = 629178368; length: 0x40000 = 262144 (256KB); write: yes
--------------------
[ 3953.583136] rbd: rbd_img_request_create(ffff880219805800, 0x25808000, 0x40000, true) -> ffff88020b582a80
[ 3953.583140] rbd: rbd_img_request_fill_bio(ffff88020b582a80, ffff88020b583980)
--------------------
New rbd object request: ffff88020b582a80
object: "rb.0.1021.238e1f29.000000000096"; offset: 0x8000 = 32768; length: 0x40000 = 262144 (256KB); type: 1 = OBJ_REQUEST_BIO
--------------------
[ 3953.583145] rbd: rbd_obj_request_create("rb.0.1021.238e1f29.000000000096", 0x8000, 0x40000, 1) -> ffff880225f2a700
--------------------
OSD request allocated, including request and response messages;
the request message is ffff880225f2bd00
--------------------
[ 3953.583152] libceph: ceph_msg_new ffff880225f2b900 front 512
[ 3953.583155] libceph: ceph_msg_new ffff880225f2bd00 front 208
--------------------
object request added to image request, and submitted
--------------------
[ 3953.583159] rbd: rbd_obj_request_get ffff880225f2a700 (was 1)
[ 3953.583162] rbd: rbd_img_obj_request_add(ffff88020b582a80, ffff880225f2a700) 0
[ 3953.583164] rbd: rbd_img_request_submit(ffff88020b582a80)
[ 3953.583167] rbd: rbd_obj_request_submit(ffff88020ca3e960, ffff880225f2a700)
--------------------
ceph_osdc_start_request() registers and maps the request
--------------------
[ 3953.583171] libceph: __register_request ffff88020f185800 tid 65536
[ 3953.583174] libceph: first request, scheduling timeout
[ 3953.583179] libceph: map_request ffff88020f185800 tid 65536
[ 3953.583183] libceph: calc_object_layout 'rb.0.1021.238e1f29.000000000096' pgid 2.526e76bf
[ 3953.583190] libceph: map_request tid 65536 pgid 2.76bf osd1 (was osd-1)
[ 3953.583193] libceph: __remove_osd_from_lru ffff88020b595000
--------------------
__send_request() hands the request message to the messenger
--------------------
[ 3953.583196] libceph: send_request ffff88020f185800 tid 65536 to osd1 flags 36
--------------------
ceph_con_send() calls con->ops->get(con)
--------------------
[ 3953.583199] libceph: ceph_con_send: 2: mutex_lock(ffff88020b595030)
[ 3953.583202] libceph: get_osd ffff88020b595000 1 -> 2
--------------------
ceph_con_send() puts the message on the out queue
--------------------
[ 3953.583207] libceph: ----- ffff880225f2bd00 to osd1 42=osd_op len 139+0+262144 -----
[ 3953.583210] libceph: ceph_con_send: 2: mutex_unlock(ffff88020b595030)
--------------------
queue_con_delay() calls con->ops->get(con) (again)
...and ensures con_work will run
--------------------
[ 3953.583212] libceph: get_osd ffff88020b595000 2 -> 3
[ 3953.583220] libceph: queue_con_delay ffff88020b595030 0
--------------------
rbd drops an object request reference; done for now
--------------------
[ 3953.583224] rbd: rbd_obj_request_put ffff880225f2a700 (was 2)
--------------------
Now the messenger needs to send the message
--------------------
[ 3953.583232] libceph: con_work: 2: mutex_lock(ffff88020b595030)
--------------------
There is nothing incoming
--------------------
[ 3953.583236] libceph: try_read start on ffff88020b595030 state 5
[ 3953.583238] libceph: try_read tag 1 in_base_pos 0
[ 3953.583245] libceph: try_read done on ffff88020b595030 ret 0
--------------------
The only outgoing request is the one we built above, ffff880225f2bd00
--------------------
[ 3953.583247] libceph: try_write start ffff88020b595030 state 5
[ 3953.583250] libceph: try_write out_kvec_bytes 0
[ 3953.583253] libceph: prepare_write_message ffff880225f2bd00 seq 2 type 42 len 139+0+262144 0 pgs
[ 3953.583258] libceph: prepare_write_message front_crc 541896034 middle_crc 0
[ 3953.583260] libceph: try_write out_kvec_bytes 193
--------------------
Write out the header and middle
--------------------
[ 3953.583263] libceph: write_partial_kvec ffff88020b595030 193 left
[ 3953.583270] libceph: write_partial_kvec ffff88020b595030 0 left in 0 kvecs ret = 1
--------------------
Write out some of the bio data
--------------------
[ 3953.583273] libceph: write_partial_msg_pages ffff88020b595030 msg ffff880225f2bd00 page 0/0 offset 0
[ 3953.583399] libceph: try_write done on ffff88020b595030 ret 0
--------------------
con_work() is done, call con->ops->put(con)
--------------------
[ 3953.583405] libceph: put_osd ffff88020b595000 3 -> 2
--------------------
messenger gets notified the socket has space to write
--------------------
[ 3953.583959] libceph: ceph_sock_write_space ffff88020b595030 queueing write work
--------------------
con_work() fires again
--------------------
[ 3953.583965] libceph: get_osd ffff88020b595000 2 -> 3
[ 3953.583973] libceph: queue_con_delay ffff88020b595030 0
[ 3953.583985] libceph: con_work: 0: mutex_lock(ffff88020b595030)
--------------------
nothing to read
--------------------
[ 3953.583990] libceph: try_read start on ffff88020b595030 state 5
[ 3953.583992] libceph: try_read tag 1 in_base_pos 0
[ 3953.583998] libceph: try_read done on ffff88020b595030 ret 0
--------------------
write out some more of the bio data from the same message
--------------------
[ 3953.584001] libceph: try_write start ffff88020b595030 state 5
[ 3953.584003] libceph: try_write out_kvec_bytes 0
[ 3953.584007] libceph: write_partial_msg_pages ffff88020b595030 msg ffff880225f2bd00 page 13/0 offset 2086
[ 3953.584104] libceph: try_write done on ffff88020b595030 ret 0
[ 3953.584108] libceph: con_work: 0: mutex_unlock(ffff88020b595030)
[ 3953.584111] libceph: put_osd ffff88020b595000 3 -> 2
--------------------
the socket has space to write again
--------------------
[ 3953.584928] libceph: ceph_sock_write_space ffff88020b595030 queueing write work
[ 3953.584933] libceph: get_osd ffff88020b595000 2 -> 3
[ 3953.584940] libceph: queue_con_delay ffff88020b595030 0
[ 3953.584952] libceph: con_work: 0: mutex_lock(ffff88020b595030)
--------------------
nothing to read
--------------------
[ 3953.584956] libceph: try_read start on ffff88020b595030 state 5
[ 3953.584959] libceph: try_read tag 1 in_base_pos 0
[ 3953.584965] libceph: try_read done on ffff88020b595030 ret 0
[ 3953.584967] libceph: try_write start ffff88020b595030 state 5
[ 3953.584969] libceph: try_write out_kvec_bytes 0
--------------------
write out some more of the bio data from the same message
--------------------
[ 3953.584973] libceph: write_partial_msg_pages ffff88020b595030 msg ffff880225f2bd00 page 25/0 offset 718
[ 3953.585097] libceph: try_write done on ffff88020b595030 ret 0
[ 3953.585101] libceph: con_work: 0: mutex_unlock(ffff88020b595030)
[ 3953.585104] libceph: put_osd ffff88020b595000 3 -> 2
--------------------
and again--more space, nothing to read, write more from the bio
--------------------
[ 3953.585546] libceph: ceph_sock_write_space ffff88020b595030 queueing write work
[ 3953.585550] libceph: get_osd ffff88020b595000 2 -> 3
[ 3953.585556] libceph: queue_con_delay ffff88020b595030 0
[ 3953.586209] libceph: con_work: 0: mutex_lock(ffff88020b595030)
[ 3953.586214] libceph: try_read start on ffff88020b595030 state 5
[ 3953.586216] libceph: try_read tag 1 in_base_pos 0
[ 3953.586222] libceph: try_read done on ffff88020b595030 ret 0
[ 3953.586225] libceph: try_write start ffff88020b595030 state 5
[ 3953.586227] libceph: try_write out_kvec_bytes 0
[ 3953.586230] libceph: write_partial_msg_pages ffff88020b595030 msg ffff880225f2bd00 page 40/0 offset 94
[ 3953.586412] libceph: try_write done on ffff88020b595030 ret 0
[ 3953.586416] libceph: con_work: 0: mutex_unlock(ffff88020b595030)
[ 3953.586419] libceph: put_osd ffff88020b595000 3 -> 2
--------------------
and again--more space, nothing to read, write the rest of the bio
--------------------
[ 3953.586620] libceph: ceph_sock_write_space ffff88020b595030 queueing write work
[ 3953.586625] libceph: get_osd ffff88020b595000 2 -> 3
[ 3953.586633] libceph: queue_con_delay ffff88020b595030 0
[ 3953.586645] libceph: con_work: 0: mutex_lock(ffff88020b595030)
[ 3953.586649] libceph: try_read start on ffff88020b595030 state 5
[ 3953.586652] libceph: try_read tag 1 in_base_pos 0
[ 3953.586658] libceph: try_read done on ffff88020b595030 ret 0
[ 3953.586660] libceph: try_write start ffff88020b595030 state 5
[ 3953.586663] libceph: try_write out_kvec_bytes 0
[ 3953.586666] libceph: write_partial_msg_pages ffff88020b595030 msg ffff880225f2bd00 page 61/0 offset 958
[ 3953.586690] libceph: write_partial_msg_pages ffff88020b595030 msg ffff880225f2bd00 done
--------------------
now write out the message footer
--------------------
[ 3953.586692] libceph: prepare_write_message_footer ffff88020b595030
[ 3953.586695] libceph: write_partial_kvec ffff88020b595030 13 left
[ 3953.586701] libceph: write_partial_kvec ffff88020b595030 0 left in 0 kvecs ret = 1
--------------------
the footer is written, and we're done
--------------------
[ 3953.586704] libceph: try_write nothing else to write.
[ 3953.586706] libceph: try_write done on ffff88020b595030 ret 0
[ 3953.586710] libceph: con_work: 0: mutex_unlock(ffff88020b595030)
[ 3953.586712] libceph: put_osd ffff88020b595000 3 -> 2
--------------------
messenger gets notified the socket has incoming data
--------------------
[ 3953.802155] libceph: ceph_sock_data_ready on ffff88020b595030 state = 5, queueing work
--------------------
con_work reads in the tag
--------------------
[ 3953.802161] libceph: get_osd ffff88020b595000 2 -> 3
[ 3953.802170] libceph: queue_con_delay ffff88020b595030 0
[ 3953.802187] libceph: con_work: 0: mutex_lock(ffff88020b595030)
[ 3953.802192] libceph: try_read start on ffff88020b595030 state 5
[ 3953.802194] libceph: try_read tag 1 in_base_pos 0
[ 3953.802201] libceph: try_read got tag 8
--------------------
the tag is 8, CEPH_MSGR_TAG_ACK, an ack for our message
--------------------
[ 3953.802204] libceph: prepare_read_ack ffff88020b595030
[ 3953.802210] libceph: got ack for seq 2 type 42 at ffff880225f2bd00
[ 3953.802213] libceph: ceph_msg_remove
[ 3953.802216] libceph: put_osd ffff88020b595000 3 -> 2
[ 3953.802219] libceph: prepare_read_tag ffff88020b595030
[ 3953.802221] libceph: try_read start on ffff88020b595030 state 5
[ 3953.802224] libceph: try_read tag 1 in_base_pos 0
[ 3953.802228] libceph: try_read done on ffff88020b595030 ret 0
[ 3953.802231] libceph: try_write start ffff88020b595030 state 5
[ 3953.802233] libceph: try_write out_kvec_bytes 0
[ 3953.802236] libceph: try_write nothing else to write.
[ 3953.802238] libceph: try_write done on ffff88020b595030 ret 0
[ 3953.802241] libceph: con_work: 0: mutex_unlock(ffff88020b595030)
[ 3953.802244] libceph: put_osd ffff88020b595000 2 -> 1
... now nothing for a while ...
--------------------
mon_client isn't hunting, so sends out a keepalive
--------------------
[ 3954.443971] libceph: monc delayed_work
[ 3954.443978] libceph: con_keepalive ffff88020ca3e410
[ 3954.443981] libceph: ceph_con_keepalive: 0: mutex_lock(ffff88020ca3e410)
[ 3954.443986] libceph: ceph_con_keepalive: 0: mutex_unlock(ffff88020ca3e410)
[ 3954.443991] libceph: queue_con_delay ffff88020ca3e410 0
--------------------
call __send_subscribe()
--------------------
[ 3954.443995] libceph: ceph_x_should_authenticate want=39 need=0 have=39
[ 3954.443998] libceph: ceph_x_is_authenticated want=39 need=0 have=39
[ 3954.444001] libceph: __send_subscribe sub_sent=0 exp=0 want_osd=2
[ 3954.444004] libceph: __schedule_delayed after 2000
--------------------
send out the monitor message
--------------------
[ 3954.444010] libceph: con_work: 0: mutex_lock(ffff88020ca3e410)
[ 3954.444014] libceph: try_read start on ffff88020ca3e410 state 5
[ 3954.444016] libceph: try_read tag 1 in_base_pos 0
[ 3954.444023] libceph: try_read done on ffff88020ca3e410 ret 0
[ 3954.444026] libceph: try_write start ffff88020ca3e410 state 5
[ 3954.444028] libceph: try_write out_kvec_bytes 0
[ 3954.444030] libceph: prepare_write_keepalive ffff88020ca3e410
[ 3954.444033] libceph: try_write out_kvec_bytes 1
[ 3954.444035] libceph: write_partial_kvec ffff88020ca3e410 1 left
[ 3954.444044] libceph: write_partial_kvec ffff88020ca3e410 0 left in 0 kvecs ret = 1
[ 3954.444047] libceph: try_write nothing else to write.
[ 3954.444049] libceph: try_write done on ffff88020ca3e410 ret 0
[ 3954.444052] libceph: con_work: 0: mutex_unlock(ffff88020ca3e410)
... and nothing for 4 more seconds ...
--------------------
now osd_client times out
--------------------
[ 3958.576811] libceph: timeout
[ 3958.576817] libceph: request_next_osdmap have 7
[ 3958.576823] libceph: tid 65536 is slow, will send keepalive on osd1
--------------------
handle_timeout() calls ceph_con_keepalive()
--------------------
[ 3958.576826] libceph: con_keepalive ffff88020b595030
[ 3958.576829] libceph: ceph_con_keepalive: 2: mutex_lock(ffff88020b595030)
[ 3958.576833] libceph: ceph_con_keepalive: 2: mutex_unlock(ffff88020b595030)
[ 3958.576836] libceph: get_osd ffff88020b595000 1 -> 2
[ 3958.576840] libceph: queue_con_delay ffff88020b595030 0
[ 3958.576844] libceph: __send_queued
--------------------
con_work() sends out the keepalive
--------------------
[ 3958.576848] libceph: con_work: 2: mutex_lock(ffff88020b595030)
[ 3958.576852] libceph: try_read start on ffff88020b595030 state 5
[ 3958.576855] libceph: try_read tag 1 in_base_pos 0
[ 3958.576861] libceph: try_read done on ffff88020b595030 ret 0
[ 3958.576864] libceph: try_write start ffff88020b595030 state 5
[ 3958.576866] libceph: try_write out_kvec_bytes 0
[ 3958.576868] libceph: prepare_write_keepalive ffff88020b595030
[ 3958.576871] libceph: try_write out_kvec_bytes 1
[ 3958.576873] libceph: write_partial_kvec ffff88020b595030 1 left
[ 3958.576892] libceph: write_partial_kvec ffff88020b595030 0 left in 0 kvecs ret = 1
[ 3958.576894] libceph: try_write nothing else to write.
[ 3958.576897] libceph: try_write done on ffff88020b595030 ret 0 [ 3958.576900] libceph: con_work: 2: mutex_unlock(ffff88020b595030) [ 3958.576903] libceph: put_osd ffff88020b595000 2 -> 1 -------------------- messenger gets notified the socket has incoming data -------------------- [ 3958.623971] libceph: ceph_sock_data_ready on ffff88020b595030 state = 5, queueing work -------------------- con_work reads in the tag, it's CEPH_MSGR_TAG_MSG -------------------- [ 3958.623977] libceph: get_osd ffff88020b595000 1 -> 2 [ 3958.623986] libceph: queue_con_delay ffff88020b595030 0 [ 3958.624002] libceph: con_work: 0: mutex_lock(ffff88020b595030) [ 3958.624006] libceph: try_read start on ffff88020b595030 state 5 [ 3958.624009] libceph: try_read tag 1 in_base_pos 0 [ 3958.624016] libceph: try_read got tag 7 -------------------- read in the message header--it's our response -------------------- [ 3958.624018] libceph: prepare_read_message ffff88020b595030 [ 3958.624021] libceph: read_partial_message con ffff88020b595030 msg (null) [ 3958.624027] libceph: got hdr type 43 front 117 data 0 [ 3958.624030] libceph: ceph_con_in_msg_alloc: 0: mutex_unlock(ffff88020b595030) [ 3958.624034] libceph: get_osd ffff88020b595000 2 -> 3 --- Finally, response to our request arrives [ 3958.624037] libceph: get_reply tid 65536 ffff880225f2b900 [ 3958.624040] libceph: ceph_con_in_msg_alloc: 0: mutex_lock(ffff88020b595030) -------------------- read in the message data -------------------- [ 3958.624043] libceph: get_osd ffff88020b595000 3 -> 4 [ 3958.624053] libceph: read_partial_message got msg ffff880225f2b900 117 (4140793455) + 0 (0) + 0 (0) [ 3958.624058] libceph: put_osd ffff88020b595000 4 -> 3 [ 3958.624063] libceph: process_message: 0: mutex_unlock(ffff88020b595030) [ 3958.624070] libceph: ===== ffff880225f2b900 2 from osd1 43=osd_opreply len 117+0 (4140793455 0 0) ===== [ 3958.624076] libceph: handle_reply ffff880225f2b900 tid 65536 result 0 [ 3958.624079] libceph: dropping 
con_filling_msg ref ffff88020b595030 [ 3958.624082] libceph: put_osd ffff88020b595000 3 -> 2 -------------------- result was good -------------------- [ 3958.624084] libceph: handle_reply result 0 bytes 0 [ 3958.624087] libceph: handle_reply tid 65536 flags 37 [ 3958.624089] libceph: __unregister_request ffff88020f185800 tid 65536 [ 3958.624092] libceph: moving osd to ffff88020b595000 lru [ 3958.624094] libceph: __move_osd_to_lru ffff88020b595000 [ 3958.624096] libceph: no requests, canceling timeout -------------------- and now we're back in rbd, getting called back -------------------- [ 3958.624101] rbd: rbd_osd_req_callback: ffff88020f185800 ffff880225f2b900 [ 3958.624104] rbd: rbd_osd_write_callback: ffff880225f2a700 result 0 262144/262144 [ 3958.624107] rbd: rbd_obj_request_complete: ffff880225f2a700 callback ffffffffa09d9e10 [ 3958.624110] rbd: rbd_img_obj_callback(ffff880225f2a700) 0 [ 3958.624113] rbd: rbd_img_obj_callback: img ffff88020b582a80 obj ffff880225f2a700 [ 3958.624116] rbd: rbd_img_obj_callback: which 0 next_compl 0 req_count 1 [ 3958.624132] rbd: rbd_img_request_complete: ffff88020b582a80 [ 3958.624135] rbd: rbd_img_request_put ffff88020b582a80 (was 1) [ 3958.624138] rbd: rbd_img_request_destroy(ffff88020b582a80) [ 3958.624141] rbd: rbd_img_obj_request_del(ffff88020b582a80, ffff880225f2a700) 0
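For readers unfamiliar with the messenger internals traced above: con_work() takes the connection mutex, try_read() dispatches on a one-byte tag from the socket (the trace shows tag 8 = CEPH_MSGR_TAG_ACK and tag 7 = CEPH_MSGR_TAG_MSG), and then try_write() flushes anything queued. A rough Python sketch of that dispatch cycle, purely as an illustration (the class and method names mirror the trace but this is not the kernel code):

```python
# Illustrative model of the con_work() read/write cycle in the trace.
# Tag values match those logged (7 = MSG, 8 = ACK); everything else is
# a simplification, not the actual net/ceph/messenger.c logic.

CEPH_MSGR_TAG_MSG = 7   # a full message follows (e.g. osd_opreply)
CEPH_MSGR_TAG_ACK = 8   # peer acknowledges a sent sequence number

class Connection:
    def __init__(self):
        self.out_queue = []      # messages waiting to be written
        self.sent = {}           # seq -> message, awaiting ack
        self.log = []

    def con_work(self, incoming):
        """One pass of the work function: read side, then write side."""
        self.try_read(incoming)
        self.try_write()

    def try_read(self, incoming):
        for tag, payload in incoming:
            if tag == CEPH_MSGR_TAG_ACK:
                # "got ack for seq N": drop our ref on the sent message
                self.sent.pop(payload, None)
                self.log.append(f"ack seq {payload}")
            elif tag == CEPH_MSGR_TAG_MSG:
                # "prepare_read_message" then "handle_reply"
                self.log.append(f"msg {payload}")

    def try_write(self):
        while self.out_queue:
            seq, msg = self.out_queue.pop(0)
            self.sent[seq] = msg             # keep until acked
            self.log.append(f"sent seq {seq}")
        self.log.append("nothing else to write")

con = Connection()
con.out_queue.append((2, "osd_op tid 65536"))
con.con_work(incoming=[])                          # write out the request
con.con_work(incoming=[(CEPH_MSGR_TAG_ACK, 2)])    # peer acks seq 2
con.con_work(incoming=[(CEPH_MSGR_TAG_MSG, "osd_opreply tid 65536")])
```

The point the trace makes is that this whole cycle completes normally on the client; the multi-second gaps come from the OSD side, which is what points at the journal.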
Updated by Alex Elder about 11 years ago
Here is the yaml file I used for reproducing this.
interactive-on-error:
roles:
- [mon.a, mon.c, osd.0, osd.1, osd.2]
- [mon.b, mds.a, osd.3, osd.4, osd.5]
- [client.0]
kernel:
  mon:
    branch: wip-4079
    kdb: true
  client:
    branch: wip-4079
    kdb: true
overrides:
  ceph:
    conf:
      osd:
        osd op complaint time: 120
    fs: btrfs
    log-whitelist:
    - clocks not synchronized
    - old request
tasks:
- ceph:
    branch: next
    conf:
      global:
        debug ms: 10
      osd:
        debug osd: 20
- rbd:
    all:
- rbd.xfstests:
    client.0:
      count: 5
      tests: 258-259
Updated by Ian Colle about 11 years ago
- Assignee changed from Alex Elder to Josh Durgin
Updated by Alex Elder about 11 years ago
Another possible clue. Things seemed to be going reasonably
well on this run, but sometime during test 78 things went bad,
and we also got lots of "mons are laggy or clocks are too skewed"
messages.
First, this is a set of possibly typical timings for these tests:
. . .
INFO:teuthology.task.rbd.client.0.out:070 22s
INFO:teuthology.task.rbd.client.0.out:074 295s
INFO:teuthology.task.rbd.client.0.out:075 240s
INFO:teuthology.task.rbd.client.0.out:076 37s
INFO:teuthology.task.rbd.client.0.out:078 72s
INFO:teuthology.task.rbd.client.0.out:079 4s
INFO:teuthology.task.rbd.client.0.out:084 59s
INFO:teuthology.task.rbd.client.0.out:085 4s
INFO:teuthology.task.rbd.client.0.out:086 160s
. . .
Now on this one, note that tests 75 and 76 show about
the right elapsed time (240 and 37 seconds), but then
test 78 (after the messages) took 707 seconds rather than
around 72, and test 79 took 213 seconds rather than
the expected 4 or so.
. . .
INFO:teuthology.task.rbd.client.0.out:070 23s
INFO:teuthology.task.rbd.client.0.out:074 326s
INFO:teuthology.task.rbd.client.0.out:075 240s
INFO:teuthology.task.rbd.client.0.out:076 37s
INFO:teuthology.task.ceph.mon.a.err:2013-02-22 19:12:49.873079 7fb1ba56e700 -1 mon.a@1(peon).paxos(monmap active c 1..1) lease_expire from mon.0 10.214.133.20:6789/0 is 0.100789 seconds in the past; mons are laggy or clocks are too skewed
INFO:teuthology.task.ceph.mon.c.err:2013-02-22 19:12:49.873082 7f72de10a700 -1 mon.c@2(peon).paxos(monmap active c 1..1) lease_expire from mon.0 10.214.133.20:6789/0 is 0.100793 seconds in the past; mons are laggy or clocks are too skewed
INFO:teuthology.task.ceph.mon.a.err:2013-02-22 19:13:00.061164 7fb1ba56e700 -1 mon.a@1(peon).paxos(logm updating c 132..634) lease_expire from mon.0 10.214.133.20:6789/0 is 0.847412 seconds in the past; mons are laggy or clocks are too skewed
INFO:teuthology.task.ceph.mon.c.err:2013-02-22 19:13:00.061199 7f72de10a700 -1 mon.c@2(peon).paxos(logm updating c 132..634) lease_expire from mon.0 10.214.133.20:6789/0 is 0.847448 seconds in the past; mons are laggy or clocks are too skewed
INFO:teuthology.task.ceph.mon.a.err:2013-02-22 19:13:00.588403 7fb1ba56e700 -1 mon.a@1(peon).paxos(monmap active c 1..1) lease_expire from mon.0 10.214.133.20:6789/0 is 0.544320 seconds in the past; mons are laggy or clocks are too skewed
INFO:teuthology.task.ceph.mon.a.err:2013-02-22 19:13:00.588527 7fb1ba56e700 -1 mon.a@1(peon).paxos(osdmap active c 1..5) lease_expire from mon.0 10.214.133.20:6789/0 is 0.463277 seconds in the past; mons are laggy or clocks are too skewed
INFO:teuthology.task.ceph.mon.a.err:2013-02-22 19:13:00.588650 7fb1ba56e700 -1 mon.a@1(peon).paxos(auth active c 1..10) lease_expire from mon.0 10.214.133.20:6789/0 is 0.463295 seconds in the past; mons are laggy or clocks are too skewed
INFO:teuthology.task.ceph.mon.a.err:2013-02-22 19:13:00.588762 7fb1ba56e700 -1 mon.a@1(peon).paxos(mdsmap active c 1..5) lease_expire from mon.0 10.214.133.20:6789/0 is 0.463101 seconds in the past; mons are laggy or clocks are too skewed
INFO:teuthology.task.ceph.mon.c.err:2013-02-22 19:13:00.588384 7f72de10a700 -1 mon.c@2(peon).paxos(monmap active c 1..1) lease_expire from mon.0 10.214.133.20:6789/0 is 0.544301 seconds in the past; mons are laggy or clocks are too skewed
INFO:teuthology.task.ceph.mon.c.err:2013-02-22 19:13:00.588489 7f72de10a700 -1 mon.c@2(peon).paxos(osdmap active c 1..5) lease_expire from mon.0 10.214.133.20:6789/0 is 0.463238 seconds in the past; mons are laggy or clocks are too skewed
INFO:teuthology.task.ceph.mon.c.err:2013-02-22 19:13:00.588624 7f72de10a700 -1 mon.c@2(peon).paxos(auth active c 1..10) lease_expire from mon.0 10.214.133.20:6789/0 is 0.463268 seconds in the past; mons are laggy or clocks are too skewed
INFO:teuthology.task.ceph.mon.c.err:2013-02-22 19:13:00.588847 7f72de10a700 -1 mon.c@2(peon).paxos(mdsmap active c 1..5) lease_expire from mon.0 10.214.133.20:6789/0 is 0.463186 seconds in the past; mons are laggy or clocks are too skewed
INFO:teuthology.task.ceph.mon.a.err:2013-02-22 19:13:01.181790 7fb1ba56e700 -1 mon.a@1(peon).paxos(pgmap updating c 124..626) lease_expire from mon.0 10.214.133.20:6789/0 is 0.889894 seconds in the past; mons are laggy or clocks are too skewed
INFO:teuthology.task.ceph.mon.c.err:2013-02-22 19:13:01.181809 7f72de10a700 -1 mon.c@2(peon).paxos(pgmap updating c 124..626) lease_expire from mon.0 10.214.133.20:6789/0 is 0.889913 seconds in the past; mons are laggy or clocks are too skewed
INFO:teuthology.task.rbd.client.0.out:078 707s
INFO:teuthology.task.rbd.client.0.out:079 213s
INFO:teuthology.task.rbd.client.0.out:084 100s
INFO:teuthology.task.rbd.client.0.out:085 204s
INFO:teuthology.task.rbd.client.0.out:086 4306s
. . .
Updated by Sage Weil about 11 years ago
- Assignee deleted (Josh Durgin)
- Priority changed from High to Urgent
Updated by Sage Weil about 11 years ago
- Subject changed from krbd: requests become extremely slow following socket error to osd: journal aio deadlock
Tracked this down to a hang in the journal aio code: specifically, the throttle code in write_thread_entry().
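For context on the failure mode named in the new subject line: the journal write thread blocks in throttle code waiting for in-flight journal bytes to drain, but with aio those bytes are only returned when completions are reaped, so if completions stop being processed the writer waits forever. A toy Python model of that shape, with hypothetical names (Throttle, write_thread_entry, etc. are illustrative, not Ceph's actual FileJournal implementation):

```python
# Toy model of the hang: the journal write thread blocks in take()
# until aio completions return bytes via put(). If completions are
# never reaped, the second take() below would block forever -- the
# deadlock this ticket tracks down. Names are illustrative only.
import threading
import queue

class Throttle:
    def __init__(self, max_bytes):
        self.max, self.cur = max_bytes, 0
        self.cond = threading.Condition()

    def take(self, n):
        with self.cond:
            # the write thread blocks here while the journal is full
            while self.cur + n > self.max:
                self.cond.wait()
            self.cur += n

    def put(self, n):
        with self.cond:
            self.cur -= n
            self.cond.notify_all()

throttle = Throttle(max_bytes=100)
completions = queue.Queue()
written = []

def write_thread_entry(entries):
    for size in entries:
        throttle.take(size)        # may block waiting for completions
        written.append(size)
        completions.put(size)      # submit aio; completion reaped later

def aio_completion_thread(n):
    for _ in range(n):
        size = completions.get()   # reap one completed aio
        throttle.put(size)         # release the throttled bytes

entries = [60, 60, 60]             # each entry must wait for the previous
w = threading.Thread(target=write_thread_entry, args=(entries,))
c = threading.Thread(target=aio_completion_thread, args=(len(entries),))
w.start(); c.start(); w.join(); c.join()
```

With the completion thread running, all three entries go through and the throttle drains back to zero; remove it and the writer parks in take() for the second entry, which matches the observed symptom of requests trickling through at a glacial pace rather than failing outright.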
Updated by Josh Durgin about 11 years ago
Alex saw a slowdown in one of his xfstests runs against ceph master. Logs and a short description of the setup are on plana29 in /home/ubuntu/bug_4079
Updated by Sage Weil about 11 years ago
not this bug.
Also, facepalm: the reason I can't reproduce this is because we turned off journal aio by default until we fix this. You need to put journal aio = true in the config to trigger it.
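Per the comment above, reproducing this requires re-enabling aio journaling, since it is now off by default. In ceph.conf terms that would be something like:

```ini
[osd]
    journal aio = true
```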
Updated by Sage Weil about 11 years ago
- Status changed from In Progress to Resolved