Project

General

Profile

Actions

Bug #777

closed

mount hung, tid timed out messages in log

Added by John Leach about 13 years ago. Updated about 13 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
Category:
-
Target version:
-
% Done:

100%

Source:
Tags:
Backport:
Regression:
Severity:
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

I have a ceph cluster with 3 mons, 1 mds and 4 osds. I mounted the ceph filesystem on another machine using the default 2.6.37 kernel client and write a script to create a million files, between 1k and 4k in size. It ran for some unknown amount of time (at a guess, at least 30 to 60 mins) putting load on the cluster but at some point my script process got stuck in D state and the mount because inaccessible (could see the free space but any kind of read or write operation on the mount stuck the process in D state) and all load ceased.

In the kernel log on the client I see message like "tid 405965 timed out on osd1, will reset osd" (see client.log). You'll also see where I tried restarting the whole ceph cluster (though not the client) in an attempt to get the mount working again (with no luck).

Logs for 3 of the 4 osds attached (osd0 never seemed to have any timed out messages).

I started the test at around 12:30 on 4th Feb. I turned osd debugging on at about 23:36 on 4th Feb and shut the cluster down to go to bed at around 00:45 on 5th Feb.

I've also included a tarball of the /sys/kernel/debug/ceph directory on the client (though this was made today, many hours after the cluster was been shut down).


Files

cephlogs.tar.bz2 (1.78 MB) cephlogs.tar.bz2 John Leach, 02/05/2011 10:45 AM
mds-commit-fail.log (12.9 KB) mds-commit-fail.log John Leach, 02/09/2011 03:38 PM

Subtasks 2 (0 open2 closed)

Tasks #796: Let CDir::_commit_full write in piecesResolvedGreg Farnum02/10/2011

Actions
CephFS - Tasks #797: Don't _commit_full just because dir is_complete()ResolvedGreg Farnum02/10/2011

Actions
Actions

Also available in: Atom PDF