Project

General

Profile

Actions

Bug #791

closed

ls -al waits for writes to complete

Added by Greg Farnum about 13 years ago. Updated over 7 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
Category:
-
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
Regression:
Severity:
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS):
Labels (FS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

On Mon, 6 Dec 2010, Jim Schutt wrote:

Hi Sage,

On Sat, 2010-12-04 at 21:59 -0700, Sage Weil wrote:

Also, a possibly related behavior I've noticed is that
an 'ls' on a directory where I'm writing files
does not return until all the writers are finished.

I realize it's likely related to caps, but
I'm hoping that can be fixed up somehow?

It depends. If the clients "wrote" that data into the buffer cache and
it's just taking a long time to flush it out, then things are working as
intended (given current locking state machine). That can be improved, but
hasn't been a priority (see #541). If the dd's are still writing and they
don't stop, something is wrong, either on the mds or kclient.

So here's the results from a couple trials. In the
below results, "do_pdd" is a simple wrapper around
"pdsh -w <clients> dd" that computes aggregate
results.

In one window I launch the parallel dd commands;
in another window, on a client of my filesystem,
I do the ls.

This is possibly related to #325.

Actions #1

Updated by Sage Weil about 13 years ago

  • Translation missing: en.field_story_points set to 2
  • Translation missing: en.field_position set to 560
Actions #2

Updated by Sage Weil almost 13 years ago

  • Target version changed from 12 to v0.29
  • Translation missing: en.field_position deleted (601)
  • Translation missing: en.field_position set to 622
Actions #3

Updated by Sage Weil almost 13 years ago

  • Translation missing: en.field_position deleted (621)
  • Translation missing: en.field_position set to 619
Actions #4

Updated by Sage Weil almost 13 years ago

  • Category set to 1
  • Assignee set to Sage Weil

I reproduced some long stalls (~20 seconds) due to the loner flip-flopping. Need to analyze the logs (currently on vapre) to sort out what happened and fix.

Actions #5

Updated by Sage Weil almost 13 years ago

  • Status changed from New to Resolved

commit:ee7fa813ef29890557f0b03bd3950d422484215d

Actions #6

Updated by John Spray over 7 years ago

  • Project changed from Ceph to CephFS
  • Category deleted (1)
  • Target version deleted (v0.29)

Bulk updating project=ceph category=mds bugs so that I can remove the MDS category from the Ceph project to avoid confusion.

Actions

Also available in: Atom PDF