Project

General

Profile

Bug #1369

ffsb hang on cfuse (messenger?)

Added by Josh Durgin over 12 years ago. Updated over 12 years ago.

Status:
Can't reproduce
Priority:
Normal
Assignee:
Category:
-
Target version:
% Done:

0%

Source:
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

This is from teuthology run 218 (logs in teuthology:~teuthworker/archive/full_suite_coverage_20110805/218/).
The relevant nodes, sepia{5,46,52} are still locked.


Related issues

Related to Ceph - Bug #1359: fsstress workunit hang on cfuse Can't reproduce 08/04/2011

History

#1 Updated by Sage Weil over 12 years ago

  • Target version set to v0.34
  • translation missing: en.field_position set to 795

#2 Updated by Sage Weil over 12 years ago

  • translation missing: en.field_position deleted (798)
  • translation missing: en.field_position set to 37

#3 Updated by Sage Weil over 12 years ago

  • Assignee set to Sage Weil

The logs show regular ms_handle_resets (exactly 15 minute intervals). In config.cc is ms_tcp_read_timeout is 900 sec, which looks promising...

also have this running in a loop against other nodes

#4 Updated by Sage Weil over 12 years ago

  • Subject changed from ffsb hang on cfuse to ffsb hang on cfuse (messenger?)

#5 Updated by Sage Weil over 12 years ago

  • Target version changed from v0.34 to v0.35

#6 Updated by Sage Weil over 12 years ago

unlocked nodes

#7 Updated by Sage Weil over 12 years ago

  • Status changed from New to Can't reproduce

I've run this a gazillion times now. There were many hangs, but they were a result of mon crashes and other bugs that have since been fixes. It's not clear if this could have been one of them or not, but in any case, I'm not seeing any problems here now.

Also available in: Atom PDF