Project

General

Profile

Actions

Bug #4035

closed

Ceph doesn't recover from fault on Opensuse (cfuse tests & rbd-cli tests)

Added by Ken Franklin about 11 years ago. Updated about 11 years ago.

Status:
Rejected
Priority:
High
Assignee:
Category:
-
Target version:
-
% Done:

0%

Source:
Development
Tags:
Backport:
Regression:
Severity:
2 - major
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS):
Labels (FS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

I'm not sure if this is exclusive to fs but on an opensuse, single node cluster, when running cfuse and rbd tests a fault was generated that didn't recover until ceph was restarted:
2013-02-06 13:15:09.050422 7fed981e1700 0 -- 192.168.6.131:6802/9838 >> 192.168.6.131:6805/33151 pipe(0x311b8f0 sd=29 :0 s=1 pg
s=0 cs=1 l=0).fault

The fault appears in both mds and osd logs (attached)


Files

ceph-mds.a.copy.log (6.34 KB) ceph-mds.a.copy.log Ken Franklin, 02/06/2013 11:01 AM
ceph-osd.0.copy.log (43.8 KB) ceph-osd.0.copy.log Ken Franklin, 02/06/2013 11:01 AM
Actions #1

Updated by Ian Colle about 11 years ago

  • Assignee set to Sage Weil
  • Priority changed from Normal to Urgent
Actions #2

Updated by Sage Weil about 11 years ago

  • Status changed from New to Need More Info
  • Priority changed from Urgent to High

the fault message itself is nothing to worry about; just a socket error that we normally recover from. can you clarify what else broke/didn't behave?

Actions #3

Updated by Sage Weil about 11 years ago

  • Status changed from Need More Info to Rejected
Actions

Also available in: Atom PDF