Project

General

Profile

Bug #4035

Ceph doesn't recover from fault on Opensuse (cfuse tests & rbd-cli tests)

Added by Ken Franklin about 11 years ago. Updated about 11 years ago.

Status:
Rejected
Priority:
High
Assignee:
Category:
-
Target version:
-
% Done:

0%

Source:
Development
Tags:
Backport:
Regression:
No
Severity:
2 - major
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS):
Labels (FS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

I'm not sure if this is exclusive to fs but on an opensuse, single node cluster, when running cfuse and rbd tests a fault was generated that didn't recover until ceph was restarted:
2013-02-06 13:15:09.050422 7fed981e1700 0 -- 192.168.6.131:6802/9838 >> 192.168.6.131:6805/33151 pipe(0x311b8f0 sd=29 :0 s=1 pg
s=0 cs=1 l=0).fault

The fault appears in both mds and osd logs (attached)

ceph-mds.a.copy.log View (6.34 KB) Ken Franklin, 02/06/2013 11:01 AM

ceph-osd.0.copy.log View (43.8 KB) Ken Franklin, 02/06/2013 11:01 AM

History

#1 Updated by Ian Colle about 11 years ago

  • Assignee set to Sage Weil
  • Priority changed from Normal to Urgent

#2 Updated by Sage Weil about 11 years ago

  • Status changed from New to Need More Info
  • Priority changed from Urgent to High

the fault message itself is nothing to worry about; just a socket error that we normally recover from. can you clarify what else broke/didn't behave?

#3 Updated by Sage Weil about 11 years ago

  • Status changed from Need More Info to Rejected

Also available in: Atom PDF