Bug #14193

closed

XIO: OSDs go down when writing data from a fuse client with iozone

Added by Pan Liang over 8 years ago. Updated about 5 years ago.

Status: Won't Fix
Priority: Normal
Assignee: -
Category: -
Target version: -
% Done: 0%
Source: other
Tags:
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

We have recently been working with Ceph 9.0.3 and are trying to use the XIO messenger in an InfiniBand environment. First, we deployed 8 OSDs in our cluster, mounted it with the fuse client, and wrote data with iozone. Writes appeared stable for 24 hours. We then deployed a cluster with 24 OSDs and repeated the same steps, but after writing for a period of time, some OSDs went down.
We use Accelio 1.4, and we found some errors in the OSD logs that come from Accelio. These errors confuse us, because they are also reported when the cluster runs normally.
We found that they appear every time and are always accompanied by blocked requests. The attached file contains the log of one of the downed OSDs; a core dump can be found in it.
If you need any other logs, or have any ideas about this problem, we would very much appreciate it.


Files

ceph-osd.2.rar (133 KB): here is the log. Pan Liang, 12/29/2015 05:14 AM
Actions #1

Updated by Pan Liang over 8 years ago

Actions #2

Updated by Josh Durgin about 7 years ago

  • Status changed from New to Won't Fix

The RDMA backend for the async messenger is the path forward for InfiniBand support at this point.

Actions #3

Updated by Greg Farnum about 5 years ago

  • Project changed from Ceph to Messengers
  • Category deleted (msgr)