Bug #37720

ceph-osd hangs when SPDK is enabled

Added by Tone ZHANG 8 months ago. Updated 8 months ago.

Status: Resolved
Priority: Normal
Assignee:
Category: -
Target version: -
Start date: 12/20/2018
Due date:
% Done: 0%
Source: Community (dev)
Tags:
Backport:
Regression: No
Severity: 2 - major
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:

Description

When setting up a development Ceph cluster with SPDK enabled, ceph-osd hangs on the aarch64 platform and hits an assertion failure on the x86 platform.

The Ceph version is the latest master.

The error log on aarch64 is below:

Starting SPDK v18.04.1 / DPDK 18.05.0 initialization...
[ DPDK EAL parameters: nvme-device-manager -c 0x1 -m 2048 --file-prefix=spdk_pid16987 ]
EAL: Detected 46 lcore(s)
EAL: Detected 1 NUMA nodes
EAL: Multi-process socket /var/run/dpdk/spdk_pid16987/mp_socket
EAL: Probing VFIO support...
EAL: VFIO support initialized
EAL: PCI device 0000:01:00.0 on NUMA socket 0
EAL: probe driver: 8086:953 spdk_nvme
EAL: using IOMMU type 1 (Type 1)
^C

The assert log on x86 is below:

/home/ubuntu/ceph/src/common/mutex_debug.h: 114: FAILED ceph_assert(r == 0)
ceph version 14.0.1-1862-g403622b (403622be721a460f3dff2d84f6bfc628f5026704) nautilus (dev)
1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x1aa) [0x55d42ed9506e]
2: (()+0x2b292f0) [0x55d42ed952f0]
3: (ceph::mutex_debug_detail::mutex_debug_impl<false>::~mutex_debug_impl()+0x34) [0x55d42e3ae4d0]
4: (NVMEManager::~NVMEManager()+0x42) [0x55d42ed0fc18]
5: (()+0x39ff8) [0x7feff061aff8]
6: (()+0x3a045) [0x7feff061b045]
7: (Preforker::daemonize()+0) [0x55d42e2f3aa6]
8: (main()+0x1f3f) [0x55d42e2ea4bf]
9: (__libc_start_main()+0xf0) [0x7feff0601830]
10: (_start()+0x29) [0x55d42e2e7579]
*** Caught signal (Aborted) **
 in thread 7feff4037900 thread_name:ceph-osd
2018-12-19 10:08:24.217 7feff4037900 -1 /home/ubuntu/ceph/src/common/mutex_debug.h: In function 'ceph::mutex_debug_detail::mutex_debug_impl<<anonymous> >::~mutex_debug_impl() [with bool Recursive = false]' thread 7feff4037900 time 2018-12-19 10:08:24.208161

History

#1 Updated by Tone ZHANG 8 months ago

I'm working on the issue.

#2 Updated by Tone ZHANG 8 months ago

Please review the fix: https://github.com/ceph/ceph/pull/25646

#3 Updated by Greg Farnum 8 months ago

  • Project changed from Ceph to RADOS
  • Status changed from New to Resolved
