Bug #19511

closed

bluestore overwhelms aio queue

Added by Dmitry Smirnov about 7 years ago. Updated over 6 years ago.

Status: Resolved
Priority: High
Assignee:
Target version:
% Done: 0%
Source:
Tags:
Backport:
Regression: No
Severity: 1 - critical
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

In a Ceph v12.0.1 cluster with the BlueStore backend enabled, some OSDs (currently 4 of 30, on different hosts) started being marked DOWN, and the OSD processes fail with core dumps on every restart attempt.
This seems to have started after the upgrade from v12.0.0 to v12.0.1. Ceph is deployed on six Red Hat 7.3 servers with a 10 Gb/s interconnect.

Error message:
-------------------------------------------------------------------------------------------------------------------------------------------------
Apr 06 12:52:12 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:12.512466 7f5498612b00 1 WARNING: the following dangerous and experimental features are enabled: bluestore
Apr 06 12:52:12 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:12.512748 7f5498612b00 -1 WARNING: the following dangerous and experimental features are enabled: bluestore
Apr 06 12:52:12 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:12.512890 7f5498612b00 -1 WARNING: experimental feature 'bluestore' is enabled
Apr 06 12:52:12 rdocloudnode5.kc.lv ceph-osd95063: Please be aware that this feature is experimental, untested,
Apr 06 12:52:12 rdocloudnode5.kc.lv ceph-osd95063: unsupported, and may result in data corruption, data loss,
Apr 06 12:52:12 rdocloudnode5.kc.lv ceph-osd95063: and/or irreparable damage to your cluster. Do not use
Apr 06 12:52:12 rdocloudnode5.kc.lv ceph-osd95063: feature with important data.
Apr 06 12:52:12 rdocloudnode5.kc.lv ceph-osd95063: starting osd.18 at - osd_data /var/lib/ceph/osd/ceph-18 /var/lib/ceph/osd/ceph-18/journal
Apr 06 12:52:12 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:12.535362 7f5498612b00 -1 WARNING: the following dangerous and experimental features are enabled: bluestore
Apr 06 12:52:12 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:12.632827 7f5498612b00 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 1
Apr 06 12:52:12 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:12.633139 7f5498612b00 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 1
Apr 06 12:52:12 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:12.633374 7f5498612b00 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 1
Apr 06 12:52:12 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:12.633709 7f5498612b00 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 1
Apr 06 12:52:12 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:12.633977 7f5498612b00 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 1
Apr 06 12:52:14 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:14.484098 7f5498612b00 -1 osd.18 4529 log_to_monitors {default=true}
Apr 06 12:52:15 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:15.711108 7f5485251700 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 1
Apr 06 12:52:15 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:15.849766 7f5485251700 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 1
Apr 06 12:52:15 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:15.850305 7f5485251700 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 1
Apr 06 12:52:15 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:15.850851 7f5485251700 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 1
Apr 06 12:52:15 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:15.959358 7f5485251700 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 1
Apr 06 12:52:15 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:15.960130 7f5485251700 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 1
Apr 06 12:52:15 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:15.960468 7f5485251700 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 1
Apr 06 12:52:16 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:16.124253 7f5485251700 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 1
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: /home/jenkins-build/build/workspace/ceph-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos7/DIST/centos7/MACHINE_SIZE/huge/release/12.0.1/rpm/el7/BUILD/
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: /home/jenkins-build/build/workspace/ceph-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos7/DIST/centos7/MACHINE_SIZE/huge/release/12.0.1/rpm/el7/BUILD/
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:24.317687 7f5485251700 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 16
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:24.317693 7f5485251700 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio submit got (11) Resource temporarily unavailable
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: ceph version 12.0.1 (5456408827a1a31690514342624a4ff9b66be1d5)
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x110) [0x7f5498fe4470]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 2: (KernelDevice::aio_submit(IOContext*)+0x893) [0x7f5498f6d723]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 3: (BlueStore::_txc_aio_submit(BlueStore::TransContext*)+0x72) [0x7f5498e46072]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 4: (BlueStore::_deferred_try_submit(BlueStore::OpSequencer*)+0xf44) [0x7f5498e92904]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 5: (BlueStore::_deferred_try_submit()+0xaa) [0x7f5498e9300a]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 6: (BlueStore::_kv_sync_thread()+0x1cde) [0x7f5498e9946e]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 7: (BlueStore::KVSyncThread::entry()+0xd) [0x7f5498ec6eed]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 8: (()+0x7dc5) [0x7f5495e80dc5]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 9: (clone()+0x6d) [0x7f5494f7573d]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:24.320362 7f5485251700 -1 /home/jenkins-build/build/workspace/ceph-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos7/DIST/centos7/MACH
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: /home/jenkins-build/build/workspace/ceph-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos7/DIST/centos7/MACHINE_SIZE/huge/release/12.0.1/rpm/el7/BUILD/
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: ceph version 12.0.1 (5456408827a1a31690514342624a4ff9b66be1d5)
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x110) [0x7f5498fe4470]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 2: (KernelDevice::aio_submit(IOContext*)+0x893) [0x7f5498f6d723]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 3: (BlueStore::_txc_aio_submit(BlueStore::TransContext*)+0x72) [0x7f5498e46072]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 4: (BlueStore::_deferred_try_submit(BlueStore::OpSequencer*)+0xf44) [0x7f5498e92904]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 5: (BlueStore::_deferred_try_submit()+0xaa) [0x7f5498e9300a]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 6: (BlueStore::_kv_sync_thread()+0x1cde) [0x7f5498e9946e]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 7: (BlueStore::KVSyncThread::entry()+0xd) [0x7f5498ec6eed]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 8: (()+0x7dc5) [0x7f5495e80dc5]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 9: (clone()+0x6d) [0x7f5494f7573d]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: -9902> 2017-04-06 12:52:14.484098 7f5498612b00 -1 osd.18 4529 log_to_monitors {default=true}
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: -971> 2017-04-06 12:52:15.711108 7f5485251700 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 1
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: -928> 2017-04-06 12:52:15.849766 7f5485251700 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 1
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: -917> 2017-04-06 12:52:15.850305 7f5485251700 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 1
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: -906> 2017-04-06 12:52:15.850851 7f5485251700 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 1
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: -881> 2017-04-06 12:52:15.959358 7f5485251700 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 1
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: -880> 2017-04-06 12:52:15.960130 7f5485251700 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 1
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: -879> 2017-04-06 12:52:15.960468 7f5485251700 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 1
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: -837> 2017-04-06 12:52:16.124253 7f5485251700 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 1
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: -2> 2017-04-06 12:52:24.317687 7f5485251700 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 16
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: -1> 2017-04-06 12:52:24.317693 7f5485251700 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio submit got (11) Resource temporarily unavailable
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 0> 2017-04-06 12:52:24.320362 7f5485251700 -1 /home/jenkins-build/build/workspace/ceph-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos7/DIST/centos7/M
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: /home/jenkins-build/build/workspace/ceph-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos7/DIST/centos7/MACHINE_SIZE/huge/release/12.0.1/rpm/el7/BUILD/
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: ceph version 12.0.1 (5456408827a1a31690514342624a4ff9b66be1d5)
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x110) [0x7f5498fe4470]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 2: (KernelDevice::aio_submit(IOContext*)+0x893) [0x7f5498f6d723]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 3: (BlueStore::_txc_aio_submit(BlueStore::TransContext*)+0x72) [0x7f5498e46072]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 4: (BlueStore::_deferred_try_submit(BlueStore::OpSequencer*)+0xf44) [0x7f5498e92904]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 5: (BlueStore::_deferred_try_submit()+0xaa) [0x7f5498e9300a]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 6: (BlueStore::_kv_sync_thread()+0x1cde) [0x7f5498e9946e]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 7: (BlueStore::KVSyncThread::entry()+0xd) [0x7f5498ec6eed]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 8: (()+0x7dc5) [0x7f5495e80dc5]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 9: (clone()+0x6d) [0x7f5494f7573d]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: *** Caught signal (Aborted) **
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: in thread 7f5485251700 thread_name:bstore_kv_sync
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: ceph version 12.0.1 (5456408827a1a31690514342624a4ff9b66be1d5)
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 1: (()+0x95091f) [0x7f5498f8991f]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 2: (()+0xf370) [0x7f5495e88370]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 3: (gsignal()+0x37) [0x7f5494eb31d7]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 4: (abort()+0x148) [0x7f5494eb48c8]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 5: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x284) [0x7f5498fe45e4]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 6: (KernelDevice::aio_submit(IOContext*)+0x893) [0x7f5498f6d723]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 7: (BlueStore::_txc_aio_submit(BlueStore::TransContext*)+0x72) [0x7f5498e46072]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 8: (BlueStore::_deferred_try_submit(BlueStore::OpSequencer*)+0xf44) [0x7f5498e92904]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 9: (BlueStore::_deferred_try_submit()+0xaa) [0x7f5498e9300a]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 10: (BlueStore::_kv_sync_thread()+0x1cde) [0x7f5498e9946e]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 11: (BlueStore::KVSyncThread::entry()+0xd) [0x7f5498ec6eed]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 12: (()+0x7dc5) [0x7f5495e80dc5]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 13: (clone()+0x6d) [0x7f5494f7573d]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:24.344606 7f5485251700 -1 *** Caught signal (Aborted) **
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: in thread 7f5485251700 thread_name:bstore_kv_sync
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: ceph version 12.0.1 (5456408827a1a31690514342624a4ff9b66be1d5)
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 1: (()+0x95091f) [0x7f5498f8991f]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 2: (()+0xf370) [0x7f5495e88370]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 3: (gsignal()+0x37) [0x7f5494eb31d7]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 4: (abort()+0x148) [0x7f5494eb48c8]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 5: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x284) [0x7f5498fe45e4]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 6: (KernelDevice::aio_submit(IOContext*)+0x893) [0x7f5498f6d723]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 7: (BlueStore::_txc_aio_submit(BlueStore::TransContext*)+0x72) [0x7f5498e46072]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 8: (BlueStore::_deferred_try_submit(BlueStore::OpSequencer*)+0xf44) [0x7f5498e92904]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 9: (BlueStore::_deferred_try_submit()+0xaa) [0x7f5498e9300a]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 10: (BlueStore::_kv_sync_thread()+0x1cde) [0x7f5498e9946e]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 11: (BlueStore::KVSyncThread::entry()+0xd) [0x7f5498ec6eed]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 12: (()+0x7dc5) [0x7f5495e80dc5]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 13: (clone()+0x6d) [0x7f5494f7573d]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 0> 2017-04-06 12:52:24.344606 7f5485251700 -1 *** Caught signal (Aborted) **
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: in thread 7f5485251700 thread_name:bstore_kv_sync
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: ceph version 12.0.1 (5456408827a1a31690514342624a4ff9b66be1d5)
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 1: (()+0x95091f) [0x7f5498f8991f]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 2: (()+0xf370) [0x7f5495e88370]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 3: (gsignal()+0x37) [0x7f5494eb31d7]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 4: (abort()+0x148) [0x7f5494eb48c8]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 5: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x284) [0x7f5498fe45e4]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 6: (KernelDevice::aio_submit(IOContext*)+0x893) [0x7f5498f6d723]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 7: (BlueStore::_txc_aio_submit(BlueStore::TransContext*)+0x72) [0x7f5498e46072]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 8: (BlueStore::_deferred_try_submit(BlueStore::OpSequencer*)+0xf44) [0x7f5498e92904]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 9: (BlueStore::_deferred_try_submit()+0xaa) [0x7f5498e9300a]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 10: (BlueStore::_kv_sync_thread()+0x1cde) [0x7f5498e9946e]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 11: (BlueStore::KVSyncThread::entry()+0xd) [0x7f5498ec6eed]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 12: (()+0x7dc5) [0x7f5495e80dc5]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 13: (clone()+0x6d) [0x7f5494f7573d]
Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
Apr 06 12:52:24 rdocloudnode5.kc.lv abrt-hook-ccpp95209: Process 95063 (ceph-osd) of user 167 killed by SIGABRT - dumping core
Apr 06 12:52:25 rdocloudnode5.kc.lv abrt-hook-ccpp95209: Failed to create core_backtrace: waitpid failed: No child processes
Apr 06 12:52:25 rdocloudnode5.kc.lv systemd1: : main process exited, code=killed, status=6/ABRT
Apr 06 12:52:25 rdocloudnode5.kc.lv systemd1: Unit entered failed state.
Apr 06 12:52:25 rdocloudnode5.kc.lv systemd1: failed.
Apr 06 12:52:25 rdocloudnode5.kc.lv abrt-server95222: Package 'ceph-osd' isn't signed with proper key
Apr 06 12:52:25 rdocloudnode5.kc.lv abrt-server95222: 'post-create' on '/var/spool/abrt/ccpp-2017-04-06-12:52:24-95063' exited with 1
Apr 06 12:52:25 rdocloudnode5.kc.lv abrt-server95222: Deleting problem directory '/var/spool/abrt/ccpp-2017-04-06-12:52:24-95063'
------------------------------------------------------------------------------------------------------------------------------------------------
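
To make the pattern in the log above easier to see, here is a minimal Python sketch that scans journal lines for the two bdev messages ("aio_submit retries N" and "aio submit got (errno) ...") and reports the highest retry count and any submit errors per block device. The function name summarize_aio and the regular expressions are illustrative assumptions based only on the message shapes shown in this report; they are not part of Ceph.

```python
import re

# Patterns derived from the message shapes in this report (an assumption,
# not an official Ceph log grammar).
RETRY_RE = re.compile(r"bdev\(\S+ (?P<dev>\S+)\) aio_submit retries (?P<n>\d+)")
ERRNO_RE = re.compile(
    r"bdev\(\S+ (?P<dev>\S+)\) aio submit got \((?P<errno>\d+)\) (?P<msg>.+)$"
)


def summarize_aio(lines):
    """Return {device: {"max_retries": int, "errors": set of (errno, message)}}."""
    summary = {}
    for line in lines:
        retry = RETRY_RE.search(line)
        if retry:
            entry = summary.setdefault(
                retry["dev"], {"max_retries": 0, "errors": set()}
            )
            entry["max_retries"] = max(entry["max_retries"], int(retry["n"]))
            continue
        error = ERRNO_RE.search(line)
        if error:
            entry = summary.setdefault(
                error["dev"], {"max_retries": 0, "errors": set()}
            )
            entry["errors"].add((int(error["errno"]), error["msg"].strip()))
    return summary


# Three lines copied from the log in this report.
SAMPLE = [
    "Apr 06 12:52:16 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:16.124253 7f5485251700 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 1",
    "Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:24.317687 7f5485251700 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio_submit retries 16",
    "Apr 06 12:52:24 rdocloudnode5.kc.lv ceph-osd95063: 2017-04-06 12:52:24.317693 7f5485251700 -1 bdev(0x7f54a2ca3a00 /var/lib/ceph/osd/ceph-18/block) aio submit got (11) Resource temporarily unavailable",
]

result = summarize_aio(SAMPLE)
for device, info in result.items():
    print(device, info["max_retries"], sorted(info["errors"]))
```

Run against the full journal of an affected host, a summary like this shows at a glance which block devices reached the retry count that immediately precedes the EAGAIN (errno 11) failure and the assert in KernelDevice::aio_submit.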

# ceph-disk list
    /dev/dm-0 other, xfs, mounted on /
    /dev/dm-1 swap, swap
    /dev/nvme0n1 :
    /dev/nvme0n1p1 ceph block.wal, for /dev/sdb1
    /dev/nvme0n1p2 ceph block.wal, for /dev/sdc1
    /dev/nvme0n1p3 ceph block.wal, for /dev/sdd1
    /dev/nvme0n1p4 ceph block.wal, for /dev/sde1
    /dev/nvme0n1p5 ceph block.wal, for /dev/sdf1
    /dev/sda :
    /dev/sda1 other, vfat, mounted on /boot/efi
    /dev/sda2 other, xfs, mounted on /boot
    /dev/sda3 other, LVM2_member
    /dev/sdb :
    /dev/sdb1 ceph data, active, cluster ceph, osd.16, block /dev/sdb2, block.wal /dev/nvme0n1p1
    /dev/sdb2 ceph block, for /dev/sdb1
    /dev/sdc :
    /dev/sdc1 ceph data, active, cluster ceph, osd.17, block /dev/sdc2, block.wal /dev/nvme0n1p2
    /dev/sdc2 ceph block, for /dev/sdc1
    /dev/sdd :
    /dev/sdd1 ceph data, active, cluster ceph, osd.18, block /dev/sdd2, block.wal /dev/nvme0n1p3
    /dev/sdd2 ceph block, for /dev/sdd1
    /dev/sde :
    /dev/sde1 ceph data, active, cluster ceph, osd.19, block /dev/sde2, block.wal /dev/nvme0n1p4
    /dev/sde2 ceph block, for /dev/sde1
    /dev/sdf :
    /dev/sdf1 ceph data, active, cluster ceph, osd.28, block /dev/sdf2, block.wal /dev/nvme0n1p5
    /dev/sdf2 ceph block, for /dev/sdf1
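
To relate a crashing OSD (osd.18 in the log) to its devices, the ceph-disk list output above can be turned into a lookup table. The following is a minimal Python sketch; the function name osd_devices and the regular expression are illustrative assumptions that match only the "ceph data" line layout shown in this report, not every format ceph-disk can print.

```python
import re

# Matches lines such as:
#   /dev/sdd1 ceph data, active, cluster ceph, osd.18, block /dev/sdd2, block.wal /dev/nvme0n1p3
# (layout assumed from the output in this report).
DATA_RE = re.compile(
    r"^\s*(?P<data>/dev/\S+) ceph data, .*?(?P<osd>osd\.\d+), "
    r"block (?P<block>/dev/\S+?), block\.wal (?P<wal>/dev/\S+)\s*$"
)


def osd_devices(text):
    """Map each OSD name to its data, block, and WAL partitions."""
    mapping = {}
    for line in text.splitlines():
        match = DATA_RE.match(line)
        if match:
            mapping[match["osd"]] = {
                "data": match["data"],
                "block": match["block"],
                "wal": match["wal"],
            }
    return mapping


# Excerpt copied from the ceph-disk list output in this report.
SAMPLE = """\
    /dev/nvme0n1p3 ceph block.wal, for /dev/sdd1
    /dev/sdd :
    /dev/sdd1 ceph data, active, cluster ceph, osd.18, block /dev/sdd2, block.wal /dev/nvme0n1p3
    /dev/sdd2 ceph block, for /dev/sdd1
    /dev/sde1 ceph data, active, cluster ceph, osd.19, block /dev/sde2, block.wal /dev/nvme0n1p4
"""

devices = osd_devices(SAMPLE)
print(devices["osd.18"])
```

For this host the lookup shows that osd.18 keeps its block device on /dev/sdd2 and its WAL on /dev/nvme0n1p3, the same NVMe device that holds the WAL partitions of the other four OSDs.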

Related issues 1 (0 open, 1 closed)

Related to RADOS - Bug #21171: bluestore: aio submission deadlock (Resolved, Sage Weil, 08/29/2017)
