Project

General

Profile

Bug #47678

mgr: include/interval_set.h: 466: ceph_abort_msg("abort() called")

Added by Patrick Donnelly 9 months ago. Updated 5 months ago.

Status:
New
Priority:
Urgent
Assignee:
-
Category:
-
Target version:
% Done:

0%

Source:
Q/A
Tags:
Backport:
pacific,octopus,nautilus
Regression:
No
Severity:
2 - major
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS):
Labels (FS):
qa, qa-failure
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

2020-09-27T18:26:54.485+0000 7f122522e700  0 [rbd_support DEBUG root] TrashPurgeScheduleHandler: refresh_pools
2020-09-27T18:26:54.485+0000 7f122522e700  0 [rbd_support INFO root] TrashPurgeScheduleHandler: load_schedules
2020-09-27T18:26:54.485+0000 7f122522e700 20 mgr get_config  key: mgr/rbd_support/y/trash_purge_schedule
2020-09-27T18:26:54.485+0000 7f122522e700 20 mgr get_config  key: mgr/rbd_support/trash_purge_schedule
2020-09-27T18:26:54.485+0000 7f1226a31700  0 [rbd_support DEBUG root] MirrorSnapshotScheduleHandler: refresh_images
2020-09-27T18:26:54.485+0000 7f1226a31700  0 [rbd_support INFO root] MirrorSnapshotScheduleHandler: load_schedules
2020-09-27T18:26:54.485+0000 7f1226a31700 20 mgr get_config  key: mgr/rbd_support/y/mirror_snapshot_schedule
2020-09-27T18:26:54.485+0000 7f122522e700 10 mgr get_typed_config  [y/]trash_purge_schedule not found
2020-09-27T18:26:54.485+0000 7f1226a31700 20 mgr get_config  key: mgr/rbd_support/mirror_snapshot_schedule
2020-09-27T18:26:54.486+0000 7f122522e700  0 [rbd_support INFO root] load_schedules: rbd, start_after=
2020-09-27T18:26:54.486+0000 7f1226a31700 10 mgr get_typed_config  [y/]mirror_snapshot_schedule not found
2020-09-27T18:26:54.487+0000 7f122522e700  1 -- 172.21.15.110:0/2985789914 --> [v2:172.21.15.110:6826/34265,v1:172.21.15.110:6827/34265] -- osd_op(unknown.0.0:13 2.1 2:88c1567c:::rbd_trash_purge_schedule:head [omap-get-vals in=16b] snapc 0=[] ondisk+read+known_if_redirected e42) v8 -- 0x55b835afc000 con 0x55b835c88800
2020-09-27T18:26:54.487+0000 7f1226a31700  0 [rbd_support INFO root] load_schedules: rbd, start_after=
2020-09-27T18:26:54.487+0000 7f1226a31700  1 -- 172.21.15.110:0/2985789914 --> [v2:172.21.15.42:6816/34436,v1:172.21.15.42:6817/34436] -- osd_op(unknown.0.0:14 2.2 2:4e99cc3e:::rbd_mirror_snapshot_schedule:head [omap-get-vals in=16b] snapc 0=[] ondisk+read+known_if_redirected e42) v8 -- 0x55b83613ec00 con 0x55b835c89400
2020-09-27T18:26:54.488+0000 7f1228a35700  1 -- 172.21.15.110:0/2985789914 <== osd.5 v2:172.21.15.42:6816/34436 11 ==== osd_map(43..56 src has 1..56) v4 ==== 7684+0+0 (secure 0 0 0) 0x55b835f44000 con 0x55b835c89400
2020-09-27T18:26:54.488+0000 7f1254515700  1 -- 172.21.15.110:0/2985789914 <== osd.5 v2:172.21.15.42:6816/34436 12 ==== osd_op_reply(14 rbd_mirror_snapshot_schedule [omap-get-vals] v0'0 uv0 ondisk = -2 ((2) No such file or directory)) v8 ==== 172+0+0 (secure 0 0 0) 0x55b835cd38c0 con 0x55b835c89400
2020-09-27T18:26:54.488+0000 7f1255517700  1 -- 172.21.15.110:0/2985789914 <== osd.2 v2:172.21.15.110:6826/34265 12 ==== osd_op_reply(13 rbd_trash_purge_schedule [omap-get-vals] v0'0 uv0 ondisk = -2 ((2) No such file or directory)) v8 ==== 168+0+0 (secure 0 0 0) 0x55b8360bd440 con 0x55b835c88800
2020-09-27T18:26:54.489+0000 7f1228a35700 -1 /home/jenkins-build/build/workspace/ceph-dev-new-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/16.0.0-5932-g53f857cc/rpm/el8/BUILD/ceph-16.0.0-5932-g53f857cc/src/include/interval_set.h: In function 'void interval_set<T, C>::insert(T, T, T*, T*) [with T = snapid_t; C = mempool::osdmap::flat_map]' thread 7f1228a35700 time 2020-09-27T18:26:54.489696+0000
/home/jenkins-build/build/workspace/ceph-dev-new-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/16.0.0-5932-g53f857cc/rpm/el8/BUILD/ceph-16.0.0-5932-g53f857cc/src/include/interval_set.h: 466: ceph_abort_msg("abort() called")

 ceph version 16.0.0-5932-g53f857cc (53f857cc0602dd77284fd703c292252370ced0a5) pacific (dev)
 1: (ceph::__ceph_abort(char const*, int, char const*, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&)+0xe5) [0x7f125b204ce8]
 2: ()
 3: ()
 4: ()
 5: (DispatchQueue::entry()+0x126a) [0x7f125b42db1a]
 6: (DispatchQueue::DispatchThread::entry()+0x11) [0x7f125b4dcd31]
 7: ()
 8: clone()

2020-09-27T18:26:54.489+0000 7f1228a35700 -1 *** Caught signal (Aborted) **
 in thread 7f1228a35700 thread_name:ms_dispatch

 ceph version 16.0.0-5932-g53f857cc (53f857cc0602dd77284fd703c292252370ced0a5) pacific (dev)
 1: ()
 2: gsignal()
 3: abort()
 4: (ceph::__ceph_abort(char const*, int, char const*, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&)+0x1b6) [0x7f125b204db9]
 5: ()
 6: ()
 7: ()
 8: (DispatchQueue::entry()+0x126a) [0x7f125b42db1a]
 9: (DispatchQueue::DispatchThread::entry()+0x11) [0x7f125b4dcd31]
 10: ()
 11: clone()
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

From: /ceph/teuthology-archive/pdonnell-2020-09-26_05:47:56-fs-wip-pdonnell-testing-20200926.000836-distro-basic-smithi/5471511/remote/smithi110/log/ceph-mgr.y.log.gz

This seems to coincide with fs/snaps workunit and the rbd snap scheduler.

History

#2 Updated by Patrick Donnelly 8 months ago

  • Project changed from Ceph to CephFS
  • Labels (FS) qa, qa-failure added

Putting this in the fs project for now.

#3 Updated by Patrick Donnelly 5 months ago

  • Target version changed from v16.0.0 to v17.0.0
  • Backport set to pacific,octopus,nautilus

Also available in: Atom PDF