Bug #47678
mgr: include/interval_set.h: 466: ceph_abort_msg("abort() called")
Status:
New
Priority:
Urgent
Assignee:
-
Category:
-
Target version:
-
% Done:
0%
Source:
Q/A
Tags:
Backport:
pacific,octopus,nautilus
Regression:
No
Severity:
2 - major
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS):
Labels (FS):
qa, qa-failure
Pull request ID:
Crash signature (v1):
Crash signature (v2):
Description
2020-09-27T18:26:54.485+0000 7f122522e700 0 [rbd_support DEBUG root] TrashPurgeScheduleHandler: refresh_pools 2020-09-27T18:26:54.485+0000 7f122522e700 0 [rbd_support INFO root] TrashPurgeScheduleHandler: load_schedules 2020-09-27T18:26:54.485+0000 7f122522e700 20 mgr get_config key: mgr/rbd_support/y/trash_purge_schedule 2020-09-27T18:26:54.485+0000 7f122522e700 20 mgr get_config key: mgr/rbd_support/trash_purge_schedule 2020-09-27T18:26:54.485+0000 7f1226a31700 0 [rbd_support DEBUG root] MirrorSnapshotScheduleHandler: refresh_images 2020-09-27T18:26:54.485+0000 7f1226a31700 0 [rbd_support INFO root] MirrorSnapshotScheduleHandler: load_schedules 2020-09-27T18:26:54.485+0000 7f1226a31700 20 mgr get_config key: mgr/rbd_support/y/mirror_snapshot_schedule 2020-09-27T18:26:54.485+0000 7f122522e700 10 mgr get_typed_config [y/]trash_purge_schedule not found 2020-09-27T18:26:54.485+0000 7f1226a31700 20 mgr get_config key: mgr/rbd_support/mirror_snapshot_schedule 2020-09-27T18:26:54.486+0000 7f122522e700 0 [rbd_support INFO root] load_schedules: rbd, start_after= 2020-09-27T18:26:54.486+0000 7f1226a31700 10 mgr get_typed_config [y/]mirror_snapshot_schedule not found 2020-09-27T18:26:54.487+0000 7f122522e700 1 -- 172.21.15.110:0/2985789914 --> [v2:172.21.15.110:6826/34265,v1:172.21.15.110:6827/34265] -- osd_op(unknown.0.0:13 2.1 2:88c1567c:::rbd_trash_purge_schedule:head [omap-get-vals in=16b] snapc 0=[] ondisk+read+known_if_redirected e42) v8 -- 0x55b835afc000 con 0x55b835c88800 2020-09-27T18:26:54.487+0000 7f1226a31700 0 [rbd_support INFO root] load_schedules: rbd, start_after= 2020-09-27T18:26:54.487+0000 7f1226a31700 1 -- 172.21.15.110:0/2985789914 --> [v2:172.21.15.42:6816/34436,v1:172.21.15.42:6817/34436] -- osd_op(unknown.0.0:14 2.2 2:4e99cc3e:::rbd_mirror_snapshot_schedule:head [omap-get-vals in=16b] snapc 0=[] ondisk+read+known_if_redirected e42) v8 -- 0x55b83613ec00 con 0x55b835c89400 2020-09-27T18:26:54.488+0000 7f1228a35700 1 -- 172.21.15.110:0/2985789914 <== osd.5 v2:172.21.15.42:6816/34436 11 ==== osd_map(43..56 src has 1..56) v4 ==== 7684+0+0 (secure 0 0 0) 0x55b835f44000 con 0x55b835c89400 2020-09-27T18:26:54.488+0000 7f1254515700 1 -- 172.21.15.110:0/2985789914 <== osd.5 v2:172.21.15.42:6816/34436 12 ==== osd_op_reply(14 rbd_mirror_snapshot_schedule [omap-get-vals] v0'0 uv0 ondisk = -2 ((2) No such file or directory)) v8 ==== 172+0+0 (secure 0 0 0) 0x55b835cd38c0 con 0x55b835c89400 2020-09-27T18:26:54.488+0000 7f1255517700 1 -- 172.21.15.110:0/2985789914 <== osd.2 v2:172.21.15.110:6826/34265 12 ==== osd_op_reply(13 rbd_trash_purge_schedule [omap-get-vals] v0'0 uv0 ondisk = -2 ((2) No such file or directory)) v8 ==== 168+0+0 (secure 0 0 0) 0x55b8360bd440 con 0x55b835c88800 2020-09-27T18:26:54.489+0000 7f1228a35700 -1 /home/jenkins-build/build/workspace/ceph-dev-new-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/16.0.0-5932-g53f857cc/rpm/el8/BUILD/ceph-16.0.0-5932-g53f857cc/src/include/interval_set.h: In function 'void interval_set<T, C>::insert(T, T, T*, T*) [with T = snapid_t; C = mempool::osdmap::flat_map]' thread 7f1228a35700 time 2020-09-27T18:26:54.489696+0000 /home/jenkins-build/build/workspace/ceph-dev-new-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/16.0.0-5932-g53f857cc/rpm/el8/BUILD/ceph-16.0.0-5932-g53f857cc/src/include/interval_set.h: 466: ceph_abort_msg("abort() called") ceph version 16.0.0-5932-g53f857cc (53f857cc0602dd77284fd703c292252370ced0a5) pacific (dev) 1: (ceph::__ceph_abort(char const*, int, char const*, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&)+0xe5) [0x7f125b204ce8] 2: () 3: () 4: () 5: (DispatchQueue::entry()+0x126a) [0x7f125b42db1a] 6: (DispatchQueue::DispatchThread::entry()+0x11) [0x7f125b4dcd31] 7: () 8: clone() 2020-09-27T18:26:54.489+0000 7f1228a35700 -1 *** Caught signal (Aborted) ** in thread 7f1228a35700 thread_name:ms_dispatch ceph version 16.0.0-5932-g53f857cc (53f857cc0602dd77284fd703c292252370ced0a5) pacific (dev) 1: () 2: gsignal() 3: abort() 4: (ceph::__ceph_abort(char const*, int, char const*, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&)+0x1b6) [0x7f125b204db9] 5: () 6: () 7: () 8: (DispatchQueue::entry()+0x126a) [0x7f125b42db1a] 9: (DispatchQueue::DispatchThread::entry()+0x11) [0x7f125b4dcd31] 10: () 11: clone() NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
From: /ceph/teuthology-archive/pdonnell-2020-09-26_05:47:56-fs-wip-pdonnell-testing-20200926.000836-distro-basic-smithi/5471511/remote/smithi110/log/ceph-mgr.y.log.gz
This seems to coincide with fs/snaps workunit and the rbd snap scheduler.
History
#1 Updated by Patrick Donnelly almost 3 years ago
https://pulpito.ceph.com/teuthology-2020-09-21_04:15:02-multimds-master-distro-basic-smithi/5454314/
Seems to be a recent master regression.
#2 Updated by Patrick Donnelly almost 3 years ago
- Project changed from Ceph to CephFS
- Labels (FS) qa, qa-failure added
Putting this in the fs project for now.
#3 Updated by Patrick Donnelly over 2 years ago
- Target version changed from v16.0.0 to v17.0.0
- Backport set to pacific,octopus,nautilus
#4 Updated by Patrick Donnelly about 1 year ago
- Target version deleted (
v17.0.0)
#5 Updated by Milind Changire 4 months ago
- Related to Bug #61008: crash: void interval_set<T, C>::insert(T, T, T*, T*) [with T = inodeno_t; C = std::map]: abort added
#6 Updated by Milind Changire 3 months ago
- Related to deleted (Bug #61008: crash: void interval_set<T, C>::insert(T, T, T*, T*) [with T = inodeno_t; C = std::map]: abort)