Project

General

Profile

Actions

Bug #65842

open

unittest-seastore (Failed) on arm64

Added by Casey Bodley 12 days ago. Updated 12 days ago.

Status:
New
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

from https://jenkins.ceph.com/job/ceph-pull-requests-arm64/56090/consoleFull#772176351e840cee4-f4a4-4183-81dd-42855615f2c1

274/292 Test #260: unittest-seastore .........................***Failed  365.97 sec
...
INFO  2024-05-07 09:45:38,868 [shard 0:main] seastore_journal - SegmentAllocator::close_segment: Dev(0)_MD_G3 close segment segment_tail_t(Seg[Dev(0),12] OOL sseq(9), segment_nonce=1666304193, modify_time=mod_tp(NULL), num_extents=0), written_to=4096
INFO  2024-05-07 09:45:38,883 [shard 0:main] seastore_cleaner - segments_info_t::mark_closed: closing Seg[Dev(0),12], seg_info_t(state=OPEN, Seg[Dev(0),12] OOL sseq(9) MD GEN(3), modify_time=tp(NULL), num_extents=0, written_to=4096), num_segments(empty=114, opened=2, closed=12)
INFO  2024-05-07 09:45:38,884 [shard 0:main] seastore_cleaner - SegmentCleaner::close_segment: closed, SegmentCleaner(should_block_io_on_clean=0, should_clean=1, projected_avail_ratio=0.898434, reclaim_ratio=0.999737, alive_ratio=2.67029e-05) -- seg_info_t(state=CLOSED, Seg[Dev(0),12] OOL sseq(9) MD GEN(3), modify_time=tp(NULL), num_extents=0, written_to=4096)
INFO  2024-05-07 09:45:38,885 [shard 0:main] seastore_journal - SegmentAllocator::close_segment: Dev(0)_MD_G4 close segment segment_tail_t(Seg[Dev(0),13] OOL sseq(10), segment_nonce=1890166834, modify_time=mod_tp(NULL), num_extents=0), written_to=4096
INFO  2024-05-07 09:45:38,897 [shard 0:main] seastore_cleaner - segments_info_t::mark_closed: closing Seg[Dev(0),13], seg_info_t(state=OPEN, Seg[Dev(0),13] OOL sseq(10) MD GEN(4), modify_time=tp(NULL), num_extents=0, written_to=4096), num_segments(empty=114, opened=1, closed=13)
INFO  2024-05-07 09:45:38,897 [shard 0:main] seastore_cleaner - SegmentCleaner::close_segment: closed, SegmentCleaner(should_block_io_on_clean=0, should_clean=1, projected_avail_ratio=0.890625, reclaim_ratio=0.999756, alive_ratio=2.67029e-05) -- seg_info_t(state=CLOSED, Seg[Dev(0),13] OOL sseq(10) MD GEN(4), modify_time=tp(NULL), num_extents=0, written_to=4096)
INFO  2024-05-07 09:45:38,897 [shard 0:main] seastore_tm - TransactionManager::close: completed
Reactor stalled for 145 ms on shard 0. Backtrace: 0x6ed3223 0xce57867 0xce56d87 0xcc408f3 0xcc3aea3 0xcc3a96f 0xcc3b303 0xcc41d73 0x7db 0xcb28603 0xcb280f3 0xcb27be7 0xcb33f17 0xcb3353f 0xcb330f7 0xcb098d7 0xcaf5a97 0xcaf1007 0xcaf0d9b 0xcaf117f 0xcb1a217 0xcb01b17 0xcae82e7 0xcae8243 0x86276c7 0x85e2b4b 0x8435807 0x82f8203 0x82f826f 0x7ef6653 0x7ef6237 0x7b3217f 0x76e23db 0x76e227f 0x76e1d4b 0x76e1ad7 0x76e152b 0x76e136f 0x7b69c1b 0x7b6993f 0x7b6938b 0x7b6916b 0x7b68e07 0xcc62083 0xcc6ef57 0xcc745c3 0xcc7280b 0xc9f312f 0xc9f0bdf 0xc9f38eb 0x7553c3f 0x7553a43 0x75538df 0x755356b 0x755334b 0x75530b3 0x7552c5b 0xdbc6b 0x7d5c7 0xe5edb
AddressSanitizer:DEADLYSIGNAL
=================================================================
==585578==ERROR: AddressSanitizer: SEGV on unknown address 0x000100000001 (pc 0xffff8ed7f8b0 bp 0xffff86fee210 sp 0xffff86fee210 T1)
==585578==The signal is caused by a READ memory access.
AddressSanitizer:DEADLYSIGNAL
AddressSanitizer: nested bug in the same thread, aborting.


Related issues 1 (1 open0 closed)

Related to crimson - Bug #65635: Crimson seastore unit test random failure on AARCH64 (DEADLYSIGNAL by caused by a READ memory access)New

Actions
Actions #1

Updated by Casey Bodley 12 days ago

similar arm64 failure from unittest-staged-fltree in https://jenkins.ceph.com/job/ceph-pull-requests-arm64/55826/consoleFull#1749163831e840cee4-f4a4-4183-81dd-42855615f2c1

[----------] 4 tests from d_seastore_tm_test/d_seastore_tm_test_t
[ RUN      ] d_seastore_tm_test/d_seastore_tm_test_t.6_random_tree_insert_erase/0
INFO  2024-04-30 08:00:07,236 [shard 0:main] test - setup started...
INFO  2024-04-30 08:00:07,237 [shard 0:main] test - EphemeralTestState::tm_setup: begin with 1 devices ...
INFO  2024-04-30 08:00:07,237 [shard 0:main] seastore_device - Initing ephemeral segment manager with config ephemeral_config_t(size=1073741824, block_size=4096, segment_size=8388608)
INFO  2024-04-30 08:00:08,706 [shard 0:main] seastore_device - Mkfs ephemeral segment manager with device_config_t(major_dev=1, spec=device_spec(magic=43981, dtype=EPHEMERAL_MAIN, Dev(0)), meta=00000000-0000-0000-0000-000000000000, secondary())
INFO  2024-04-30 08:00:08,708 [shard 0:main] seastore_cache - Cache::Cache: created, lru_size=67108864
Reactor stalled for 79 ms on shard 0. Backtrace: 0x6cdc31b 0xc8a2143 0xc8a1663 0xc689e6b 0xc68441b 0xc683ee7 0xc68487b 0xc68b2eb 0x7db 0x731688f 0x7316adf 0xc2117f3 0xc56408f 0xc56377f 0xc5715c3 0xc5710d7 0xc570697 0xc570213 0xc56fd07 0xc57c037 0xc57b9bf 0xc59029b 0xc58f78f 0xc58f19b 0xc55bd23 0xc53adff 0xc539a3b 0xc53c57f 0xc52fb7b 0x841f337 0x84141d3 0x822c26f 0x80e597f 0x7093e53 0x72fdcf7 0x72fdb77 0x72fdb0f 0x72fda6b 0x72fd9df 0x72601b7 0x7260013 0x725fddf 0x725fbbb 0x725f983 0x725f8db 0x725f633 0x725f40f 0x725ef3b 0xc6ab5fb 0xc6b84cf 0xc6bdb3b 0xc6bbd83 0xc4396af 0xc43715f 0xc439e6b 0x7f365eb 0x7f363ef 0x7f3628b 0x7f35f17 0x7f35cf7 0x7f35a5f 0x7f35607 0xda9fb 0x7d5c7 0xe5edb
Reactor stalled for 155 ms on shard 0. Backtrace: 0x6cdc31b 0xc8a2143 0xc8a1663 0xc689e6b 0xc68441b 0xc683ee7 0xc68487b 0xc68b2eb 0x7db 0x6d44977 0x6ca7dd7 0x6ca7a5b 0x6d1d847 0x8526ebb 0xc300b5b 0xc300a37 0xc30095f 0xc3006d3 0xc3004c3 0xc30037b 0xc2ffe6f 0xc2fef57 0xc2feb63 0xc2fe607 0xc2fde37 0xc2fdc7b 0xc5a1157 0xc5a0fa3 0xc59fdff 0xc54fa4f 0xc539aaf 0xc53c57f 0xc52fb7b 0x842265b 0x84141d3 0x822c26f 0x80e597f 0x7093e53 0x72fdcf7 0x72fdb77 0x72fdb0f 0x72fda6b 0x72fd9df 0x72601b7 0x7260013 0x725fddf 0x725fbbb 0x725f983 0x725f8db 0x725f633 0x725f40f 0x725ef3b 0xc6ab5fb 0xc6b84cf 0xc6bdb3b 0xc6bbd83 0xc4396af 0xc43715f 0xc439e6b 0x7f365eb 0x7f363ef 0x7f3628b 0x7f35f17 0x7f35cf7 0x7f35a5f 0x7f35607 0xda9fb 0x7d5c7 0xe5edb
AddressSanitizer:DEADLYSIGNAL
=================================================================
==3601409==ERROR: AddressSanitizer: SEGV on unknown address 0x00019d9e6360 (pc 0xffffa576f890 bp 0xffff9d9e1190 sp 0xffff9d9e1190 T1)
==3601409==The signal is caused by a READ memory access.
AddressSanitizer:DEADLYSIGNAL
AddressSanitizer: nested bug in the same thread, aborting.

Actions #2

Updated by Yingxin Cheng 12 days ago

  • Related to Bug #65635: Crimson seastore unit test random failure on AARCH64 (DEADLYSIGNAL by caused by a READ memory access) added
Actions

Also available in: Atom PDF