Project

General

Profile

Actions

Bug #55141

open

thrashers/fastread: assertion failure: rollback_info_trimmed_to == head

Added by Radoslaw Zarzynski about 2 years ago. Updated about 2 months ago.

Status:
In Progress
Priority:
Normal
Category:
-
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
quincy,reef,squid
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

From the /home/teuthworker/archive/yuriw-2022-03-29_21:35:32-rados-wip-yuri5-testing-2022-03-29-1152-quincy-distro-default-smithi/6767850/teuthology.log:

2022-03-30T01:34:59.499 INFO:tasks.ceph.osd.2.smithi080.stderr:/home/jenkins-build/build/workspace/ceph-dev-new-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/17.1.0-125-g9053ed98/rpm/el8/BUILD/ceph-17.1.0-125-g9053ed98/src/osd/PGLog.h: In function 'void PGLog::IndexedLog::claim_log_and_clear_rollback_info(const pg_log_t&)' thread 7fa658b53700 time 2022-03-30T01:34:59.459273+0000
2022-03-30T01:34:59.499 INFO:tasks.ceph.osd.2.smithi080.stderr:/home/jenkins-build/build/workspace/ceph-dev-new-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/17.1.0-125-g9053ed98/rpm/el8/BUILD/ceph-17.1.0-125-g9053ed98/src/osd/PGLog.h: 286: FAILED ceph_assert(rollback_info_trimmed_to == head)
2022-03-30T01:34:59.499 INFO:tasks.ceph.osd.2.smithi080.stderr: ceph version 17.1.0-125-g9053ed98 (9053ed984698b7140d91d3195fcba61aa554fe69) quincy (stable)
2022-03-30T01:34:59.499 INFO:tasks.ceph.osd.2.smithi080.stderr: 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x152) [0x55d6569c2464]
2022-03-30T01:34:59.500 INFO:tasks.ceph.osd.2.smithi080.stderr: 2: ceph-osd(+0x5d7685) [0x55d6569c2685]
2022-03-30T01:34:59.500 INFO:tasks.ceph.osd.2.smithi080.stderr: 3: (PeeringState::Stray::react(MLogRec const&)+0x3d0) [0x55d656de5390]
2022-03-30T01:34:59.500 INFO:tasks.ceph.osd.2.smithi080.stderr: 4: (boost::statechart::simple_state<PeeringState::Stray, PeeringState::Started, boost::mpl::list<mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na>, (boost::statechart::history_mode)0>::react_impl(boost::statechart::event_base const&, void const*)+0x280) [0x55d656e1a6c0]
2022-03-30T01:34:59.500 INFO:tasks.ceph.osd.2.smithi080.stderr: 5: (boost::statechart::state_machine<PeeringState::PeeringMachine, PeeringState::Initial, std::allocator<boost::statechart::none>, boost::statechart::null_exception_translator>::process_event(boost::statechart::event_base const&)+0x74) [0x55d656b90c54]
2022-03-30T01:34:59.500 INFO:tasks.ceph.osd.2.smithi080.stderr: 6: (PG::do_peering_event(std::shared_ptr<PGPeeringEvent>, PeeringCtx&)+0x2d6) [0x55d656b84ea6]
2022-03-30T01:34:59.501 INFO:tasks.ceph.osd.2.smithi080.stderr: 7: (OSD::dequeue_peering_evt(OSDShard*, PG*, std::shared_ptr<PGPeeringEvent>, ThreadPool::TPHandle&)+0x175) [0x55d656afa3c5]
2022-03-30T01:34:59.501 INFO:tasks.ceph.osd.2.smithi080.stderr: 8: (ceph::osd::scheduler::PGPeeringItem::run(OSD*, OSDShard*, boost::intrusive_ptr<PG>&, ThreadPool::TPHandle&)+0x56) [0x55d656d915b6]
2022-03-30T01:34:59.501 INFO:tasks.ceph.osd.2.smithi080.stderr: 9: (OSD::ShardedOpWQ::_process(unsigned int, ceph::heartbeat_handle_d*)+0xaf8) [0x55d656aec0e8]
2022-03-30T01:34:59.501 INFO:tasks.ceph.osd.2.smithi080.stderr: 10: (ShardedThreadPool::shardedthreadpool_worker(unsigned int)+0x5c4) [0x55d6571f1a64]
2022-03-30T01:34:59.502 INFO:tasks.ceph.osd.2.smithi080.stderr: 11: (ShardedThreadPool::WorkThreadSharded::entry()+0x14) [0x55d6571f2e04]
2022-03-30T01:34:59.502 INFO:tasks.ceph.osd.2.smithi080.stderr: 12: /lib64/libpthread.so.0(+0x817f) [0x7fa684d7017f]
2022-03-30T01:34:59.502 INFO:tasks.ceph.osd.2.smithi080.stderr: 13: clone()
2022-03-30T01:34:59.502 INFO:tasks.ceph.osd.2.smithi080.stderr:*** Caught signal (Aborted) **

Related issues 2 (1 open1 closed)

Related to RADOS - Bug #60084: crash: void std::list<pg_log_entry_t, mempool::pool_allocator<(mempool::pool_index_t), pg_log_entry_t> >::_M_insert<pg_log_entry_t const&>(std::_List_iterator<pg_log_entry_t>, pg_log_entry_t const&)New

Actions
Has duplicate RADOS - Bug #57913: Thrashosd: timeout 120 ceph --cluster ceph osd pool rm unique_pool_2 unique_pool_2 --yes-i-really-really-mean-itDuplicateNitzan Mordechai

Actions
Actions

Also available in: Atom PDF