Project

General

Profile

Actions

Bug #58496

open

osd/PeeringState: FAILED ceph_assert(!acting_recovery_backfill.empty())

Added by Laura Flores over 1 year ago. Updated about 1 year ago.

Status:
Pending Backport
Priority:
Urgent
Category:
-
Target version:
-
% Done:

0%

Source:
Tags:
backport_processed
Backport:
reef
Regression:
Yes
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

/a/yuriw-2023-01-12_20:11:41-rados-main-distro-default-smithi/7138659

2023-01-13T14:56:19.348 INFO:tasks.ceph.osd.4.smithi120.stderr:/build/ceph-18.0.0-1762-gcb17f286/src/osd/PeeringState.cc: In function 'void PeeringState::update_calc_stats()' thread 7fc997d11700 time 2023-01-13T14:56:19.348816+0000
2023-01-13T14:56:19.349 INFO:tasks.ceph.osd.4.smithi120.stderr:/build/ceph-18.0.0-1762-gcb17f286/src/osd/PeeringState.cc: 3553: FAILED ceph_assert(!acting_recovery_backfill.empty())
2023-01-13T14:56:19.364 INFO:tasks.ceph.osd.4.smithi120.stderr: ceph version 18.0.0-1762-gcb17f286 (cb17f286272f7ae9dbdf8117ca7b077b0a5cf650) reef (dev)
2023-01-13T14:56:19.364 INFO:tasks.ceph.osd.4.smithi120.stderr: 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x128) [0x55d542083f11]
2023-01-13T14:56:19.364 INFO:tasks.ceph.osd.4.smithi120.stderr: 2: ceph-osd(+0xcd10cd) [0x55d5420840cd]
2023-01-13T14:56:19.365 INFO:tasks.ceph.osd.4.smithi120.stderr: 3: (PeeringState::update_calc_stats()+0x1c42) [0x55d54248ad32]
2023-01-13T14:56:19.365 INFO:tasks.ceph.osd.4.smithi120.stderr: 4: (PeeringState::prepare_stats_for_publish(std::optional<pg_stat_t> const&, object_stat_collection_t const&)+0xf8) [0x55d54248b2f8]
2023-01-13T14:56:19.365 INFO:tasks.ceph.osd.4.smithi120.stderr: 5: (PG::publish_stats_to_osd()+0xf7) [0x55d5421f3287]
2023-01-13T14:56:19.365 INFO:tasks.ceph.osd.4.smithi120.stderr: 6: (PgScrubber::clear_pgscrub_state()+0x99) [0x55d54237f4c9]
2023-01-13T14:56:19.365 INFO:tasks.ceph.osd.4.smithi120.stderr: 7: (PgScrubber::scrub_clear_state()+0x42) [0x55d54237f632]
2023-01-13T14:56:19.366 INFO:tasks.ceph.osd.4.smithi120.stderr: 8: (PgScrubber::on_primary_change(std::basic_string_view<char, std::char_traits<char> >, requested_scrub_t const&)+0x207) [0x55d542380f77]
2023-01-13T14:56:19.366 INFO:tasks.ceph.osd.4.smithi120.stderr: 9: (PeeringState::on_new_interval()+0x257) [0x55d54247af67]
2023-01-13T14:56:19.366 INFO:tasks.ceph.osd.4.smithi120.stderr: 10: (PeeringState::start_peering_interval(std::shared_ptr<OSDMap const>, std::vector<int, std::allocator<int> > const&, int, std::vector<int, std::allocator<int> > const&, int, ceph::os::Transaction&)+0x59f) [0x55d54247c67f]
2023-01-13T14:56:19.367 INFO:tasks.ceph.osd.4.smithi120.stderr: 11: (PeeringState::Reset::react(PeeringState::AdvMap const&)+0x26b) [0x55d54248eeeb]
2023-01-13T14:56:19.367 INFO:tasks.ceph.osd.4.smithi120.stderr: 12: (boost::statechart::simple_state<PeeringState::Reset, PeeringState::PeeringMachine, boost::mpl::list<mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na>, (boost::statechart::history_mode)0>::react_impl(boost::statechart::event_base const&, void const*)+0x1e1) [0x55d5424db091]
2023-01-13T14:56:19.367 INFO:tasks.ceph.osd.4.smithi120.stderr: 13: (boost::statechart::state_machine<PeeringState::PeeringMachine, PeeringState::Initial, std::allocator<boost::statechart::none>, boost::statechart::null_exception_translator>::process_queued_events()+0xd3) [0x55d5424c9db3]
2023-01-13T14:56:19.367 INFO:tasks.ceph.osd.4.smithi120.stderr: 14: (PeeringState::advance_map(std::shared_ptr<OSDMap const>, std::shared_ptr<OSDMap const>, std::vector<int, std::allocator<int> >&, int, std::vector<int, std::allocator<int> >&, int, PeeringCtx&)+0x29c) [0x55d54247996c]
2023-01-13T14:56:19.367 INFO:tasks.ceph.osd.4.smithi120.stderr: 15: (PG::handle_advance_map(std::shared_ptr<OSDMap const>, std::shared_ptr<OSDMap const>, std::vector<int, std::allocator<int> >&, int, std::vector<int, std::allocator<int> >&, int, PeeringCtx&)+0xfd) [0x55d5422042ed]
2023-01-13T14:56:19.368 INFO:tasks.ceph.osd.4.smithi120.stderr: 16: (OSD::advance_pg(unsigned int, PG*, ThreadPool::TPHandle&, PeeringCtx&)+0x364) [0x55d5421653f4]
2023-01-13T14:56:19.368 INFO:tasks.ceph.osd.4.smithi120.stderr: 17: (OSD::dequeue_peering_evt(OSDShard*, PG*, std::shared_ptr<PGPeeringEvent>, ThreadPool::TPHandle&)+0x299) [0x55d5421678f9]
2023-01-13T14:56:19.368 INFO:tasks.ceph.osd.4.smithi120.stderr: 18: (ceph::osd::scheduler::PGPeeringItem::run(OSD*, OSDShard*, boost::intrusive_ptr<PG>&, ThreadPool::TPHandle&)+0x59) [0x55d54244d519]
2023-01-13T14:56:19.368 INFO:tasks.ceph.osd.4.smithi120.stderr: 19: (OSD::ShardedOpWQ::_process(unsigned int, ceph::heartbeat_handle_d*)+0xb28) [0x55d54215f3c8]
2023-01-13T14:56:19.369 INFO:tasks.ceph.osd.4.smithi120.stderr: 20: (ShardedThreadPool::shardedthreadpool_worker(unsigned int)+0x434) [0x55d5428a1934]
2023-01-13T14:56:19.369 INFO:tasks.ceph.osd.4.smithi120.stderr: 21: (ShardedThreadPool::WorkThreadSharded::entry()+0x14) [0x55d5428a4a24]
2023-01-13T14:56:19.369 INFO:tasks.ceph.osd.4.smithi120.stderr: 22: /lib/x86_64-linux-gnu/libpthread.so.0(+0x8609) [0x7fc9bc9a1609]
2023-01-13T14:56:19.369 INFO:tasks.ceph.osd.4.smithi120.stderr: 23: clone()

The teuthology log for this failure was huge, but I managed to view it by copying a portion of it into a new file (cp teuthology.log ~/teuthology.log).

There is a coredump available at /a/yuriw-2023-01-12_20:11:41-rados-main-distro-default-smithi/7138659/remote/smithi120/coredump.


Related issues 1 (0 open1 closed)

Copied to RADOS - Backport #59005: reef: osd/PeeringState: FAILED ceph_assert(!acting_recovery_backfill.empty())ResolvedRonen FriedmanActions
Actions

Also available in: Atom PDF