Bug #20694

closed

osd/ReplicatedBackend.cc: 1417: FAILED assert(get_parent()->get_log().get_log().objects.count(soid) && (get_parent()->get_log().get_log().objects.find(soid)->second->op == pg_log_entry_t::LOST_REVERT) && (get_parent()->get_log().get_log().objects.find( s

Added by Sage Weil over 6 years ago. Updated over 4 years ago.

Status: Can't reproduce
Priority: Normal
Assignee: -
Category: -
Target version: -
% Done: 0%
Source:
Tags:
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

2017-07-19T16:30:46.499 INFO:tasks.ceph.osd.4.smithi035.stderr:/home/jenkins-build/build/workspace/ceph-dev-new-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos7/DIST/centos7/MACHINE_SIZE/huge/release/12.1.1-123-ga47dc55/rpm/el7/BUILD/ceph-12.1.1-123-ga47dc55/src/osd/ReplicatedBackend.cc: In function 'void ReplicatedBackend::prepare_pull(eversion_t, const hobject_t&, ObjectContextRef, ReplicatedBackend::RPGHandle*)' thread 7f554633a700 time 2017-07-19 16:30:46.498683
2017-07-19T16:30:46.499 INFO:tasks.ceph.osd.4.smithi035.stderr:/home/jenkins-build/build/workspace/ceph-dev-new-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos7/DIST/centos7/MACHINE_SIZE/huge/release/12.1.1-123-ga47dc55/rpm/el7/BUILD/ceph-12.1.1-123-ga47dc55/src/osd/ReplicatedBackend.cc: 1417: FAILED assert(get_parent()->get_log().get_log().objects.count(soid) && (get_parent()->get_log().get_log().objects.find(soid)->second->op == pg_log_entry_t::LOST_REVERT) && (get_parent()->get_log().get_log().objects.find( soid)->second->reverting_to == v))
2017-07-19T16:30:46.512 INFO:tasks.ceph.osd.2.smithi062.stderr:2017-07-19 16:30:46.512340 7f6f3effa700 -1 received  signal: Hangup from  PID: 1322 task name: /usr/bin/python /usr/bin/daemon-helper kill ceph-osd -f --cluster ceph -i 2  UID: 0
2017-07-19T16:30:46.520 INFO:tasks.ceph.osd.4.smithi035.stderr: ceph version 12.1.1-123-ga47dc55 (a47dc55edb6dbc0036d7efb029e223bbcf851a56) luminous (rc)
2017-07-19T16:30:46.520 INFO:tasks.ceph.osd.4.smithi035.stderr: 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x110) [0x7f5566d64d40]
2017-07-19T16:30:46.520 INFO:tasks.ceph.osd.4.smithi035.stderr: 2: (ReplicatedBackend::prepare_pull(eversion_t, hobject_t const&, std::shared_ptr<ObjectContext>, ReplicatedBackend::RPGHandle*)+0x779) [0x7f5566ab5789]
2017-07-19T16:30:46.520 INFO:tasks.ceph.osd.4.smithi035.stderr: 3: (ReplicatedBackend::recover_object(hobject_t const&, eversion_t, std::shared_ptr<ObjectContext>, std::shared_ptr<ObjectContext>, PGBackend::RecoveryHandle*)+0x25e) [0x7f5566ab82ee]
2017-07-19T16:30:46.520 INFO:tasks.ceph.osd.4.smithi035.stderr: 4: (PrimaryLogPG::recover_missing(hobject_t const&, eversion_t, int, PGBackend::RecoveryHandle*)+0x5be) [0x7f5566964e5e]
2017-07-19T16:30:46.521 INFO:tasks.ceph.osd.4.smithi035.stderr: 5: (PrimaryLogPG::maybe_kick_recovery(hobject_t const&)+0x30d) [0x7f5566965f3d]
2017-07-19T16:30:46.521 INFO:tasks.ceph.osd.4.smithi035.stderr: 6: (PrimaryLogPG::wait_for_unreadable_object(hobject_t const&, boost::intrusive_ptr<OpRequest>)+0x44) [0x7f5566966104]
2017-07-19T16:30:46.521 INFO:tasks.ceph.osd.4.smithi035.stderr: 7: (PrimaryLogPG::do_op(boost::intrusive_ptr<OpRequest>&)+0x1d7f) [0x7f556699c12f]
2017-07-19T16:30:46.521 INFO:tasks.ceph.osd.4.smithi035.stderr: 8: (PrimaryLogPG::do_request(boost::intrusive_ptr<OpRequest>&, ThreadPool::TPHandle&)+0xeba) [0x7f556695a55a]
2017-07-19T16:30:46.521 INFO:tasks.ceph.osd.4.smithi035.stderr: 9: (OSD::dequeue_op(boost::intrusive_ptr<PG>, boost::intrusive_ptr<OpRequest>, ThreadPool::TPHandle&)+0x3f9) [0x7f55667fa269]
2017-07-19T16:30:46.521 INFO:tasks.ceph.osd.4.smithi035.stderr: 10: (PGQueueable::RunVis::operator()(boost::intrusive_ptr<OpRequest> const&)+0x57) [0x7f5566a544d7]
2017-07-19T16:30:46.521 INFO:tasks.ceph.osd.4.smithi035.stderr: 11: (OSD::ShardedOpWQ::_process(unsigned int, ceph::heartbeat_handle_d*)+0xfce) [0x7f5566824e3e]
2017-07-19T16:30:46.521 INFO:tasks.ceph.osd.4.smithi035.stderr: 12: (ShardedThreadPool::shardedthreadpool_worker(unsigned int)+0x8e9) [0x7f5566d6a889]
2017-07-19T16:30:46.521 INFO:tasks.ceph.osd.4.smithi035.stderr: 13: (ShardedThreadPool::WorkThreadSharded::entry()+0x10) [0x7f5566d6ca10]
2017-07-19T16:30:46.521 INFO:tasks.ceph.osd.4.smithi035.stderr: 14: (()+0x7dc5) [0x7f5563b75dc5]

/a/sage-2017-07-19_15:27:16-rados-wip-sage-testing2-distro-basic-smithi/1419399
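
Read literally, the failed assertion requires three things at once: the PG log's per-object index has an entry for soid, that entry's op is LOST_REVERT, and its reverting_to equals the version v passed to prepare_pull(). The sketch below restates that condition with simplified stand-in types. Version, LogEntry, ObjectIndex and pull_version_is_justified are hypothetical names for illustration only, not Ceph's actual definitions, and the sketch says nothing about why the condition failed in this run.

```cpp
#include <cstdint>
#include <map>
#include <string>

// Hypothetical stand-in for eversion_t (assumption: only equality matters here).
struct Version {
  uint64_t epoch;
  uint64_t version;
  bool operator==(const Version& o) const {
    return epoch == o.epoch && version == o.version;
  }
};

// Hypothetical stand-in for pg_log_entry_t, reduced to the two fields
// the assertion inspects.
struct LogEntry {
  enum Op { MODIFY, DELETE, LOST_REVERT };
  Op op;
  Version reverting_to;
};

// Hypothetical stand-in for the log's "objects" index: object id -> entry.
using ObjectIndex = std::map<std::string, const LogEntry*>;

// Returns true exactly when all three conjuncts of the assertion hold:
//   1. the index contains soid,
//   2. the entry's op is LOST_REVERT,
//   3. the entry's reverting_to equals v.
bool pull_version_is_justified(const ObjectIndex& objects,
                               const std::string& soid,
                               const Version& v) {
  auto it = objects.find(soid);
  if (it == objects.end()) {
    return false;  // conjunct 1 fails: no entry for this object
  }
  const LogEntry* entry = it->second;
  return entry->op == LogEntry::LOST_REVERT && entry->reverting_to == v;
}
```

In this model the assert fires whenever pull_version_is_justified() returns false, so any one of a missing entry, a non-LOST_REVERT op, or a mismatched reverting_to is enough to trigger it.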

Related issues 1 (0 open, 1 closed)

Has duplicate RADOS - Bug #20551: LOST_REVERT assert during rados bench+thrash in ReplicatedBackend::prepare_pull() - Duplicate - 07/07/2017

Actions #1

Updated by Kefu Chai over 6 years ago

/a/kchai-2017-07-20_03:05:27-rados-wip-kefu-testing-distro-basic-mira/1422161

$ zless remote/mira104/log/ceph-osd.2.log.gz

Actions #2

Updated by Kefu Chai over 6 years ago

  • Has duplicate Bug #20551: LOST_REVERT assert during rados bench+thrash in ReplicatedBackend::prepare_pull() added

Actions #3

Updated by Sage Weil over 5 years ago

/a/sage-2018-09-06_16:02:58-rados-wip-sage-testing-2018-09-05-1559-distro-basic-smithi/2985475

Actions #4

Updated by Neha Ojha over 5 years ago

Seen in mimic: /a/yuriw-2018-09-10_16:59:58-rados-wip-yuri-testing-2018-09-10-1525-mimic-distro-basic-smithi/3002608/

Actions #5

Updated by Neha Ojha over 5 years ago

/a/yuriw-2018-10-25_15:31:28-rados-wip-yuri4-testing-2018-10-24-2310-mimic-distro-basic-smithi/3183476/

Actions #6

Updated by Josh Durgin over 5 years ago

/a/dzafman-2018-12-14_11:02:20-rados-wip-zafman-testing-distro-basic-smithi/3362388

Actions #7

Updated by Greg Farnum over 4 years ago

  • Status changed from 12 to Can't reproduce
  • Priority changed from High to Normal

Sam changed this with his PeeringStateMachine refactor. :D
