Project

General

Profile

Actions

Bug #38294

closed

osd/PG.cc: 6141: FAILED ceph_assert(info.history.same_interval_since != 0) during upgrade

Added by Sage Weil about 5 years ago. Updated about 5 years ago.

Status:
Resolved
Priority:
Urgent
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

2019-02-13T11:18:31.075 INFO:tasks.ceph.osd.7.smithi053.stderr:/build/ceph-14.0.1-2906-gec352ea/src/osd/PG.cc: In function 'void PG::start_peering_interval(OSDMapRef, const std::vector<int>&, int, const std::vector<int>&, int, ObjectStore::Transaction*)' thread 7f8638607700 time 2019-02-13 11:18:31.075208
2019-02-13T11:18:31.076 INFO:tasks.ceph.osd.7.smithi053.stderr:/build/ceph-14.0.1-2906-gec352ea/src/osd/PG.cc: 6141: FAILED ceph_assert(info.history.same_interval_since != 0)
2019-02-13T11:18:31.079 INFO:teuthology.orchestra.run.smithi103:Running:
2019-02-13T11:18:31.079 INFO:teuthology.orchestra.run.smithi103:> sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 30 ceph --cluster ceph --admin-daemon /var/run/ceph/ceph-osd.1.asok dump_blocked_ops
2019-02-13T11:18:31.081 INFO:tasks.ceph.osd.7.smithi053.stderr: ceph version 14.0.1-2906-gec352ea (ec352eafa6077a434a767de72d6c5b903e8abc16) nautilus (dev)
2019-02-13T11:18:31.082 INFO:tasks.ceph.osd.7.smithi053.stderr: 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x152) [0x833a24]
2019-02-13T11:18:31.082 INFO:tasks.ceph.osd.7.smithi053.stderr: 2: (ceph::__ceph_assertf_fail(char const*, char const*, int, char const*, char const*, ...)+0) [0x833bff]
2019-02-13T11:18:31.082 INFO:tasks.ceph.osd.7.smithi053.stderr: 3: (PG::start_peering_interval(std::shared_ptr<OSDMap const>, std::vector<int, std::allocator<int> > const&, int, std::vector<int, std::allocator<int> > const&, int, ObjectStore::Transaction*)+0x1607) [0xa26b37]
2019-02-13T11:18:31.082 INFO:tasks.ceph.osd.7.smithi053.stderr: 4: (PG::RecoveryState::Reset::react(PG::AdvMap const&)+0x1a6) [0xa2a436]
2019-02-13T11:18:31.083 INFO:tasks.ceph.osd.7.smithi053.stderr: 5: (boost::statechart::simple_state<PG::RecoveryState::Reset, PG::RecoveryState::RecoveryMachine, boost::mpl::list<mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na>, (boost::statechart::history_mode)0>::react_impl(boost::statechart::event_base const&, void const*)+0x140) [0xa8b850]
2019-02-13T11:18:31.083 INFO:tasks.ceph.osd.7.smithi053.stderr: 6: (PG::handle_advance_map(std::shared_ptr<OSDMap const>, std::shared_ptr<OSDMap const>, std::vector<int, std::allocator<int> >&, int, std::vector<int, std::allocator<int> >&, int, PG::RecoveryCtx*)+0x256) [0xa295f6]
2019-02-13T11:18:31.083 INFO:tasks.ceph.osd.7.smithi053.stderr: 7: (OSD::advance_pg(unsigned int, PG*, ThreadPool::TPHandle&, PG::RecoveryCtx*)+0x2d1) [0x98b141]
2019-02-13T11:18:31.083 INFO:tasks.ceph.osd.7.smithi053.stderr: 8: (OSD::dequeue_peering_evt(OSDShard*, PG*, std::shared_ptr<PGPeeringEvent>, ThreadPool::TPHandle&)+0xa6) [0x98cae6]
2019-02-13T11:18:31.083 INFO:tasks.ceph.osd.7.smithi053.stderr: 9: (PGPeeringItem::run(OSD*, OSDShard*, boost::intrusive_ptr<PG>&, ThreadPool::TPHandle&)+0x50) [0xc075c0]

/a/sage-2019-02-13_03:14:34-upgrade:luminous-x-wip-v2-upgrade-distro-basic-smithi/3582461

Related issues 1 (0 open1 closed)

Has duplicate RADOS - Bug #37654: FAILED ceph_assert(info.history.same_interval_since != 0) in PG::start_peering_interval()Resolvedxie xingguo12/13/2018

Actions
Actions #1

Updated by Sage Weil about 5 years ago

  • Status changed from 12 to Rejected

no logs

Actions #2

Updated by Sage Weil about 5 years ago

  • Status changed from Rejected to In Progress

/a/sage-2019-03-17_00:28:04-upgrade:luminous-x-wip-sage4-testing-2019-03-16-1713-distro-basic-smithi/3737326

pg 1.6 was exported from luminous and imported to nautilus.

I think we should not attempt to support this...there were too many hairy changes in the way imports/exports and past intervals work between L and N.

Actions #3

Updated by Sage Weil about 5 years ago

  • Status changed from In Progress to Fix Under Review
Actions #4

Updated by Kefu Chai about 5 years ago

  • Pull request ID set to 27018
Actions #5

Updated by Sage Weil about 5 years ago

  • Status changed from Fix Under Review to Resolved
Actions #6

Updated by Greg Farnum over 4 years ago

  • Has duplicate Bug #37654: FAILED ceph_assert(info.history.same_interval_since != 0) in PG::start_peering_interval() added
Actions

Also available in: Atom PDF