Project

General

Profile

Actions

Bug #20167

closed

osd/PG.cc: 806: FAILED assert(past_intervals.empty())

Added by Sage Weil almost 7 years ago. Updated almost 7 years ago.

Status:
Resolved
Priority:
High
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

    -2> 2017-06-02 13:35:41.966528 7fed8da4c700 -1 log_channel(cluster) log [ERR] : 4.37e62 required past_interval bounds are empty [38671,36979) but past_intervals is not: ([36971,36978] intervals=([36971,36978] acting 3959,4347,4791))
    -1> 2017-06-02 13:35:41.966544 7fed8da4c700 -1 osd.3959 pg_epoch: 41208 pg[4.37e62( v 36978'1495 (0'0,36978'1495] local-lis/les=36971/36978 n=747 ec=36971/36688 lis/c 36971/36971 les/c/f 36978/36978/0 36979/36979/36979) [2358,2154,690] r=-1 lpr=41207 pi=[36971,36979)/1 crt=36978'1495 lcod 0'0 unknown NOTIFY] 4.37e62 required past_interval bounds are empty [38671,36979) but past_intervals is not: ([36971,36978] intervals=([36971,36978] acting 3959,4347,4791))
     0> 2017-06-02 13:35:41.970220 7fed8da4c700 -1 /mnt/jenkins/workspace/ceph-dev-new-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos7/DIST/centos7/MACHINE_SIZE/huge/release/12.0.2-1883-gb3f5819/rpm/el7/BUILD/ceph-12.0.2-1883-gb3f5819/src/osd/PG.cc: In function 'void PG::check_past_interval_bounds() const' thread 7fed8da4c700 time 2017-06-02 13:35:41.966557
/mnt/jenkins/workspace/ceph-dev-new-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos7/DIST/centos7/MACHINE_SIZE/huge/release/12.0.2-1883-gb3f5819/rpm/el7/BUILD/ceph-12.0.2-1883-gb3f5819/src/osd/PG.cc: 806: FAILED assert(past_intervals.empty())

 ceph version 12.0.2-1883-gb3f5819 (b3f5819735f1d09f5595c8db176774b3eadf8ca8)
 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x110) [0x7feda78d8b00]
 2: (PG::check_past_interval_bounds() const+0x8f0) [0x7feda7420650]
 3: (PG::RecoveryState::Reset::react(PG::AdvMap const&)+0x217) [0x7feda74620c7]
 4: (boost::statechart::simple_state<PG::RecoveryState::Reset, PG::RecoveryState::RecoveryMachine, boost::mpl::list<mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na>, (boost::statechart::history_mode)0>::react_impl(boost::statechart::event_base const&, void const*)+0x21c) [0x7feda74a678c]
 5: (boost::statechart::state_machine<PG::RecoveryState::RecoveryMachine, PG::RecoveryState::Initial, std::allocator<void>, boost::statechart::null_exception_translator>::send_event(boost::statechart::event_base const&)+0x6b) [0x7feda748305b]
 6: (PG::handle_advance_map(std::shared_ptr<OSDMap const>, std::shared_ptr<OSDMap const>, std::vector<int, std::allocator<int> >&, int, std::vector<int, std::allocator<int> >&, int, PG::RecoveryCtx*)+0x48b) [0x7feda744ee8b]
 7: (OSD::advance_pg(unsigned int, PG*, ThreadPool::TPHandle&, PG::RecoveryCtx*, std::set<boost::intrusive_ptr<PG>, std::less<boost::intrusive_ptr<PG> >, std::allocator<boost::intrusive_ptr<PG> > >*)+0x2c7) [0x7feda739f8f7]
 8: (OSD::process_peering_events(std::list<PG*, std::allocator<PG*> > const&, ThreadPool::TPHandle&)+0x173) [0x7feda73a0373]
 9: (OSD::PeeringWQ::_process(std::list<PG*, std::allocator<PG*> > const&, ThreadPool::TPHandle&)+0x17) [0x7feda73fda27]
 10: (ThreadPool::worker(ThreadPool::WorkThread*)+0xb59) [0x7feda78df939]
 11: (ThreadPool::WorkThread::entry()+0x10) [0x7feda78e0910]
 12: (()+0x7dc5) [0x7feda4512dc5]
 13: (clone()+0x6d) [0x7feda340176d]


(bigbang)

Related issues 1 (0 open1 closed)

Related to Ceph - Bug #20687: osd: crashing on ec read partial failureResolvedDavid Zafman07/19/2017

Actions
Actions #1

Updated by Sage Weil almost 7 years ago

  • Description updated (diff)
Actions #2

Updated by Sage Weil almost 7 years ago

  • Status changed from New to Need More Info
  • Priority changed from Immediate to High
Actions #3

Updated by Sage Weil almost 7 years ago

  • Status changed from Need More Info to Resolved
Actions #4

Updated by David Zafman over 6 years ago

  • Related to Bug #20687: osd: crashing on ec read partial failure added
Actions

Also available in: Atom PDF