Bug #6101

closed

ceph-osd crash on corrupted store

Added by Mikaël Cluseau over 10 years ago. Updated over 5 years ago.

Status: Can't reproduce
Priority: High
Assignee: -
Category: -
Target version: -
% Done: 0%
Source: Community (user)
Tags:
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

I see a problem with one of my Dumpling OSDs on Debian, running on a compressed btrfs filesystem. I think my btrfs is corrupted or somewhat inconsistent (this is common, at least with my workload).

However, this corruption makes the OSD crash, so I think it's worth reporting as a bug. The OSD does not seem to crash when it is marked "out".

 ceph version 0.67.1 (e23b817ad0cf1ea19c0a7b7c9999b30bed37d533)
 1: ceph-osd() [0x8fd702]
 2: (()+0xf030) [0x7fc5a59b3030]
 3: (gsignal()+0x35) [0x7fc5a3e5d475]
 4: (abort()+0x180) [0x7fc5a3e606f0]
 5: (__gnu_cxx::__verbose_terminate_handler()+0x11d) [0x7fc5a46b289d]
 6: (()+0x63996) [0x7fc5a46b0996]
 7: (()+0x639c3) [0x7fc5a46b09c3]
 8: (()+0x63bee) [0x7fc5a46b0bee]
 9: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x40a) [0x9bf04a]
 10: (ReplicatedPG::trim_object(hobject_t const&)+0x150) [0x6f8500]
 11: (ReplicatedPG::TrimmingObjects::react(ReplicatedPG::SnapTrim const&)+0x249) [0x701ba9]
 12: (boost::statechart::detail::reaction_result boost::statechart::simple_state<ReplicatedPG::TrimmingObjects, ReplicatedPG::SnapTrimmer, boost::mpl::list<mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na>, (boost::statechart::history_mode)0>::local_react_impl_non_empty::local_react_impl<boost::mpl::list<boost::statechart::custom_reaction<ReplicatedPG::SnapTrim>, boost::statechart::transition<ReplicatedPG::Reset, ReplicatedPG::NotTrimming, boost::statechart::detail::no_context<ReplicatedPG::Reset>, &boost::statechart::detail::no_context<ReplicatedPG::Reset>::no_function>, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na>, boost::statechart::simple_state<ReplicatedPG::TrimmingObjects, ReplicatedPG::SnapTrimmer, boost::mpl::list<mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na>, (boost::statechart::history_mode)0> >(boost::statechart::simple_state<ReplicatedPG::TrimmingObjects, ReplicatedPG::SnapTrimmer, boost::mpl::list<mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na>, (boost::statechart::history_mode)0>&, boost::statechart::event_base const&, void const*)+0x96) [0x7540c6]
 13: (boost::statechart::state_machine<ReplicatedPG::SnapTrimmer, ReplicatedPG::NotTrimming, std::allocator<void>, boost::statechart::null_exception_translator>::process_queued_events()+0x137) [0x72ed47]
 14: (boost::statechart::state_machine<ReplicatedPG::SnapTrimmer, ReplicatedPG::NotTrimming, std::allocator<void>, boost::statechart::null_exception_translator>::process_event(boost::statechart::event_base const&)+0x26) [0x72ff06]
 15: (ReplicatedPG::snap_trimmer()+0x423) [0x6ca773]
 16: (OSD::SnapTrimWQ::_process(PG*)+0x14) [0x7a33f4]
 17: (ThreadPool::worker(ThreadPool::WorkThread*)+0x68a) [0x9b38ea]
 18: (ThreadPool::WorkThread::entry()+0x10) [0x9b4b40]
 19: (()+0x6b50) [0x7fc5a59aab50]
 20: (clone()+0x6d) [0x7fc5a3f05a7d]

Files

ceph-osd.1.log.xz (1.53 MB), osd.1 log, Mikaël Cluseau, 08/22/2013 08:53 PM
ceph-mon.dbo-prod-mx-bdd1.log.xz (2.46 MB), Mikaël Cluseau, 08/27/2013 01:11 AM
ceph-osd.1.log.xz (3.31 MB), Mikaël Cluseau, 08/27/2013 01:11 AM
ceph-osd.5.log.xz (345 KB), Mikaël Cluseau, 08/27/2013 01:11 AM
ceph.log.xz (1.31 MB), Mikaël Cluseau, 08/27/2013 01:11 AM
ceph-mon.dsi-sei-dev-3.log.xz (6.49 MB), Mikaël Cluseau, 08/27/2013 01:11 AM
ceph-osd.0.log.xz (1.62 MB), Mikaël Cluseau, 08/27/2013 01:11 AM
ceph-osd.2.log.xz (38.9 MB), Mikaël Cluseau, 08/27/2013 01:11 AM
0001-Don-t-assert-if-trimming-of-object-fails.patch (1.32 KB), Daniel Poelzleithner, 02/10/2014 06:44 PM
0001-Don-t-assert-if-trimming-of-object-fails.patch (1.34 KB), Daniel Poelzleithner, 02/11/2014 12:48 PM
ceph-trim-missing-snapshot.patch (2.24 KB), Nathan O'Sullivan, 02/18/2015 11:07 PM