Bug #1833

mon: failed decode in LogMonitor::update_from_paxos

Added by Greg Farnum almost 8 years ago. Updated almost 8 years ago.

Status: Resolved
Priority: Normal
Assignee: -
Category: Monitor
Target version:
Start date: 12/15/2011
Due date:
% Done: 0%
Source:
Tags:
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature:

Description

Saw this on benjamin today. mon.beta had been out for a day or more and hit this while catching up. Perhaps it's already fixed in master after Sage's rewrite of some of that code?
Log attached.

2011-12-14 14:12:43.422513 392a9114700 store(/var/ceph/mon) get_int logm/last_consumed = 68115
2011-12-14 14:12:43.422533 392a9114700 store(/var/ceph/mon) get_bl logm/64894 No such file or directory
*** Caught signal (Aborted) **
 in thread 392a9114700
 ceph version 0.37-274-g58eb8c5 (commit:58eb8c5e46bc5641d23643eab8a01add9667bc89)
 1: /usr/bin/ceph-mon() [0x56b7a9]
 2: (()+0xef60) [0x392ac97bf60]
 3: (gsignal()+0x35) [0x392ab363165]
 4: (abort()+0x180) [0x392ab365f70]
 5: (__gnu_cxx::__verbose_terminate_handler()+0x115) [0x392abbf6dc5]
 6: (()+0xcb166) [0x392abbf5166]
 7: (()+0xcb193) [0x392abbf5193]
 8: (()+0xcb28e) [0x392abbf528e]
 9: (ceph::buffer::list::iterator::copy(unsigned int, char*)+0x13e) [0x554f9e]
 10: (LogMonitor::update_from_paxos()+0x1ae) [0x4e5d8e]
 11: (PaxosService::_active()+0x49) [0x4885c9]
 12: (Context::complete(int)+0xa) [0x4730da]
 13: (finish_contexts(CephContext*, std::list<Context*, std::allocator<Context*> >&, int)+0xda) [0x48577a]
 14: (Paxos::handle_lease(MMonPaxos*)+0x3a8) [0x47ee98]
 15: (Paxos::dispatch(PaxosServiceMessage*)+0x22b) [0x48469b]
 16: (Monitor::_ms_dispatch(Message*)+0xc24) [0x471734]
 17: (Monitor::ms_dispatch(Message*)+0xa4) [0x47b3e4]
 18: (SimpleMessenger::dispatch_entry()+0x869) [0x50fd09]
 19: (SimpleMessenger::DispatchThread::entry()+0x1c) [0x45e9ec]
 20: (()+0x68ba) [0x392ac9738ba]
 21: (clone()+0x6d) [0x392ab40002d]
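
For readers following the trace: the store read for logm/64894 fails, and the subsequent buffer copy in LogMonitor::update_from_paxos then throws an exception that nothing catches, so the terminate handler calls abort(). Below is a minimal sketch of that failure shape and of one way to guard against it. It is not Ceph code: ToyStore, ToyIterator, Result, and decode_version are hypothetical names invented for illustration, and the real fix in master may look quite different.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <map>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical stand-in for the monitor's on-disk store ("logm/<version>" -> bytes).
struct ToyStore {
    std::map<std::string, std::vector<char>> files;

    // Returns false and leaves `out` empty when the key is missing,
    // analogous to "get_bl logm/64894 No such file or directory".
    bool get_bl(const std::string& key, std::vector<char>& out) const {
        out.clear();
        auto it = files.find(key);
        if (it == files.end()) return false;
        out = it->second;
        return true;
    }
};

// Hypothetical stand-in for a buffer iterator: copy() throws when asked
// for more bytes than remain, as the copy in frame 9 appears to have done.
struct ToyIterator {
    const std::vector<char>& buf;
    std::size_t off = 0;

    explicit ToyIterator(const std::vector<char>& b) : buf(b) {}

    void copy(std::size_t len, char* dest) {
        if (len > buf.size() - off) throw std::out_of_range("end of buffer");
        std::memcpy(dest, buf.data() + off, len);
        off += len;
    }
};

enum class Result { Ok, Missing, Corrupt };

// Guarded decode: check the read result before decoding, and turn a short
// buffer into an error code instead of letting the exception escape.
Result decode_version(const ToyStore& store, std::uint64_t v, std::uint32_t& value) {
    std::vector<char> bl;
    if (!store.get_bl("logm/" + std::to_string(v), bl)) return Result::Missing;
    try {
        ToyIterator p(bl);
        p.copy(sizeof(value), reinterpret_cast<char*>(&value));
    } catch (const std::out_of_range&) {
        return Result::Corrupt;
    }
    return Result::Ok;
}
```

In this sketch a caller that receives Result::Missing can decide to skip the version or resync rather than crash. Decoding the empty buffer directly with ToyIterator, without the get_bl check or the try block, reproduces the uncaught-exception path seen in frames 4 through 9.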

mon.beta.log.1 (7.87 MB) Greg Farnum, 12/15/2011 02:32 PM

History

#1 Updated by Sage Weil almost 8 years ago

  • Status changed from New to Resolved

Yeah, this is one of the things I hit (and fixed) in a few different ways when doing the mon thrashing on the new code.
