Bug #8998

closed

osd: SEGV in OSD::heartbeat()

Added by Sage Weil over 9 years ago. Updated over 9 years ago.

Status: Resolved
Priority: Urgent
Assignee: -
Category: OSD
Target version: -
% Done: 0%
Source: Q/A
Tags:
Backport:
Regression:
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

<pre>
     0> 2014-08-02 00:13:47.060726 7fe4d5c76700 -1 *** Caught signal (Segmentation fault) **
 in thread 7fe4d5c76700

 ceph version 0.83-419-gc2fc1a9 (c2fc1a942964fade41cce3902f96615578aa911a)
 1: ceph-osd() [0x9a7f1a]
 2: (()+0xfcb0) [0x7fe4f58cfcb0]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
</pre>

ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-08-01_02:32:01-rados-master-testing-basic-plana/392102
<pre>
Thread 114 (Thread 0x7fe4e6c98700 (LWP 25329)):
#0  0x00007fe4f58ce89c in __lll_lock_wait () from /lib/x86_64-linux-gnu/libpthread.so.0
#1  0x00007fe4f58ca065 in _L_lock_858 () from /lib/x86_64-linux-gnu/libpthread.so.0
#2  0x00007fe4f58c9eba in pthread_mutex_lock () from /lib/x86_64-linux-gnu/libpthread.so.0
#3  0x0000000000a42733 in Mutex::Lock (this=0x2358370, no_lockdep=<optimized out>) at common/Mutex.cc:89
#4  0x0000000000636f57 in Locker (m=..., this=<synthetic pointer>) at ./common/Mutex.h:120
#5  OSD::maybe_update_heartbeat_peers (this=0x2357700) at osd/OSD.cc:2917
#6  0x000000000066fa00 in OSD::handle_osd_map (this=0x2357700, m=0xd1c0d80) at osd/OSD.cc:6035
#7  0x00000000006719fb in OSD::_dispatch (this=0x2357700, m=0xd1c0d80) at osd/OSD.cc:5321
#8  0x0000000000672168 in OSD::ms_dispatch (this=0x2357700, m=0xd1c0d80) at osd/OSD.cc:4990
#9  0x0000000000b37827 in Messenger::ms_deliver_dispatch (this=0x1e5b800, m=0xd1c0d80) at msg/Messenger.h:636
#10 0x0000000000b345c2 in DispatchQueue::entry (this=0x1e5b908) at msg/DispatchQueue.cc:185
#11 0x0000000000a6b8ad in DispatchQueue::DispatchThread::entry (this=<optimized out>) at msg/DispatchQueue.h:104
#12 0x00007fe4f58c7e9a in start_thread () from /lib/x86_64-linux-gnu/libpthread.so.0
#13 0x00007fe4f42783fd in clone () from /lib/x86_64-linux-gnu/libc.so.6
#14 0x0000000000000000 in ?? ()

Thread 1 (Thread 0x7fe4d5c76700 (LWP 25363)):
#0  0x00007fe4f58cfb7b in raise () from /lib/x86_64-linux-gnu/libpthread.so.0
#1  0x00000000009a806e in reraise_fatal (signum=11) at global/signal_handler.cc:59
#2  handle_fatal_signal (signum=11) at global/signal_handler.cc:105
#3  <signal handler called>
#4  0x0000000000000000 in ?? ()
#5  0x000000000067dfe9 in _M_release (this=0x3443d80) at /usr/include/c++/4.6/tr1/shared_ptr.h:147
#6  std::tr1::__shared_count<(__gnu_cxx::_Lock_policy)2>::~__shared_count (this=<optimized out>, __in_chrg=<optimized out>)
    at /usr/include/c++/4.6/tr1/shared_ptr.h:348
#7  0x00000000006189cd in ~__shared_ptr (this=0x7fe4d5c75990, __in_chrg=<optimized out>) at /usr/include/c++/4.6/tr1/shared_ptr.h:548
#8  ~shared_ptr (this=0x7fe4d5c75990, __in_chrg=<optimized out>) at /usr/include/c++/4.6/tr1/shared_ptr.h:992
#9  OSDService::update_osd_stat (this=0x2358da0, hb_peers=...) at osd/OSD.cc:2835
#10 0x0000000000639147 in OSD::heartbeat (this=0x2357700) at osd/OSD.cc:3242
#11 0x0000000000639e1d in OSD::heartbeat_entry (this=0x2357700) at osd/OSD.cc:3170
#12 0x000000000069f91d in OSD::T_Heartbeat::entry (this=<optimized out>) at osd/OSD.h:1287
#13 0x00007fe4f58c7e9a in start_thread () from /lib/x86_64-linux-gnu/libpthread.so.0
#14 0x00007fe4f42783fd in clone () from /lib/x86_64-linux-gnu/libc.so.6
#15 0x0000000000000000 in ?? ()

</pre>
Actions #1

Updated by Sage Weil over 9 years ago

ubuntu@teuthology:/a/teuthology-2014-08-03_02:30:01-rados-next-testing-basic-plana/394893

Actions #2

Updated by Sage Weil over 9 years ago

  • Status changed from New to Fix Under Review
Actions #3

Updated by Sage Weil over 9 years ago

  • Status changed from Fix Under Review to Pending Backport
Actions #4

Updated by Sage Weil over 9 years ago

  • Status changed from Pending Backport to Resolved

No backport needed. This happened because update_osd_stat() had been moved into OSDService but was still using the other dout macro (the one meant for OSD). Firefly never got the patch that moved it out of OSD, so it is not affected.
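
For readers unfamiliar with this kind of bug, here is a minimal sketch of why a logging-prefix macro can silently change meaning when a function moves between classes. It is not Ceph's code: `Map`, `Daemon`, `Service`, `LOG_PREFIX`, `make_prefix`, `who()` and `get_map()` are all hypothetical names chosen for illustration. The macro body is resolved in the scope where it is expanded, so the same text calls a lock-protected accessor in one class and an unprotected one in the other. In a multi-threaded program the unprotected shared_ptr copy could race with a concurrent map update, which is one plausible route to a crash in a shared_ptr destructor like the one in the backtrace above.

```cpp
#include <cassert>
#include <memory>
#include <mutex>
#include <sstream>
#include <string>

// Hypothetical stand-in for a cluster map; only the epoch matters here.
struct Map { unsigned epoch; };
using MapRef = std::shared_ptr<const Map>;

// Builds a log prefix such as "daemon e7 ".
std::string make_prefix(const char* who, const MapRef& m) {
  assert(m && "prefix needs a valid map");
  std::ostringstream os;
  os << who << " e" << m->epoch << " ";
  return os.str();
}

// The macro names who() and get_map() without qualification, so they are
// looked up in whichever class the macro happens to be expanded in.
#define LOG_PREFIX make_prefix(who(), get_map())

class Daemon {
public:
  explicit Daemon(unsigned e) : map_(std::make_shared<Map>(Map{e})) {}
  std::string log(const std::string& msg) { return LOG_PREFIX + msg; }
private:
  const char* who() const { return "daemon"; }
  // Copies the reference while holding the lock.
  MapRef get_map() {
    std::lock_guard<std::mutex> l(lock_);
    return map_;
  }
  std::mutex lock_;
  MapRef map_;
};

class Service {
public:
  explicit Service(unsigned e) : published_(std::make_shared<Map>(Map{e})) {}
  // Same macro text as in Daemon::log, but it now binds to Service's members.
  std::string log(const std::string& msg) { return LOG_PREFIX + msg; }
private:
  const char* who() const { return "service"; }
  // Assumption for illustration: no lock is taken here, so in a threaded
  // program this copy could race with a writer replacing published_.
  MapRef get_map() { return published_; }
  MapRef published_;
};
```

With `Daemon d(7); Service s(9);`, `d.log("x")` yields `"daemon e7 x"` and `s.log("y")` yields `"service e9 y"`: identical macro text, different accessors. The usual remedy is to redefine the prefix macro for the code that lives in the second class.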
