Bug #992
osd: OSD::_lookup_lock_pg
Status: Closed
% Done: 0%
Description
While testing recovery from a failed machine, I encountered a few OSD crashes.
Both osd5 and osd11 went down with the same backtrace:
(gdb) bt
#0  0x00007f9a5e4437bb in raise () from /lib/libpthread.so.0
#1  0x0000000000620ce3 in reraise_fatal (signum=13291) at common/signal.cc:63
#2  0x0000000000621a0b in handle_fatal_signal (signum=6) at common/signal.cc:110
#3  <signal handler called>
#4  0x00007f9a5d013a75 in raise () from /lib/libc.so.6
#5  0x00007f9a5d0175c0 in abort () from /lib/libc.so.6
#6  0x00007f9a5d8c98e5 in __gnu_cxx::__verbose_terminate_handler() () from /usr/lib/libstdc++.so.6
#7  0x00007f9a5d8c7d16 in ?? () from /usr/lib/libstdc++.so.6
#8  0x00007f9a5d8c7d43 in std::terminate() () from /usr/lib/libstdc++.so.6
#9  0x00007f9a5d8c7e3e in __cxa_throw () from /usr/lib/libstdc++.so.6
#10 0x0000000000607caa in ceph::__ceph_assert_fail (assertion=<value optimized out>, file=<value optimized out>, line=<value optimized out>, func=0x6450a0 "PG* OSD::_lookup_lock_pg(pg_t)") at common/assert.cc:86
#11 0x00000000004e441e in OSD::_lookup_lock_pg (this=0xef6000, pgid=<value optimized out>) at osd/OSD.cc:1041
#12 0x00000000004e8f7f in OSD::lookup_lock_raw_pg (this=0xef6000, pgid=DWARF-2 expression error: DW_OP_reg operations must be used either alone or in conjuction with DW_OP_piece. ) at osd/OSD.cc:1059
#13 0x00000000004e9640 in OSD::put_object_context (this=0x33eb, _obc=0x5138d80, pgid=DWARF-2 expression error: DW_OP_reg operations must be used either alone or in conjuction with DW_OP_piece. ) at osd/OSD.cc:1833
#14 0x00000000004e9a1c in OSD::ms_handle_reset (this=0xef6000, con=<value optimized out>) at osd/OSD.cc:1911
#15 0x0000000000473e49 in Messenger::ms_deliver_handle_reset (this=0xef4000) at msg/Messenger.h:115
#16 SimpleMessenger::dispatch_entry (this=0xef4000) at msg/SimpleMessenger.cc:337
#17 0x000000000046a4fc in SimpleMessenger::DispatchThread::entry (this=0xef4488) at msg/SimpleMessenger.h:533
#18 0x00007f9a5e43a9ca in start_thread () from /lib/libpthread.so.0
#19 0x00007f9a5d0c670d in clone () from /lib/libc.so.6
#20 0x0000000000000000 in ?? ()
(gdb)
The core dumps can be found at atom1.ceph.widodh.nl and atom2.ceph.widodh.nl.