Bug #9584

closed

OpTracker segfault on shutdown (firefly)

Added by Samuel Just over 9 years ago. Updated about 7 years ago.

Status: Can't reproduce
Priority: High
Assignee: -
Category: -
Target version: -
% Done: 0%
Source: other
Tags:
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

#0 0x00007f5ec74baf07 in _dl_map_object_deps (map=map@entry=0x7f5ec76bc4e8, preloads=preloads@entry=0x0, npreloads=npreloads@entry=0, trace_mode=trace_mode@entry=0, open_mode=open_mode@entry=-2147483648) at dl-deps.c:528
#1 0x00007f5ec74c1aab in dl_open_worker (a=a@entry=0x7f5eb0bc61e8) at dl-open.c:272
#2 0x00007f5ec74bcff4 in _dl_catch_error (objname=objname@entry=0x7f5eb0bc61d8, errstring=errstring@entry=0x7f5eb0bc61e0, mallocedp=mallocedp@entry=0x7f5eb0bc61d0, operate=operate@entry=0x7f5ec74c19a0 <dl_open_worker>, args=args@entry=0x7f5eb0bc61e8) at dl-error.c:187
#3 0x00007f5ec74c13bb in _dl_open (file=0x7f5ec53d34de "libgcc_s.so.1", mode=-2147483647, caller_dlopen=<optimized out>, nsid=-2, argc=4, argv=0x7fff3aeda788, env=0x3de6000) at dl-open.c:661
#4 0x00007f5ec538d002 in do_dlopen (ptr=ptr@entry=0x7f5eb0bc6400) at dl-libc.c:87
#5 0x00007f5ec74bcff4 in _dl_catch_error (objname=0x7f5eb0bc63e0, errstring=0x7f5eb0bc63f0, mallocedp=0x7f5eb0bc63d0, operate=0x7f5ec538cfc0 <do_dlopen>, args=0x7f5eb0bc6400) at dl-error.c:187
#6 0x00007f5ec538d0c2 in dlerror_run (args=0x7f5eb0bc6400, operate=0x7f5ec538cfc0 <do_dlopen>) at dl-libc.c:46
#7 __GI___libc_dlopen_mode (name=name@entry=0x7f5ec53d34de "libgcc_s.so.1", mode=mode@entry=-2147483647) at dl-libc.c:163
#8 0x00007f5ec5361c65 in init () at ../sysdeps/x86_64/backtrace.c:52
#9 0x00007f5ec6be2a90 in pthread_once () at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_once.S:103
#10 0x00007f5ec5361d7c in __GI___backtrace (array=<optimized out>, size=100) at ../sysdeps/x86_64/backtrace.c:103
#11 0x00000000009808ff in BackTrace (s=0, this=0x7f5eb0bc6730) at ./common/BackTrace.h:19
#12 handle_fatal_signal (signum=11) at global/signal_handler.cc:90
#13 <signal handler called>
#14 0x00000000006aa340 in pair (this=0xeea850 <vtable for std::tr1::_Sp_counted_base_impl<DeletingState*, SharedPtrRegistry<spg_t, DeletingState>::OnRemoval, (__gnu_cxx::_Lock_policy)2>+48>) at /usr/include/c++/4.8/bits/stl_pair.h:96
#15 construct (this=<optimized out>, __val=..., __p=0xeea850 <vtable for std::tr1::_Sp_counted_base_impl<DeletingState*, SharedPtrRegistry<spg_t, DeletingState>::OnRemoval, (__gnu_cxx::_Lock_policy)2>+48>) at /usr/include/c++/4.8/ext/new_allocator.h:130
#16 _M_create_node (this=<optimized out>, __x=...) at /usr/include/c++/4.8/bits/stl_tree.h:382
#17 _M_insert_ (__v=..., __p=0x6c90cc0, __x=0x0, this=0x3ebadc8) at /usr/include/c++/4.8/bits/stl_tree.h:1023
#18 std::_Rb_tree<std::pair<utime_t, std::tr1::shared_ptr<TrackedOp> >, std::pair<utime_t, std::tr1::shared_ptr<TrackedOp> >, std::_Identity<std::pair<utime_t, std::tr1::shared_ptr<TrackedOp> > >, std::less<std::pair<utime_t, std::tr1::shared_ptr<TrackedOp> > >, std::allocator<std::pair<utime_t, std::tr1::shared_ptr<TrackedOp> > > >::_M_insert_unique (this=this@entry=0x3ebadc8, __v=...) at /usr/include/c++/4.8/bits/stl_tree.h:1382
#19 0x00000000006a5e05 in insert (__x=..., this=0x3ebadc8) at /usr/include/c++/4.8/bits/stl_set.h:463
#20 OpHistory::insert (this=this@entry=0x3ebadc8, now=..., now@entry=..., op=...) at common/TrackedOp.cc:43
#21 0x00000000006a63c4 in OpTracker::unregister_inflight_op (this=0x3ebad50, i=i@entry=0x7009590) at common/TrackedOp.cc:131
#22 0x00000000006a6741 in OpTracker::RemoveOnDelete::operator() (this=0x8dc79f8, op=0x7009590) at common/TrackedOp.cc:250
#23 0x0000000000664449 in std::tr1::_Sp_counted_base<(__gnu_cxx::_Lock_policy)2>::_M_release (this=0x8dc79e0) at /usr/include/c++/4.8/tr1/shared_ptr.h:141
#24 0x000000000061a0b7 in ~__shared_count (this=<synthetic pointer>, __in_chrg=<optimized out>) at /usr/include/c++/4.8/tr1/shared_ptr.h:341
#25 ~__shared_ptr (this=<synthetic pointer>, __in_chrg=<optimized out>) at /usr/include/c++/4.8/tr1/shared_ptr.h:541
#26 ~shared_ptr (this=<synthetic pointer>, __in_chrg=<optimized out>) at /usr/include/c++/4.8/tr1/shared_ptr.h:985
#27 OSD::OpWQ::_process (this=0x3ebae58, pg=..., handle=...) at osd/OSD.cc:7750
#28 0x000000000065c77c in ThreadPool::WorkQueueVal<std::pair<boost::intrusive_ptr<PG>, std::tr1::shared_ptr<OpRequest> >, boost::intrusive_ptr<PG> >::_void_process (this=0x3ebae58, handle=...) at ./common/WorkQueue.h:191
#29 0x0000000000a540f1 in ThreadPool::worker (this=0x3eba470, wt=0x40c8b40) at common/WorkQueue.cc:128
#30 0x0000000000a54fe0 in ThreadPool::WorkThread::entry (this=<optimized out>) at common/WorkQueue.h:318
#31 0x00007f5ec6bdd182 in start_thread (arg=0x7f5eb0bc8700) at pthread_create.c:312
#32 0x00007f5ec535138d in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111

The OSD was shutting down at the time.
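
Frames #20 to #27 show the last reference to an op being dropped in OSD::OpWQ::_process, which runs the OpTracker::RemoveOnDelete deleter and ends in an insert into the OpHistory set. The following is a minimal, single-threaded sketch of that pattern only. The names Tracker, unregister_op, on_shutdown, make_op, and the shutdown_ flag are hypothetical, the guard is an assumption for illustration, and none of this is taken from the Ceph source or proposed as a confirmed fix.

```cpp
#include <cstddef>
#include <memory>
#include <set>
#include <utility>

// Hypothetical stand-in for a tracked operation; not Ceph's TrackedOp.
struct TrackedOp { int id; };

// Hypothetical tracker that keeps a history of completed ops.
class Tracker {
public:
    // Plays the role that unregister_inflight_op plays in the backtrace:
    // record the finished op in the history set.
    void unregister_op(TrackedOp* op) {
        if (shutdown_) return;  // assumed guard: skip history once shutdown began
        history_.insert(std::make_pair(next_stamp_++, op->id));
    }
    // Assumed shutdown hook: stop recording and drop the history.
    void on_shutdown() { shutdown_ = true; history_.clear(); }
    std::size_t history_size() const { return history_.size(); }
private:
    bool shutdown_ = false;
    long next_stamp_ = 0;
    std::set<std::pair<long, int>> history_;
};

// Custom deleter: runs when the last shared_ptr reference goes away,
// calls back into the tracker, then frees the op.
struct RemoveOnDelete {
    Tracker* tracker;
    void operator()(TrackedOp* op) const {
        tracker->unregister_op(op);
        delete op;
    }
};

// Creates an op whose destruction notifies the given tracker.
std::shared_ptr<TrackedOp> make_op(Tracker& t, int id) {
    return std::shared_ptr<TrackedOp>(new TrackedOp{id}, RemoveOnDelete{&t});
}
```

The point of the sketch is that the deleter touches tracker state at whatever moment the last reference is released, so a reference dropped late in shutdown reaches the history set after teardown has begun. A multi-threaded version would also need locking around the flag and the set, which is omitted here.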


Related issues 1 (0 open, 1 closed)

Has duplicate Ceph - Bug #11055: firefly ceph-osd SEGV in tcmalloc testing wip-cot-firefly (Duplicate, 03/06/2015)

Actions #1

Updated by Samuel Just over 9 years ago

/a/samuelj-2014-09-23_14:40:50-rados-firefly-wip-testing-old-vanilla-basic-multi/507309 (once it times out)

Actions #2

Updated by Sage Weil over 9 years ago

  • Priority changed from Urgent to High

A shutdown race is not so important.

Actions #3

Updated by Sage Weil about 7 years ago

  • Status changed from New to Can't reproduce