Project

General

Profile

Actions

Bug #44566

closed

ceph tell segv: librados fini vs protocolv2

Added by Sage Weil about 4 years ago. Updated about 4 years ago.

Status:
Resolved
Priority:
Urgent
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

2020-03-11T12:45:56.818 INFO:teuthology.orchestra.run.smithi112:> sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 120 ceph --cluster ceph --log-early -- tell 'osd.*' injectargs --osd_enable_op_tracker=true
2020-03-11T12:45:57.143 INFO:teuthology.orchestra.run.smithi112.stderr:osd.0: osd_enable_op_tracker = 'true'
2020-03-11T12:45:57.144 INFO:teuthology.orchestra.run.smithi112.stderr:osd.1: osd_enable_op_tracker = 'true'
2020-03-11T12:45:57.144 INFO:teuthology.orchestra.run.smithi112.stderr:osd.2: osd_enable_op_tracker = 'true'
2020-03-11T12:45:57.144 INFO:teuthology.orchestra.run.smithi112.stderr:Error ENXIO: problem getting command descriptions from osd.3
2020-03-11T12:45:57.144 INFO:teuthology.orchestra.run.smithi112.stderr:osd.3: problem getting command descriptions from osd.3
2020-03-11T12:45:57.145 INFO:teuthology.orchestra.run.smithi112.stderr:osd.4: osd_enable_op_tracker = 'true'
2020-03-11T12:45:57.145 INFO:teuthology.orchestra.run.smithi112.stderr:osd.5: osd_enable_op_tracker = 'true'
2020-03-11T12:45:57.145 INFO:teuthology.orchestra.run.smithi112.stderr:osd.6: osd_enable_op_tracker = 'true'
2020-03-11T12:45:57.145 INFO:teuthology.orchestra.run.smithi112.stderr:osd.7: osd_enable_op_tracker = 'true'
2020-03-11T12:45:57.177 INFO:teuthology.orchestra.run.smithi112.stderr:timeout: the monitored command dumped core

/a/sage-2020-03-11_12:18:48-rados-wip-sage3-testing-2020-03-10-2044-distro-basic-smithi/4846762

the core reveals:

Thread 12 (Thread 0x7f51a0757740 (LWP 15439)):
#0  0x00007f519ff4381a in __pthread_rwlock_wrlock_full (abstime=0x0, rwlock=0x7f5193b650c0) at pthread_rwlock_common.c:595
#1  __GI___pthread_rwlock_wrlock (rwlock=0x7f5193b650c0) at pthread_rwlock_wrlock.c:27
#2  0x00007f519392290e in rtnl_link_unregister_info () from /usr/lib/x86_64-linux-gnu/libnl-route-3.so.200
#3  0x00007f51a0557b73 in _dl_fini () at dl-fini.c:138
#4  0x00007f51a0199041 in __run_exit_handlers (status=6, listp=0x7f51a0541718 <__exit_funcs>, run_list_atexit=run_list_atexit@entry=true, run_dtors=run_dtors@entry=true) at exit.c:108
#5  0x00007f51a019913a in __GI_exit (status=<optimized out>) at exit.c:139
#6  0x00000000006384f7 in ?? ()
#7  0x0000000000000006 in ?? ()
#8  0x00000000006385c0 in ?? ()
#9  0x0000000001e34630 in ?? ()
#10 0x0000000000a68b60 in ?? ()
#11 0x0000000000000001 in ?? ()
#12 0x00007f5192d1af08 in ?? ()
#13 0x0000000000a68b60 in ?? ()
#14 0x0000000000a1af20 in ?? ()
#15 0x0000000000000001 in ?? ()
#16 0x00000000006385ec in ?? ()
#17 0x0000000000a68b60 in ?? ()
#18 0x00007f51a06cd288 in ?? ()
#19 0x00007f51a06cd288 in ?? ()
#20 0x00007f51a06c5d68 in ?? ()
#21 0x0000000000000000 in ?? ()

and
Thread 1 (Thread 0x7f518b7fe700 (LWP 15446)):
#0  __memcmp_avx2_movbe () at ../sysdeps/x86_64/multiarch/memcmp-avx2-movbe.S:244
#1  0x00007f51957ec764 in std::char_traits<char>::compare (__n=<optimized out>, __s2=<optimized out>, __s1=<optimized out>) at /usr/include/c++/7/bits/char_traits.h:310
#2  std::basic_string_view<char, std::char_traits<char> >::compare (__str=..., this=<synthetic pointer>) at /usr/include/c++/7/string_view:268
#3  std::operator< <char, std::char_traits<char> > (__y=..., Python Exception <class 'gdb.error'> value has been optimized out: 
__x=) at /usr/include/c++/7/string_view:488
#4  std::less<std::basic_string_view<char, std::char_traits<char> > >::operator() (this=<optimized out>, __y="ms_inject_delay_type", __x=<error: Cannot access memory at address 0x7f5193625ab0>) at /usr/include/c++/7/bits/stl_function.h:386
#5  std::_Rb_tree<std::basic_string_view<char, std::char_traits<char> >, std::pair<std::basic_string_view<char, std::char_traits<char> > const, Option const&>, std::_Select1st<std::pair<std::basic_string_view<char, std::char_traits<char> > const, Option const&> >, std::less<std::basic_string_view<char, std::char_traits<char> > >, std::allocator<std::pair<std::basic_string_view<char, std::char_traits<char> > const, Option const&> > >::_M_lower_bound (this=0x7f518c0049a8, __k="ms_inject_delay_type", __y=0x7f518c00bbb0, __x=0x7f518c00b470) at /usr/include/c++/7/bits/stl_tree.h:1888
#6  std::_Rb_tree<std::basic_string_view<char, std::char_traits<char> >, std::pair<std::basic_string_view<char, std::char_traits<char> > const, Option const&>, std::_Select1st<std::pair<std::basic_string_view<char, std::char_traits<char> > const, Option const&> >, std::less<std::basic_string_view<char, std::char_traits<char> > >, std::allocator<std::pair<std::basic_string_view<char, std::char_traits<char> > const, Option const&> > >::find (this=0x7f518c0049a8, __k="ms_inject_delay_type") at /usr/include/c++/7/bits/stl_tree.h:2536
#7  0x00007f519580f551 in std::map<std::basic_string_view<char, std::char_traits<char> >, Option const&, std::less<std::basic_string_view<char, std::char_traits<char> > >, std::allocator<std::pair<std::basic_string_view<char, std::char_traits<char> > const, Option const&> > >::find (__x=..., this=<optimized out>)
    at /usr/include/c++/7/bits/stl_map.h:1188
#8  md_config_t::find_option (this=<optimized out>, name="ms_inject_delay_type") at ./src/common/config.cc:245
#9  0x00007f51958132ab in md_config_t::_get_val (this=0x7f518c004970, values=..., key=..., stack=0x0, err=0x0) at ./src/common/config.cc:1050
---Type <return> to continue, or q <return> to quit---
#10 0x00007f5195813592 in md_config_t::get_val_generic[abi:cxx11](ConfigValues const&, std::basic_string_view<char, std::char_traits<char> >) const (this=<optimized out>, this@entry=0x7f518c004970, values=..., key=...) at ./src/common/config.cc:1034
#11 0x00007f5195980a79 in md_config_t::with_val<std::__cxx11::basic_string<char>, AsyncConnection::maybe_start_delay_thread()::<lambda(const string&)> > (cb=..., key=..., values=..., this=0x7f518c004970) at ./src/common/config.h:201
#12 ceph::common::ConfigProxy::with_val<std::__cxx11::basic_string<char>, AsyncConnection::maybe_start_delay_thread()::<lambda(const string&)> > (cb=..., key="ms_inject_delay_type", this=0x7f518c001518) at ./src/common/config_proxy.h:150
#13 AsyncConnection::maybe_start_delay_thread (this=0x7f517802d080) at ./src/msg/async/AsyncConnection.cc:169
#14 0x00007f51959bf1ef in ProtocolV2::ready (this=this@entry=0x7f517802f540) at ./src/msg/async/ProtocolV2.cc:1309
#15 0x00007f51959c8930 in ProtocolV2::handle_server_ident (this=this@entry=0x7f517802f540, payload=...) at ./src/msg/async/ProtocolV2.cc:2142
#16 0x00007f51959d34cb in ProtocolV2::handle_frame_payload (this=this@entry=0x7f517802f540) at ./src/msg/async/ProtocolV2.cc:1261
#17 0x00007f51959d3800 in ProtocolV2::handle_read_frame_dispatch (this=this@entry=0x7f517802f540) at ./src/msg/async/ProtocolV2.cc:1159
#18 0x00007f51959d3a60 in ProtocolV2::handle_read_frame_epilogue_main (this=0x7f517802f540, buffer=..., r=<optimized out>) at ./src/msg/async/ProtocolV2.cc:1385
#19 0x00007f51959b9074 in ProtocolV2::run_continuation (this=0x7f517802f540, continuation=...) at ./src/msg/async/ProtocolV2.cc:45
#20 0x00007f519598303c in std::function<void (char*, long)>::operator()(char*, long) const (__args#1=<optimized out>, __args#0=<optimized out>, this=0x7f517802d420) at /usr/include/c++/7/bits/std_function.h:706
#21 AsyncConnection::process (this=0x7f517802d080) at ./src/msg/async/AsyncConnection.cc:457
#22 0x00007f51959d9d6d in EventCenter::process_events (this=this@entry=0x7f518c043c90, timeout_microseconds=<optimized out>, timeout_microseconds@entry=30000000, working_dur=working_dur@entry=0x7f518b7fde28) at ./src/msg/async/Event.cc:406
#23 0x00007f51959e1a08 in NetworkStack::<lambda()>::operator() (__closure=0x7f518c060b28) at ./src/msg/async/Stack.cc:53
#24 std::_Function_handler<void(), NetworkStack::add_thread(unsigned int)::<lambda()> >::_M_invoke(const std::_Any_data &) (__functor=...) at /usr/include/c++/7/bits/std_function.h:316
#25 0x00007f5194fc06ef in ?? () from /usr/lib/x86_64-linux-gnu/libstdc++.so.6
#26 0x00007f519ff3e6db in start_thread (arg=0x7f518b7fe700) at pthread_create.c:463
#27 0x00007f51a027788f in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:95


Related issues 1 (0 open1 closed)

Related to Orchestrator - Bug #44526: sporatic cephadm bootstrap failures: 'timed out'ResolvedSage Weil

Actions
Actions

Also available in: Atom PDF