Project

General

Profile

Actions

Bug #16771

open

mon crash in MDSMonitor::prepare_beacon on ARM

Added by stephane beuret almost 8 years ago. Updated over 7 years ago.

Status:
New
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
other
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
fs
Component(FS):
MDSMonitor
Labels (FS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

ceph 10.2.2
ubuntu 16.10
in Docker version 1.11.1, build 5604cbe
on arch armhf (rapsberry pi running hypriot)

   -27> 2016-07-21 20:55:40.828826 73594d00  1 -- 172.31.100.2:6789/0 <== osd.2 192.168.100.152:6800/382 1 ==== auth(proto 0 26 bytes epoch 3) v1 ==== 56+0+0 (3518702297 0 0) 0x5bf35c20 con 0x5cc801c0
   -26> 2016-07-21 20:55:40.828972 73594d00  5 -- op tracker -- seq: 1126, time: 2016-07-21 20:55:40.828971, event: mon:_ms_dispatch, op: auth(proto 0 26 bytes epoch 3)
   -25> 2016-07-21 20:55:40.829030 73594d00  5 -- op tracker -- seq: 1126, time: 2016-07-21 20:55:40.829029, event: mon:dispatch_op, op: auth(proto 0 26 bytes epoch 3)
   -24> 2016-07-21 20:55:40.829067 73594d00  5 -- op tracker -- seq: 1126, time: 2016-07-21 20:55:40.829065, event: psvc:dispatch, op: auth(proto 0 26 bytes epoch 3)
   -23> 2016-07-21 20:55:40.829103 73594d00  5 mon.ceph2@1(leader).paxos(paxos updating c 1..316) is_readable = 1 - now=2016-07-21 20:55:40.829121 lease_expire=2016-07-21 20:55:45.748494 has v0 lc 316
   -22> 2016-07-21 20:55:40.829459 73594d00  5 -- op tracker -- seq: 1126, time: 2016-07-21 20:55:40.829457, event: send_reply, op: auth(proto 0 26 bytes epoch 3)
   -21> 2016-07-21 20:55:40.829497 73594d00  2 mon.ceph2@1(leader) e3 send_reply 0x5bdb5600 0x5bf34080 auth_reply(proto 2 0 (0) Success) v1
   -20> 2016-07-21 20:55:40.829549 73594d00  1 -- 172.31.100.2:6789/0 --> 192.168.100.152:6800/382 -- auth_reply(proto 2 0 (0) Success) v1 -- ?+0 0x5bf34080 con 0x5cc801c0
   -19> 2016-07-21 20:55:40.829630 73594d00  5 -- op tracker -- seq: 1126, time: 2016-07-21 20:55:40.829629, event: reply: send, op: auth(proto 0 26 bytes epoch 3)
   -18> 2016-07-21 20:55:40.829671 73594d00  5 -- op tracker -- seq: 1126, time: 2016-07-21 20:55:40.829669, event: done, op: auth(proto 0 26 bytes epoch 3)
   -17> 2016-07-21 20:55:40.829748 73594d00  1 -- 172.31.100.2:6789/0 <== osd.3 192.168.100.152:6802/383 3 ==== mon_subscribe({monmap=4+,osd_pg_creates=0+}) v2 ==== 50+0+0 (3172493646 0 0) 0x5bda3760 con 0x5cc7fe00
   -16> 2016-07-21 20:55:40.829826 73594d00  5 -- op tracker -- seq: 1127, time: 2016-07-21 20:55:40.829824, event: mon:_ms_dispatch, op: mon_subscribe({monmap=4+,osd_pg_creates=0+})
   -15> 2016-07-21 20:55:40.829859 73594d00  5 -- op tracker -- seq: 1127, time: 2016-07-21 20:55:40.829858, event: mon:dispatch_op, op: mon_subscribe({monmap=4+,osd_pg_creates=0+})
   -14> 2016-07-21 20:55:40.830032 73594d00  5 -- op tracker -- seq: 1127, time: 2016-07-21 20:55:40.830031, event: done, op: mon_subscribe({monmap=4+,osd_pg_creates=0+})
   -13> 2016-07-21 20:55:40.830109 73594d00  1 -- 172.31.100.2:6789/0 <== mds.? 192.168.100.151:6804/185 3 ==== mon_subscribe({mdsmap=3+,monmap=4+}) v2 ==== 42+0+0 (3086761029 0 0) 0x5bda5de0 con 0x5cc7ff40
   -12> 2016-07-21 20:55:40.830735 73594d00  5 -- op tracker -- seq: 1128, time: 2016-07-21 20:55:40.830734, event: mon:_ms_dispatch, op: mon_subscribe({mdsmap=3+,monmap=4+})
   -11> 2016-07-21 20:55:40.830809 73594d00  5 -- op tracker -- seq: 1128, time: 2016-07-21 20:55:40.830808, event: mon:dispatch_op, op: mon_subscribe({mdsmap=3+,monmap=4+})
   -10> 2016-07-21 20:55:40.831251 73594d00  5 -- op tracker -- seq: 1128, time: 2016-07-21 20:55:40.831250, event: done, op: mon_subscribe({mdsmap=3+,monmap=4+})
    -9> 2016-07-21 20:55:40.831339 73594d00  1 -- 172.31.100.2:6789/0 <== mds.? 192.168.100.151:6804/185 4 ==== mdsbeacon(14283/mds-ceph1 up:boot seq 5 v2) v7 ==== 768+0+0 (1025980531 0 0) 0x5bfc6080 con 0x5cc7ff40
    -8> 2016-07-21 20:55:40.831416 73594d00  5 -- op tracker -- seq: 1129, time: 2016-07-21 20:55:40.831415, event: mon:_ms_dispatch, op: mdsbeacon(14283/mds-ceph1 up:boot seq 5 v2)
    -7> 2016-07-21 20:55:40.831459 73594d00  5 -- op tracker -- seq: 1129, time: 2016-07-21 20:55:40.831457, event: mon:dispatch_op, op: mdsbeacon(14283/mds-ceph1 up:boot seq 5 v2)
    -6> 2016-07-21 20:55:40.831571 73594d00  5 -- op tracker -- seq: 1129, time: 2016-07-21 20:55:40.831570, event: psvc:dispatch, op: mdsbeacon(14283/mds-ceph1 up:boot seq 5 v2)
    -5> 2016-07-21 20:55:40.831805 73594d00  5 mon.ceph2@1(leader).paxos(paxos updating c 1..316) is_readable = 1 - now=2016-07-21 20:55:40.831829 lease_expire=2016-07-21 20:55:45.748494 has v0 lc 316
    -4> 2016-07-21 20:55:40.831871 73594d00  5 -- op tracker -- seq: 1129, time: 2016-07-21 20:55:40.831870, event: mdsmap:preprocess_query, op: mdsbeacon(14283/mds-ceph1 up:boot seq 5 v2)
    -3> 2016-07-21 20:55:40.831913 73594d00  5 -- op tracker -- seq: 1129, time: 2016-07-21 20:55:40.831912, event: mdsmap:preprocess_beacon, op: mdsbeacon(14283/mds-ceph1 up:boot seq 5 v2)
    -2> 2016-07-21 20:55:40.831974 73594d00  5 -- op tracker -- seq: 1129, time: 2016-07-21 20:55:40.831972, event: mdsmap:prepare_update, op: mdsbeacon(14283/mds-ceph1 up:boot seq 5 v2)
    -1> 2016-07-21 20:55:40.832012 73594d00  5 -- op tracker -- seq: 1129, time: 2016-07-21 20:55:40.832011, event: mdsmap:prepare_beacon, op: mdsbeacon(14283/mds-ceph1 up:boot seq 5 v2)
     0> 2016-07-21 20:55:40.849882 73594d00 -1 *** Caught signal (Segmentation fault) **
 in thread 73594d00 thread_name:ms_dispatch

 ceph version 10.2.2 (45107e21c568dd033c2f0a3107dec8f0b0e58374)
 1: (()+0x3ebf7a) [0x54eb0f7a]
 2: (()+0x25250) [0x76a04250]
 3: (std::_Rb_tree_iterator<std::pair<mds_gid_t const, unsigned int> > std::_Rb_tree<mds_gid_t, std::pair<mds_gid_t const, unsigned int>, std::_Select1st<std::pair<mds_gid_t const, unsigned int> >, std::less<mds_gid_t>, std::allocator<std::pair<mds_gid_t const, unsigned int> > >::_M_emplace_hint_unique<std::piecewise_construct_t const&, std::tuple<mds_gid_t const&>, std::tuple<> >(std::_Rb_tree_const_iterator<std::pair<mds_gid_t const, unsigned int> >, std::piecewise_construct_t const&, std::tuple<mds_gid_t const&>&&, std::tuple<>&&)+0x2f) [0x54d9ade0]
 4: (FSMap::insert(MDSMap::mds_info_t const&)+0x11f) [0x54edf068]
 5: (MDSMonitor::prepare_beacon(std::shared_ptr<MonOpRequest>)+0xadd) [0x54d8f6f6]
 6: (MDSMonitor::prepare_update(std::shared_ptr<MonOpRequest>)+0x133) [0x54d93370]
 7: (PaxosService::dispatch(std::shared_ptr<MonOpRequest>)+0x749) [0x54d3e392]
 8: (Monitor::dispatch_op(std::shared_ptr<MonOpRequest>)+0x28f) [0x54d15470]
 9: (Monitor::_ms_dispatch(Message*)+0x391) [0x54d15eae]
 10: (Monitor::ms_dispatch(Message*)+0x19) [0x54d2d0b2]
 11: (DispatchQueue::entry()+0x9cd) [0x5500cb32]
 12: (DispatchQueue::DispatchThread::entry()+0x7) [0x54f60554]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

--- logging levels ---
   0/ 5 none
   0/ 1 lockdep
   0/ 1 context
   1/ 1 crush
   1/ 5 mds
   1/ 5 mds_balancer
   1/ 5 mds_locker
   1/ 5 mds_log
   1/ 5 mds_log_expire
   1/ 5 mds_migrator
   0/ 1 buffer
   0/ 1 timer
   0/ 1 filer
   0/ 1 striper
   0/ 1 objecter
   0/ 5 rados
   0/ 5 rbd
   0/ 5 rbd_mirror
   0/ 5 rbd_replay
   0/ 5 journaler
   0/ 5 objectcacher
   0/ 5 client
   0/ 5 osd
   0/ 5 optracker
   0/ 5 objclass
   1/ 3 filestore
   1/ 3 journal
   0/ 5 ms
   1/ 5 mon
   0/10 monc
   1/ 5 paxos
   0/ 5 tp
   1/ 5 auth
   1/ 5 crypto
   1/ 1 finisher
   1/ 5 heartbeatmap
   1/ 5 perfcounter
   1/ 5 rgw
   1/10 civetweb
   1/ 5 javaclient
   1/ 5 asok
   1/ 1 throttle
   0/ 0 refs
   1/ 5 xio
   1/ 5 compressor
   1/ 5 newstore
   1/ 5 bluestore
   1/ 5 bluefs
   1/ 3 bdev
   1/ 5 kstore
   4/ 5 rocksdb
   4/ 5 leveldb
   1/ 5 kinetic
   1/ 5 fuse
  -2/-2 (syslog threshold)
  99/99 (stderr threshold)
  max_recent     10000
  max_new         1000
  log_file
--- end dump of recent events ---
reraise_fatal: default handler for signal 11 didn't terminate the process?
Actions

Also available in: Atom PDF