Bug #10287

closed

Bug #9889: mon: leveldb weirdness

ceph v0.80.7 ceph-mon --mkfs crash

Added by wei li over 9 years ago. Updated over 9 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:
0%

Source:
other
Tags:
Backport:
Regression:
Severity:
2 - major
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Ceph v0.80.7 on a freshly installed machine running "CentOS Linux release 7.0.1406 (Core)"; the RPMs were built on the same OS platform.
Running ceph-mon --mkfs -i 0 crashes and leaves the following dump in the log:

2014-12-10 00:08:13.245832 7fa47c4437c0 -1 *** Caught signal (Aborted) **
 in thread 7fa47c4437c0

 ceph version  ()
 1: ceph-mon() [0x8828d6]
 2: (()+0xf130) [0x7fa47b5df130]
 3: (gsignal()+0x39) [0x7fa47a201989]
 4: (abort()+0x148) [0x7fa47a203098]
 5: (()+0x156b7) [0x7fa47bd856b7]
 6: (leveldb::port::Mutex::Unlock()+0) [0x7fa47bdb60b0]
 7: (leveldb::DBImpl::~DBImpl()+0x2a) [0x7fa47bd8ae1a]
 8: (leveldb::DBImpl::~DBImpl()+0x9) [0x7fa47bd8b239]
 9: (LevelDBStore::LevelDBWholeSpaceIteratorImpl::lower_bound(std::string const&, std::string const&)+0x49) [0x8397b9]
 10: (MonitorDBStore::exists(std::string const&, std::string const&)+0xe9) [0x53e399]
 11: (Monitor::check_fsid()+0x5d) [0x55678d]
 12: (Monitor::mkfs(ceph::buffer::list&)+0x47) [0x557f27]
 13: (main()+0x706) [0x52c286]
 14: (__libc_start_main()+0xf5) [0x7fa47a1edaf5]
 15: ceph-mon() [0x539e19]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

--- begin dump of recent events ---
   -14> 2014-12-10 00:08:13.238746 7fa47c4437c0  5 asok(0x3508d20) register_command perfcounters_dump hook 0x34c8010
   -13> 2014-12-10 00:08:13.238784 7fa47c4437c0  5 asok(0x3508d20) register_command 1 hook 0x34c8010
   -12> 2014-12-10 00:08:13.238790 7fa47c4437c0  5 asok(0x3508d20) register_command perf dump hook 0x34c8010
   -11> 2014-12-10 00:08:13.238795 7fa47c4437c0  5 asok(0x3508d20) register_command perfcounters_schema hook 0x34c8010
   -10> 2014-12-10 00:08:13.238801 7fa47c4437c0  5 asok(0x3508d20) register_command 2 hook 0x34c8010
    -9> 2014-12-10 00:08:13.238803 7fa47c4437c0  5 asok(0x3508d20) register_command perf schema hook 0x34c8010
    -8> 2014-12-10 00:08:13.238806 7fa47c4437c0  5 asok(0x3508d20) register_command config show hook 0x34c8010
    -7> 2014-12-10 00:08:13.238809 7fa47c4437c0  5 asok(0x3508d20) register_command config set hook 0x34c8010
    -6> 2014-12-10 00:08:13.238814 7fa47c4437c0  5 asok(0x3508d20) register_command config get hook 0x34c8010
    -5> 2014-12-10 00:08:13.238816 7fa47c4437c0  5 asok(0x3508d20) register_command log flush hook 0x34c8010
    -4> 2014-12-10 00:08:13.238818 7fa47c4437c0  5 asok(0x3508d20) register_command log dump hook 0x34c8010
    -3> 2014-12-10 00:08:13.238825 7fa47c4437c0  5 asok(0x3508d20) register_command log reopen hook 0x34c8010
    -2> 2014-12-10 00:08:13.244926 7fa47c4437c0  5 adding auth protocol: none
    -1> 2014-12-10 00:08:13.244935 7fa47c4437c0  5 adding auth protocol: none
     0> 2014-12-10 00:08:13.245832 7fa47c4437c0 -1 *** Caught signal (Aborted) **
 in thread 7fa47c4437c0

 ceph version  ()
 1: ceph-mon() [0x8828d6]
 2: (()+0xf130) [0x7fa47b5df130]
 3: (gsignal()+0x39) [0x7fa47a201989]
 4: (abort()+0x148) [0x7fa47a203098]
 5: (()+0x156b7) [0x7fa47bd856b7]
 6: (leveldb::port::Mutex::Unlock()+0) [0x7fa47bdb60b0]
 7: (leveldb::DBImpl::~DBImpl()+0x2a) [0x7fa47bd8ae1a]
 8: (leveldb::DBImpl::~DBImpl()+0x9) [0x7fa47bd8b239]
 9: (LevelDBStore::LevelDBWholeSpaceIteratorImpl::lower_bound(std::string const&, std::string const&)+0x49) [0x8397b9]
 10: (MonitorDBStore::exists(std::string const&, std::string const&)+0xe9) [0x53e399]
 11: (Monitor::check_fsid()+0x5d) [0x55678d]
 12: (Monitor::mkfs(ceph::buffer::list&)+0x47) [0x557f27]
 13: (main()+0x706) [0x52c286]
 14: (__libc_start_main()+0xf5) [0x7fa47a1edaf5]
 15: ceph-mon() [0x539e19]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

--- logging levels ---
   0/ 5 none
   0/ 1 lockdep
   0/ 1 context
   1/ 1 crush
   1/ 5 mds
   1/ 5 mds_balancer
   1/ 5 mds_locker
   1/ 5 mds_log
   1/ 5 mds_log_expire
   1/ 5 mds_migrator
   0/ 1 buffer
   0/ 1 timer
   0/ 1 filer
   0/ 1 striper
   0/ 1 objecter
   0/ 5 rados
   0/ 5 rbd
   0/ 5 journaler
   0/ 5 objectcacher
   0/ 5 client
   0/ 5 osd
   0/ 5 optracker
   0/ 5 objclass
   1/ 3 filestore
   1/ 3 keyvaluestore
   1/ 3 journal
   0/ 5 ms
   1/ 5 mon
   0/10 monc
   1/ 5 paxos
   0/ 5 tp
   1/ 5 auth
   1/ 5 crypto
   1/ 1 finisher
   1/ 5 heartbeatmap
   1/ 5 perfcounter
   1/ 5 rgw
   1/ 5 javaclient
   1/ 5 asok
   1/ 1 throttle
  -2/-2 (syslog threshold)
  -1/-1 (stderr threshold)
  max_recent     10000
  max_new         1000
  log_file /var/log/ceph/ceph-mon.0.log
--- end dump of recent events ---
2014-12-10 10:44:35.704112 7f5ec9e547c0  0 ceph version  (), process ceph-mon, pid 9306
2014-12-10 10:44:35.721588 7f5ec9e547c0 -1 *** Caught signal (Aborted) **
 in thread 7f5ec9e547c0

On this machine the leveldb version is 1.16.0:

[root@ceph-1 ceph]# rpm -aq |grep leveldb
libleveldb1-1.16.0-1.2.x86_64
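
The rpm query above shows which leveldb package is installed. Because the leveldb frames in the backtrace sit at shared-library addresses, it may also help to confirm which libleveldb the ceph-mon binary resolves at run time. The following is a minimal sketch, not something from the original report: the helper name linked_leveldb is hypothetical, and it assumes the usual "name => path (address)" layout of ldd output.

```shell
#!/bin/sh
# Hypothetical helper (not part of Ceph): read `ldd` output on stdin and
# print the resolved path of any libleveldb shared object.
linked_leveldb() {
  awk '/libleveldb/ && $2 == "=>" { print $3 }'
}

# Usage on the affected host (assumes ceph-mon is on PATH):
#   ldd "$(command -v ceph-mon)" | linked_leveldb
# The owning package can then be looked up with `rpm -qf <path>`.
```

If the printed path belongs to a different leveldb build than the one the RPMs were compiled against, that mismatch would be a plausible place to look first.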

Actions #1

Updated by wei li over 9 years ago

ceph.conf file

[root@ceph-1 ceph]# cat /etc/ceph/ceph.conf 
[global]
  auth cluster required = none
  auth service required = none
  auth client required = none
  keyring = /etc/ceph/keyring
  cluster network = 10.181.19.1/23
  public network = 10.181.19.1/23
  osd pool default pg num = 1024
  osd pool default pgp num = 1024
  osd pool default size = 2

  fsid = a3972be0-382d-4227-a3d0-c6e98c18b818

[mon]
  mon data = /var/lib/ceph/mon/mon.$id

[osd]
  osd journal size = 4096
  filestore flusher = false
  osd data = /var/lib/ceph/osd/ceph-$id
  osd journal = /var/lib/ceph/osd/ceph-$id/journal
  osd mkfs type = xfs
  osd mkfs options xfs = -f 
  osd mount options xfs = rw,noatime
  keyring = /var/lib/ceph/osd/ceph-$id/keyring

[mon.0]
  host = ceph-1
  mon addr = 10.181.19.1:6789

[mon.1]
  host = ceph-2
  mon addr = 10.181.19.2:6789

[mon.2]
  host = ceph-3
  mon addr = 10.181.19.3:6789

Actions #2

Updated by wei li over 9 years ago

After changing to leveldb 1.12, everything works fine. Please close this issue.
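
As a rough guard against hitting the same crash again, a host could be checked for a leveldb newer than the version that worked here. This is only a sketch under stated assumptions: the helper names are hypothetical, the 1.12 threshold comes solely from this report and is not an official compatibility statement, the name parsing only handles package names without a hyphen in the name part, and the comparison relies on GNU sort -V.

```shell
#!/bin/sh
# Hypothetical helpers; names are illustrative, not part of Ceph or leveldb.

# Extract the upstream version from an rpm package name,
# e.g. libleveldb1-1.16.0-1.2.x86_64 -> 1.16.0
leveldb_version() {
  printf '%s\n' "$1" | sed -E 's/^[^-]+-([0-9.]+)-.*$/\1/'
}

# Succeed if version $1 is strictly newer than version $2 (GNU `sort -V`).
version_newer_than() {
  [ "$1" != "$2" ] &&
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | tail -n 1)" = "$1" ]
}

pkg="libleveldb1-1.16.0-1.2.x86_64"   # as reported by `rpm -aq | grep leveldb`
ver="$(leveldb_version "$pkg")"
if version_newer_than "$ver" "1.12"; then
  echo "leveldb $ver is newer than 1.12, the version reported to work here"
fi
```

On a live system, rpm -q --queryformat '%{VERSION}\n' followed by the package name returns the version directly and avoids parsing the package name.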

Actions #3

Updated by Samuel Just over 9 years ago

  • Status changed from New to Resolved

Actions #4

Updated by Joao Eduardo Luis over 9 years ago

  • Parent task set to #9889