Project

General

Profile

Actions

Bug #11558

closed

assert(0 == "hit suicide timeout") in HeartbeatMap due to dead lock in rocksdb

Added by Xinze Chi almost 9 years ago. Updated almost 9 years ago.

Status:
Won't Fix
Priority:
Urgent
Assignee:
-
Category:
OSD
Target version:
% Done:

0%

Source:
other
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

hi, all:

when I use kvstore(I use rocksdb as its kv impl). the ceph verion is 0.94, the rocksdb 05da593(which is default branch in hammer).

There is bug.

The track log is:

#1  0x00000000009e701d in rocksdb::port::CondVar::Wait (this=this@entry=0x41d08b8) at port/port_posix.cc:80
#2  0x00000000009b4a00 in rocksdb::DBImpl::MakeRoomForWrite (this=this@entry=0x41d0780, cfd=cfd@entry=0x40c0700, force=false, superversions_to_free=superversions_to_free@entry=0x7f69cb405490, 
     logs_to_free=logs_to_free@entry=0x7f69cb4054f0) at db/db_impl.cc:3958
#3  0x00000000009bd1cf in rocksdb::DBImpl::Write (this=0x41d0780, options=..., my_batch=0x4087760) at db/db_impl.cc:3671
#4  0x00000000009a2e60 in RocksDBStore::submit_transaction_sync (this=0x41c0b60, t=...) at os/RocksDBStore.cc:231

#1  0x00000000009e701d in rocksdb::port::CondVar::Wait (this=this@entry=0x7f69cac04450) at port/port_posix.cc:80
#2  0x00000000009bd044 in rocksdb::DBImpl::Write (this=0x41d0780, options=..., my_batch=0x4084fb0) at db/db_impl.cc:3632
#3  0x00000000009a2e60 in RocksDBStore::submit_transaction_sync (this=0x41c0b60, t=...) at os/RocksDBStore.cc:231
#4  0x000000000094694a in submit_transaction_sync (t=..., this=<optimized out>) at os/GenericObjectMap.h:133

And then when I change the rocksdb version to 6ca7bef(default branch ceph master brach use), everyting go well.

Actions

Also available in: Atom PDF