Project

General

Profile

Actions

Bug #23250

closed

mds: crash during replay: interval_set.h: 396: FAILED assert(p->first > start+len)

Added by Christoffer Lilja about 6 years ago. Updated over 5 years ago.

Status:
Closed
Priority:
Normal
Assignee:
Category:
-
Target version:
-
% Done:

0%

Source:
Community (user)
Tags:
Backport:
Regression:
No
Severity:
2 - major
Reviewed:
Affected Versions:
ceph-qa-suite:
fs
Component(FS):
MDS
Labels (FS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

MDS crash during replay
Full log attached.

starting mds.orbit at -
/build/ceph-12.2.4/src/include/interval_set.h: In function 'void interval_set<T>::insert(T, T, T*, T*) [with T = inodeno_t]' thread 7fba87588700 time 2018-03-06 18:47:24.258340
/build/ceph-12.2.4/src/include/interval_set.h: 396: FAILED assert(p->first > start+len)
 ceph version 12.2.4 (52085d5249a80c5f5121a76d6288429f35e4e77b) luminous (stable)
 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x102) [0x55ebf9502942]
 2: (InoTable::replay_release_ids(interval_set<inodeno_t>&)+0x9f7) [0x55ebf94051a7]
 3: (ESession::replay(MDSRank*)+0x3f0) [0x55ebf9491020]
 4: (MDLog::_replay_thread()+0xc6b) [0x55ebf94554bb]
 5: (MDLog::ReplayThread::entry()+0xd) [0x55ebf91d0fcd]
 6: (()+0x76ba) [0x7fba9488a6ba]
 7: (clone()+0x6d) [0x7fba938f641d]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
2018-03-06 18:47:24.259376 7fba87588700 -1 /build/ceph-12.2.4/src/include/interval_set.h: In function 'void interval_set<T>::insert(T, T, T*, T*) [with T = inodeno_t]' thread 7fba87588700 time 2018-03-06 18:47:24.258340
/build/ceph-12.2.4/src/include/interval_set.h: 396: FAILED assert(p->first > start+len)

 ceph version 12.2.4 (52085d5249a80c5f5121a76d6288429f35e4e77b) luminous (stable)
 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x102) [0x55ebf9502942]
 2: (InoTable::replay_release_ids(interval_set<inodeno_t>&)+0x9f7) [0x55ebf94051a7]
 3: (ESession::replay(MDSRank*)+0x3f0) [0x55ebf9491020]
 4: (MDLog::_replay_thread()+0xc6b) [0x55ebf94554bb]
 5: (MDLog::ReplayThread::entry()+0xd) [0x55ebf91d0fcd]
 6: (()+0x76ba) [0x7fba9488a6ba]
 7: (clone()+0x6d) [0x7fba938f641d]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

     0> 2018-03-06 18:47:24.259376 7fba87588700 -1 /build/ceph-12.2.4/src/include/interval_set.h: In function 'void interval_set<T>::insert(T, T, T*, T*) [with T = inodeno_t]' thread 7fba87588700 time 2018-03-06 18:47:24.258340
/build/ceph-12.2.4/src/include/interval_set.h: 396: FAILED assert(p->first > start+len)

 ceph version 12.2.4 (52085d5249a80c5f5121a76d6288429f35e4e77b) luminous (stable)
 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x102) [0x55ebf9502942]
 2: (InoTable::replay_release_ids(interval_set<inodeno_t>&)+0x9f7) [0x55ebf94051a7]
 3: (ESession::replay(MDSRank*)+0x3f0) [0x55ebf9491020]
 4: (MDLog::_replay_thread()+0xc6b) [0x55ebf94554bb]
 5: (MDLog::ReplayThread::entry()+0xd) [0x55ebf91d0fcd]
 6: (()+0x76ba) [0x7fba9488a6ba]
 7: (clone()+0x6d) [0x7fba938f641d]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

*** Caught signal (Aborted) **
 in thread 7fba87588700 thread_name:md_log_replay
 ceph version 12.2.4 (52085d5249a80c5f5121a76d6288429f35e4e77b) luminous (stable)
 1: (()+0x5ab254) [0x55ebf94bc254]
 2: (()+0x11390) [0x7fba94894390]
 3: (gsignal()+0x38) [0x7fba93824428]
 4: (abort()+0x16a) [0x7fba9382602a]
 5: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x28e) [0x55ebf9502ace]
 6: (InoTable::replay_release_ids(interval_set<inodeno_t>&)+0x9f7) [0x55ebf94051a7]
 7: (ESession::replay(MDSRank*)+0x3f0) [0x55ebf9491020]
 8: (MDLog::_replay_thread()+0xc6b) [0x55ebf94554bb]
 9: (MDLog::ReplayThread::entry()+0xd) [0x55ebf91d0fcd]
 10: (()+0x76ba) [0x7fba9488a6ba]
 11: (clone()+0x6d) [0x7fba938f641d]
2018-03-06 18:47:24.261559 7fba87588700 -1 *** Caught signal (Aborted) **
 in thread 7fba87588700 thread_name:md_log_replay

 ceph version 12.2.4 (52085d5249a80c5f5121a76d6288429f35e4e77b) luminous (stable)
 1: (()+0x5ab254) [0x55ebf94bc254]
 2: (()+0x11390) [0x7fba94894390]
 3: (gsignal()+0x38) [0x7fba93824428]
 4: (abort()+0x16a) [0x7fba9382602a]
 5: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x28e) [0x55ebf9502ace]
 6: (InoTable::replay_release_ids(interval_set<inodeno_t>&)+0x9f7) [0x55ebf94051a7]
 7: (ESession::replay(MDSRank*)+0x3f0) [0x55ebf9491020]
 8: (MDLog::_replay_thread()+0xc6b) [0x55ebf94554bb]
 9: (MDLog::ReplayThread::entry()+0xd) [0x55ebf91d0fcd]
 10: (()+0x76ba) [0x7fba9488a6ba]
 11: (clone()+0x6d) [0x7fba938f641d]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

     0> 2018-03-06 18:47:24.261559 7fba87588700 -1 *** Caught signal (Aborted) **
 in thread 7fba87588700 thread_name:md_log_replay

 ceph version 12.2.4 (52085d5249a80c5f5121a76d6288429f35e4e77b) luminous (stable)
 1: (()+0x5ab254) [0x55ebf94bc254]
 2: (()+0x11390) [0x7fba94894390]
 3: (gsignal()+0x38) [0x7fba93824428]
 4: (abort()+0x16a) [0x7fba9382602a]
 5: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x28e) [0x55ebf9502ace]
 6: (InoTable::replay_release_ids(interval_set<inodeno_t>&)+0x9f7) [0x55ebf94051a7]
 7: (ESession::replay(MDSRank*)+0x3f0) [0x55ebf9491020]
 8: (MDLog::_replay_thread()+0xc6b) [0x55ebf94554bb]
 9: (MDLog::ReplayThread::entry()+0xd) [0x55ebf91d0fcd]
 10: (()+0x76ba) [0x7fba9488a6ba]
 11: (clone()+0x6d) [0x7fba938f641d]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

Aborted (core dumped)

Files

ceph-mds.orbit.log.gz (312 KB) ceph-mds.orbit.log.gz Christoffer Lilja, 03/06/2018 06:17 PM
Actions

Also available in: Atom PDF