Project

General

Profile

Actions

Bug #13511

closed

core dump during osd start

Added by Vaidyanath Manogaran over 8 years ago. Updated over 8 years ago.

Status:
Rejected
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
other
Tags:
Backport:
Regression:
No
Severity:
2 - major
Reviewed:
Affected Versions:
ceph-qa-suite:
rados
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

root@stmon:~# ceph-osd -i 90 -c /etc/ceph/ceph.conf --mkjournal --mkfs -f --debug-osd 20 --debug-ms 1
SG_IO: bad/missing sense data, sb[]: 70 00 05 00 00 00 00 0a 00 00 00 00 20 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
SG_IO: bad/missing sense data, sb[]: 70 00 05 00 00 00 00 0a 00 00 00 00 20 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
2015-10-16 04:03:29.668459 7fe932a05900 -1 journal Unable to read past sequence 2 but header indicates the journal has committed up through 3512, journal is corrupt
os/FileJournal.cc: In function 'bool FileJournal::read_entry(ceph::bufferlist&, uint64_t&, bool*)' thread 7fe932a05900 time 2015-10-16 04:03:29.668479
os/FileJournal.cc: 1780: FAILED assert(0)
ceph version 0.94.3 (95cefea9fd9ab740263bf8bb4796fd864d9afe2b)
1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x8b) [0xbc376b]
2: (FileJournal::read_entry(ceph::buffer::list&, unsigned long&, bool*)+0xae3) [0xa789b3]
3: (JournalingObjectStore::journal_replay(unsigned long)+0x191) [0x940711]
4: (FileStore::mount()+0x3bb6) [0x911786]
5: (OSD::mkfs(CephContext*, ObjectStore*, std::string const&, uuid_d, int)+0xf0) [0x68c020]
6: (main()+0xa1f) [0x65025f]
7: (__libc_start_main()+0xf5) [0x7fe92fb43ec5]
8: ceph-osd() [0x66b0d7]
NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
2015-10-16 04:03:29.673209 7fe932a05900 -1 os/FileJournal.cc: In function 'bool FileJournal::read_entry(ceph::bufferlist&, uint64_t&, bool*)' thread 7fe932a05900 time 2015-10-16 04:03:29.668479
os/FileJournal.cc: 1780: FAILED assert(0)

ceph version 0.94.3 (95cefea9fd9ab740263bf8bb4796fd864d9afe2b)
1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x8b) [0xbc376b]
2: (FileJournal::read_entry(ceph::buffer::list&, unsigned long&, bool*)+0xae3) [0xa789b3]
3: (JournalingObjectStore::journal_replay(unsigned long)+0x191) [0x940711]
4: (FileStore::mount()+0x3bb6) [0x911786]
5: (OSD::mkfs(CephContext*, ObjectStore*, std::string const&, uuid_d, int)+0xf0) [0x68c020]
6: (main()+0xa1f) [0x65025f]
7: (__libc_start_main()+0xf5) [0x7fe92fb43ec5]
8: ceph-osd() [0x66b0d7]
NOTE: a copy of the executable, or `objdump -rdS &lt;executable&gt;` is needed to interpret this.
-1> 2015-10-16 04:03:29.668459 7fe932a05900 -1 journal Unable to read past sequence 2 but header indicates the journal has committed up through 3512, journal is corrupt
0> 2015-10-16 04:03:29.673209 7fe932a05900 -1 os/FileJournal.cc: In function 'bool FileJournal::read_entry(ceph::bufferlist&, uint64_t&, bool*)' thread 7fe932a05900 time 2015-10-16 04:03:29.668479
os/FileJournal.cc: 1780: FAILED assert(0)
ceph version 0.94.3 (95cefea9fd9ab740263bf8bb4796fd864d9afe2b)
1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x8b) [0xbc376b]
2: (FileJournal::read_entry(ceph::buffer::list&, unsigned long&, bool*)+0xae3) [0xa789b3]
3: (JournalingObjectStore::journal_replay(unsigned long)+0x191) [0x940711]
4: (FileStore::mount()+0x3bb6) [0x911786]
5: (OSD::mkfs(CephContext*, ObjectStore*, std::string const&, uuid_d, int)+0xf0) [0x68c020]
6: (main()+0xa1f) [0x65025f]
7: (__libc_start_main()+0xf5) [0x7fe92fb43ec5]
8: ceph-osd() [0x66b0d7]
NOTE: a copy of the executable, or `objdump -rdS &lt;executable&gt;` is needed to interpret this.

2015-10-16 04:03:29.674677 7fe932a05900 -1 OSD::mkfs: caught unknown exception.
common/config.cc: In function 'void md_config_t::remove_observer(md_config_obs_t*)' thread 7fe932a05900 time 2015-10-16 04:03:29.674954
common/config.cc: 196: FAILED assert(found_obs)
ceph version 0.94.3 (95cefea9fd9ab740263bf8bb4796fd864d9afe2b)
1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x8b) [0xbc376b]
2: (md_config_t::remove_observer(md_config_obs_t*)+0xd4) [0xbe0094]
3: (ThreadPool::stop(bool)+0x1ce) [0xbb115e]
4: (FileStore::umount()+0x170) [0x8f11e0]
5: (OSD::mkfs(CephContext*, ObjectStore*, std::string const&, uuid_d, int)+0x6bd) [0x68c5ed]
6: (main()+0xa1f) [0x65025f]
7: (__libc_start_main()+0xf5) [0x7fe92fb43ec5]
8: ceph-osd() [0x66b0d7]
NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
2015-10-16 04:03:29.679166 7fe932a05900 -1 common/config.cc: In function 'void md_config_t::remove_observer(md_config_obs_t*)' thread 7fe932a05900 time 2015-10-16 04:03:29.674954
common/config.cc: 196: FAILED assert(found_obs)

ceph version 0.94.3 (95cefea9fd9ab740263bf8bb4796fd864d9afe2b)
1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x8b) [0xbc376b]
2: (md_config_t::remove_observer(md_config_obs_t*)+0xd4) [0xbe0094]
3: (ThreadPool::stop(bool)+0x1ce) [0xbb115e]
4: (FileStore::umount()+0x170) [0x8f11e0]
5: (OSD::mkfs(CephContext*, ObjectStore*, std::string const&, uuid_d, int)+0x6bd) [0x68c5ed]
6: (main()+0xa1f) [0x65025f]
7: (__libc_start_main()+0xf5) [0x7fe92fb43ec5]
8: ceph-osd() [0x66b0d7]
NOTE: a copy of the executable, or `objdump -rdS &lt;executable&gt;` is needed to interpret this.
-1> 2015-10-16 04:03:29.674677 7fe932a05900 -1 OSD::mkfs: caught unknown exception.
0> 2015-10-16 04:03:29.679166 7fe932a05900 -1 common/config.cc: In function 'void md_config_t::remove_observer(md_config_obs_t*)' thread 7fe932a05900 time 2015-10-16 04:03:29.674954
common/config.cc: 196: FAILED assert(found_obs)
ceph version 0.94.3 (95cefea9fd9ab740263bf8bb4796fd864d9afe2b)
1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x8b) [0xbc376b]
2: (md_config_t::remove_observer(md_config_obs_t*)+0xd4) [0xbe0094]
3: (ThreadPool::stop(bool)+0x1ce) [0xbb115e]
4: (FileStore::umount()+0x170) [0x8f11e0]
5: (OSD::mkfs(CephContext*, ObjectStore*, std::string const&, uuid_d, int)+0x6bd) [0x68c5ed]
6: (main()+0xa1f) [0x65025f]
7: (__libc_start_main()+0xf5) [0x7fe92fb43ec5]
8: ceph-osd() [0x66b0d7]
NOTE: a copy of the executable, or `objdump -rdS &lt;executable&gt;` is needed to interpret this.
terminate called after throwing an instance of 'ceph::FailedAssertion'
  • Caught signal (Aborted)
    in thread 7fe932a05900
    ceph version 0.94.3 (95cefea9fd9ab740263bf8bb4796fd864d9afe2b)
    1: ceph-osd() [0xacb49a]
    2: (()+0x10340) [0x7fe9316b9340]
    3: (gsignal()+0x39) [0x7fe92fb58cc9]
    4: (abort()+0x148) [0x7fe92fb5c0d8]
    5: (_gnu_cxx::_verbose_terminate_handler()+0x155) [0x7fe930463535]
    6: (()+0x5e6d6) [0x7fe9304616d6]
    7: (()+0x5e703) [0x7fe930461703]
    8: (()+0x5e922) [0x7fe930461922]
    9: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x278) [0xbc3958]
    10: (md_config_t::remove_observer(md_config_obs_t*)+0xd4) [0xbe0094]
    11: (ThreadPool::stop(bool)+0x1ce) [0xbb115e]
    12: (FileStore::umount()+0x170) [0x8f11e0]
    13: (OSD::mkfs(CephContext*, ObjectStore*, std::string const&, uuid_d, int)+0x6bd) [0x68c5ed]
    14: (main()+0xa1f) [0x65025f]
    15: (__libc_start_main()+0xf5) [0x7fe92fb43ec5]
    16: ceph-osd() [0x66b0d7]
    2015-10-16 04:03:29.684739 7fe932a05900 -1
    Caught signal (Aborted) *
    in thread 7fe932a05900
ceph version 0.94.3 (95cefea9fd9ab740263bf8bb4796fd864d9afe2b)
1: ceph-osd() [0xacb49a]
2: (()+0x10340) [0x7fe9316b9340]
3: (gsignal()+0x39) [0x7fe92fb58cc9]
4: (abort()+0x148) [0x7fe92fb5c0d8]
5: (_gnu_cxx::_verbose_terminate_handler()+0x155) [0x7fe930463535]
6: (()+0x5e6d6) [0x7fe9304616d6]
7: (()+0x5e703) [0x7fe930461703]
8: (()+0x5e922) [0x7fe930461922]
9: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x278) [0xbc3958]
10: (md_config_t::remove_observer(md_config_obs_t*)+0xd4) [0xbe0094]
11: (ThreadPool::stop(bool)+0x1ce) [0xbb115e]
12: (FileStore::umount()+0x170) [0x8f11e0]
13: (OSD::mkfs(CephContext*, ObjectStore*, std::string const&, uuid_d, int)+0x6bd) [0x68c5ed]
14: (main()+0xa1f) [0x65025f]
15: (__libc_start_main()+0xf5) [0x7fe92fb43ec5]
16: ceph-osd() [0x66b0d7]
NOTE: a copy of the executable, or `objdump -rdS &lt;executable&gt;` is needed to interpret this.
0> 2015-10-16 04:03:29.684739 7fe932a05900 -1 ** Caught signal (Aborted) *
in thread 7fe932a05900
ceph version 0.94.3 (95cefea9fd9ab740263bf8bb4796fd864d9afe2b)
1: ceph-osd() [0xacb49a]
2: (()+0x10340) [0x7fe9316b9340]
3: (gsignal()+0x39) [0x7fe92fb58cc9]
4: (abort()+0x148) [0x7fe92fb5c0d8]
5: (_gnu_cxx::_verbose_terminate_handler()+0x155) [0x7fe930463535]
6: (()+0x5e6d6) [0x7fe9304616d6]
7: (()+0x5e703) [0x7fe930461703]
8: (()+0x5e922) [0x7fe930461922]
9: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x278) [0xbc3958]
10: (md_config_t::remove_observer(md_config_obs_t*)+0xd4) [0xbe0094]
11: (ThreadPool::stop(bool)+0x1ce) [0xbb115e]
12: (FileStore::umount()+0x170) [0x8f11e0]
13: (OSD::mkfs(CephContext*, ObjectStore*, std::string const&, uuid_d, int)+0x6bd) [0x68c5ed]
14: (main()+0xa1f) [0x65025f]
15: (__libc_start_main()+0xf5) [0x7fe92fb43ec5]
16: ceph-osd() [0x66b0d7]
NOTE: a copy of the executable, or `objdump -rdS &lt;executable&gt;` is needed to interpret this.

Aborted (core dumped)


Related issues 1 (0 open1 closed)

Has duplicate Ceph - Bug #13989: OSD boot fails with os/FileJournal.cc: 1907: FAILED assert(0)Duplicate12/05/2015

Actions
Actions

Also available in: Atom PDF