Bug #924

closed

Various OSD/FileStore bugs

Added by Greg Farnum about 13 years ago. Updated about 13 years ago.

Status: Can't reproduce
Priority: Normal
Assignee: -
Category: OSD
% Done: 0%

Description

Via Martin Wilderoth, we have a number of logs with crashes. There is not much debug output in them, though: just messages and the backtraces.
I suspect these may be configuration issues, but I don't know.

The logs are available at kai:~gregf/logs/host[1,2]-ceph-logs.tar. (Too large to upload here.)
Backtraces are:

2011-03-21 22:10:11.271281 7f0077984700 -- 10.0.6.10:6804/2341 <== mon0 10.0.6.10:6789/0 4 ==== osd_map(552,553) v1 ==== 837+0+0 (3253532737 0 0) 0x2380e00 con 0x2b833c0
os/FileStore.cc: In function 'void FileStore::sync_entry()', in thread '0x7f007b98c700'
os/FileStore.cc: 2580: FAILED assert(r == 0)
 ceph version 0.25.1 (commit:7f4a161e7fd1bb0d0eab3958b427a3c4c585bc05)
 1: (FileStore::sync_entry()+0x198d) [0x59663d]
 2: (FileStore::SyncThread::entry()+0xd) [0x5a277d]
 3: (()+0x68ba) [0x7f0081fe98ba]
 4: (clone()+0x6d) [0x7f0080c7e02d]

2011-03-21 22:15:44.220099 7f2eee709700 -- 0.0.0.0:6802/2242 <== osd3 10.0.6.11:6805/1725 145 ==== PGq v1 ==== 26730+0+0 (2176933432 0 0) 0x1f35000 con 0x193e500
osd/ReplicatedPG.cc: In function 'virtual bool ReplicatedPG::snap_trimmer()', in thread '0x7f2eeb602700'
osd/ReplicatedPG.cc: 693: FAILED assert(r == 0)
 ceph version 0.25.1 (commit:7f4a161e7fd1bb0d0eab3958b427a3c4c585bc05)
 1: (ReplicatedPG::snap_trimmer()+0x57f3) [0x4a77d3]
 2: (ThreadPool::worker()+0x519) [0x602669]
 3: (ThreadPool::WorkThread::entry()+0xd) [0x51fd3d]
 4: (()+0x68ba) [0x7f2ef956f8ba]
 5: (clone()+0x6d) [0x7f2ef820402d]

2011-03-21 22:09:59.344377 7fa9498b4700 -- 0.0.0.0:6802/1645 mark_down 10.0.6.10:6805/1389 -- pipe dne
os/FileStore.cc: In function 'void FileStore::sync_entry()', in thread '0x7fa94d8bc700'
os/FileStore.cc: 2577: FAILED assert(r == 0)
 ceph version 0.25 (commit:3fa121bf0a8c142897a684c92a66b98d7db1672e)
 1: (FileStore::sync_entry()+0x194d) [0x59867d]
 2: (FileStore::SyncThread::entry()+0xd) [0x5a224d]
 3: (()+0x68ba) [0x7fa953f198ba]
 4: (clone()+0x6d) [0x7fa952bae02d]
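
Since the logs contain little beyond messages and backtraces, a quick tally of the failing assertion sites can help show which crashes repeat across hosts. The following is a minimal sketch, not part of the original report: the function name `summarize_asserts` is hypothetical, and the pattern assumes the `file: line: FAILED assert(...)` layout seen in the backtraces above.

```python
import re
import sys
from collections import Counter

# Matches lines such as "os/FileStore.cc: 2580: FAILED assert(r == 0)".
# The layout is assumed from the backtraces quoted in this report.
ASSERT_RE = re.compile(
    r'^\s*(?P<file>\S+): (?P<line>\d+): FAILED assert\((?P<cond>.*)\)\s*$'
)


def summarize_asserts(lines):
    """Count failed assertions by (source file, line number, condition)."""
    counts = Counter()
    for text in lines:
        m = ASSERT_RE.match(text)
        if m:
            key = (m.group("file"), int(m.group("line")), m.group("cond"))
            counts[key] += 1
    return counts


if __name__ == "__main__":
    # Read log text from stdin and print one summary row per assertion site.
    for (src, line, cond), n in sorted(summarize_asserts(sys.stdin).items()):
        print(f"{n:4d}  {src}:{line}  assert({cond})")
```

Piping an extracted OSD log into the script on stdin prints one row per distinct assertion site with its count.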
Actions #1

Updated by Samuel Just about 13 years ago

os/FileStore.cc: In function 'unsigned int FileStore::_do_transaction(ObjectStore::Transaction&)', in thread '0x7fab375de700'
os/FileStore.cc: 1994: FAILED assert(0 == "ENOTEMPTY suggests garbage data in osd data dir")
ceph version 0.25.1 (7f4a161e7fd1bb0d0eab3958b427a3c4c585bc05)
1: (FileStore::_do_transaction(ObjectStore::Transaction&)+0x475) [0x58e445]
2: (FileStore::do_transactions(std::list<ObjectStore::Transaction*, std::allocator<ObjectStore::Transaction*> >&, unsigned long)+0x145) [0x592745]
3: (FileStore::_do_op(FileStore::OpSequencer*)+0x13e) [0x57db5e]
4: (ThreadPool::worker()+0x519) [0x602669]
5: (ThreadPool::WorkThread::entry()+0xd) [0x51fd3d]
6: (()+0x68ba) [0x7fab3ec3d8ba]
7: (clone()+0x6d) [0x7fab3d8d202d]
*** Caught signal (Aborted) **
in thread 0x7fab375de700
ceph version 0.25.1 (7f4a161e7fd1bb0d0eab3958b427a3c4c585bc05)
1: /usr/bin/cosd() [0x6178ac]
2: (()+0xef60) [0x7fab3ec45f60]
3: (gsignal()+0x35) [0x7fab3d835165]
4: (abort()+0x180) [0x7fab3d837f70]
5: (__gnu_cxx::__verbose_terminate_handler()+0x115) [0x7fab3e0c8dc5]
6: (()+0xcb166) [0x7fab3e0c7166]
7: (()+0xcb193) [0x7fab3e0c7193]
8: (()+0xcb28e) [0x7fab3e0c728e]
9: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x373) [0x5ff803]
10: (FileStore::_do_transaction(ObjectStore::Transaction&)+0x475) [0x58e445]
11: (FileStore::do_transactions(std::list<ObjectStore::Transaction*, std::allocator<ObjectStore::Transaction*> >&, unsigned long)+0x145) [0x592745]
12: (FileStore::_do_op(FileStore::OpSequencer*)+0x13e) [0x57db5e]
13: (ThreadPool::worker()+0x519) [0x602669]
14: (ThreadPool::WorkThread::entry()+0xd) [0x51fd3d]
15: (()+0x68ba) [0x7fab3ec3d8ba]
16: (clone()+0x6d) [0x7fab3d8d202d]
Actions #2

Updated by Sage Weil about 13 years ago

  • Target version changed from v0.26 to v0.27
Actions #3

Updated by Sage Weil about 13 years ago

  • Status changed from New to Can't reproduce
Actions #5

Updated by Samuel Just about 13 years ago

This actually may have been fixed in e26626a8b4d91bb827a4909fd7f45f2c5f06f331.
