Bug #924
Various OSD/FileStore bugs (closed)
% Done: 0%
Description
Via Martin Wilderoth, we have a number of logs with crashes. There is not much debug output in them, though -- just messages and the backtraces.
I suspect these may be configuration issues, but don't know.
The logs are available at kai:~gregf/logs/host[1,2]-ceph-logs.tar. (Too large to upload here.)
Backtraces are:
2011-03-21 22:10:11.271281 7f0077984700 -- 10.0.6.10:6804/2341 <== mon0 10.0.6.10:6789/0 4 ==== osd_map(552,553) v1 ==== 837+0+0 (3253532737 0 0) 0x2380e00 con 0x2b833c0
os/FileStore.cc: In function 'void FileStore::sync_entry()', in thread '0x7f007b98c700'
os/FileStore.cc: 2580: FAILED assert(r == 0)
ceph version 0.25.1 (commit:7f4a161e7fd1bb0d0eab3958b427a3c4c585bc05)
1: (FileStore::sync_entry()+0x198d) [0x59663d]
2: (FileStore::SyncThread::entry()+0xd) [0x5a277d]
3: (()+0x68ba) [0x7f0081fe98ba]
4: (clone()+0x6d) [0x7f0080c7e02d]

2011-03-21 22:15:44.220099 7f2eee709700 -- 0.0.0.0:6802/2242 <== osd3 10.0.6.11:6805/1725 145 ==== PGq v1 ==== 26730+0+0 (2176933432 0 0) 0x1f35000 con 0x193e500
osd/ReplicatedPG.cc: In function 'virtual bool ReplicatedPG::snap_trimmer()', in thread '0x7f2eeb602700'
osd/ReplicatedPG.cc: 693: FAILED assert(r == 0)
ceph version 0.25.1 (commit:7f4a161e7fd1bb0d0eab3958b427a3c4c585bc05)
1: (ReplicatedPG::snap_trimmer()+0x57f3) [0x4a77d3]
2: (ThreadPool::worker()+0x519) [0x602669]
3: (ThreadPool::WorkThread::entry()+0xd) [0x51fd3d]
4: (()+0x68ba) [0x7f2ef956f8ba]
5: (clone()+0x6d) [0x7f2ef820402d]

2011-03-21 22:09:59.344377 7fa9498b4700 -- 0.0.0.0:6802/1645 mark_down 10.0.6.10:6805/1389 -- pipe dne
os/FileStore.cc: In function 'void FileStore::sync_entry()', in thread '0x7fa94d8bc700'
os/FileStore.cc: 2577: FAILED assert(r == 0)
ceph version 0.25 (commit:3fa121bf0a8c142897a684c92a66b98d7db1672e)
1: (FileStore::sync_entry()+0x194d) [0x59867d]
2: (FileStore::SyncThread::entry()+0xd) [0x5a224d]
3: (()+0x68ba) [0x7fa953f198ba]
4: (clone()+0x6d) [0x7fa952bae02d]
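Since the logs carry little beyond the backtraces, it may help to pull the frames out mechanically so that traces from different hosts and versions can be compared. The following is only a rough sketch: `parse_frames` and `FRAME_RE` are hypothetical names introduced here, and the pattern covers only frames of the form `N: (symbol+0xoffset) [0xaddress]` as they appear above. Other frame shapes are skipped.

```python
import re

# Matches frames such as "1: (FileStore::sync_entry()+0x198d) [0x59663d]".
# Frames without a "(symbol+0xoffset)" part are not matched.
FRAME_RE = re.compile(r"(\d+): \((.*?)\+0x([0-9a-f]+)\) \[0x([0-9a-f]+)\]")


def parse_frames(text):
    """Return a list of (index, symbol, offset, address) tuples found in text."""
    return [
        (int(index), symbol, int(offset, 16), int(address, 16))
        for index, symbol, offset, address in FRAME_RE.findall(text)
    ]


if __name__ == "__main__":
    trace = (
        "1: (FileStore::sync_entry()+0x198d) [0x59663d] "
        "2: (FileStore::SyncThread::entry()+0xd) [0x5a277d] "
        "3: (()+0x68ba) [0x7f0081fe98ba] "
        "4: (clone()+0x6d) [0x7f0080c7e02d]"
    )
    for index, symbol, offset, address in parse_frames(trace):
        print(index, symbol, hex(offset), hex(address))
```

Comparing only the symbol column shows, for example, that the first and third backtraces go through the same functions and differ in offsets and addresses, which is consistent with them coming from different builds (0.25.1 and 0.25).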
Updated by Samuel Just about 13 years ago
os/FileStore.cc: In function 'unsigned int FileStore::_do_transaction(ObjectStore::Transaction&)', in thread '0x7fab375de700'
os/FileStore.cc: 1994: FAILED assert(0 == "ENOTEMPTY suggests garbage data in osd data dir")
ceph version 0.25.1 (7f4a161e7fd1bb0d0eab3958b427a3c4c585bc05)
1: (FileStore::_do_transaction(ObjectStore::Transaction&)+0x475) [0x58e445]
2: (FileStore::do_transactions(std::list<ObjectStore::Transaction*, std::allocator<ObjectStore::Transaction*> >&, unsigned long)+0x145) [0x592745]
3: (FileStore::_do_op(FileStore::OpSequencer*)+0x13e) [0x57db5e]
4: (ThreadPool::worker()+0x519) [0x602669]
5: (ThreadPool::WorkThread::entry()+0xd) [0x51fd3d]
6: (()+0x68ba) [0x7fab3ec3d8ba]
7: (clone()+0x6d) [0x7fab3d8d202d]
*** Caught signal (Aborted) **
in thread 0x7fab375de700
ceph version 0.25.1 (7f4a161e7fd1bb0d0eab3958b427a3c4c585bc05)
1: /usr/bin/cosd() [0x6178ac]
2: (()+0xef60) [0x7fab3ec45f60]
3: (gsignal()+0x35) [0x7fab3d835165]
4: (abort()+0x180) [0x7fab3d837f70]
5: (__gnu_cxx::__verbose_terminate_handler()+0x115) [0x7fab3e0c8dc5]
6: (()+0xcb166) [0x7fab3e0c7166]
7: (()+0xcb193) [0x7fab3e0c7193]
8: (()+0xcb28e) [0x7fab3e0c728e]
9: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x373) [0x5ff803]
10: (FileStore::_do_transaction(ObjectStore::Transaction&)+0x475) [0x58e445]
11: (FileStore::do_transactions(std::list<ObjectStore::Transaction*, std::allocator<ObjectStore::Transaction*> >&, unsigned long)+0x145) [0x592745]
12: (FileStore::_do_op(FileStore::OpSequencer*)+0x13e) [0x57db5e]
13: (ThreadPool::worker()+0x519) [0x602669]
14: (ThreadPool::WorkThread::entry()+0xd) [0x51fd3d]
15: (()+0x68ba) [0x7fab3ec3d8ba]
16: (clone()+0x6d) [0x7fab3d8d202d]
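For context on the assert message above: ENOTEMPTY is the errno a filesystem returns when, for instance, rmdir(2) is called on a directory that still has entries in it. The log does not show which operation inside `_do_transaction` returned it, so the sketch below only illustrates the general condition, assuming a directory removal. The function names and the "collection" and "stray_file" paths are made up for the illustration and are not taken from FileStore.

```python
import errno
import os
import tempfile


def rmdir_errno(path):
    """Try to remove a directory; return 0 on success or the errno on failure."""
    try:
        os.rmdir(path)
        return 0
    except OSError as e:
        return e.errno


def demo():
    """Remove a directory once with a stray file inside it, then once empty."""
    with tempfile.TemporaryDirectory() as root:
        coll = os.path.join(root, "collection")    # hypothetical stand-in directory
        os.mkdir(coll)
        stray = os.path.join(coll, "stray_file")   # unexpected leftover entry
        open(stray, "w").close()
        first = rmdir_errno(coll)   # fails: the directory is not empty
        os.remove(stray)
        second = rmdir_errno(coll)  # succeeds once the stray entry is gone
        return first, second


if __name__ == "__main__":
    first, second = demo()
    print(errno.errorcode.get(first, first), second)
```

POSIX allows either ENOTEMPTY or EEXIST for this case. Under that reading, the assert text suggests that something unexpected was left behind in the OSD data directory, which would fit the suspicion in the description that these may be configuration issues.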
Updated by Sage Weil about 13 years ago
- Target version changed from v0.26 to v0.27
Updated by Sage Weil about 13 years ago
- Status changed from New to Can't reproduce
Updated by Samuel Just about 13 years ago
This actually may have been fixed in e26626a8b4d91bb827a4909fd7f45f2c5f06f331.