Bug #631

closed

OSD: FileJournal::committed_thru

Added by Wido den Hollander over 13 years ago. Updated over 13 years ago.

Status: Won't Fix
Priority: Urgent
Category: OSD
% Done: 0%

Description

On a small cluster (1 MDS, 1 MON, 3 OSDs) I just saw 2 OSDs crash with the same backtrace:

2010-12-04 22:01:01.823801 7f7bfd2f1710 osd0 96 update_osd_stat osd_stat(3258 MB used, 99141 MB avail, 102400 MB total, peers []/[])
2010-12-04 22:01:01.823864 7f7bfd2f1710 osd0 96 heartbeat: stat(2010-12-04 22:01:01.823671 oprate=0 qlen=0 recent_qlen=0 rdlat=0 / 0 fshedin=0)
2010-12-04 22:01:01.823898 7f7bfd2f1710 osd0 96 heartbeat: osd_stat(3258 MB used, 99141 MB avail, 102400 MB total, peers []/[])
os/FileJournal.cc: In function 'virtual void FileJournal::committed_thru(uint64_t)':
os/FileJournal.cc:854: FAILED assert(seq >= last_committed_seq)
 ceph version 0.24~rc (commit:23f370436eeb20f02c0d20699f53071a1424cebf)
 1: (FileJournal::committed_thru(unsigned long)+0xb1) [0x58a931]
 2: (JournalingObjectStore::commit_finish()+0x8c) [0x58110c]
 3: (FileStore::sync_entry()+0xd03) [0x573853]
 4: (FileStore::SyncThread::entry()+0xd) [0x507bed]
 5: (Thread::_entry_func(void*)+0xa) [0x4797fa]
 6: (()+0x69ca) [0x7f7c0aa7d9ca]
 7: (clone()+0x6d) [0x7f7c0970970d]
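
The failed assertion at os/FileJournal.cc:854 requires that the sequence number passed to committed_thru() is never lower than last_committed_seq. The following is only a minimal sketch of that invariant, not the actual FileJournal code: CommitTracker and rejects_backwards_commit are hypothetical names, and the sketch throws an exception where the real code aborts on a failed assert.

```cpp
#include <cassert>
#include <cstdint>
#include <stdexcept>

// Hypothetical stand-in for the journal's commit bookkeeping (not Ceph code).
class CommitTracker {
public:
    // Record that everything up to and including `seq` has been committed.
    // Mirrors the check `seq >= last_committed_seq` from the backtrace, but
    // throws instead of aborting so the violation can be observed.
    void committed_thru(std::uint64_t seq) {
        if (seq < last_committed_seq_) {
            throw std::logic_error("commit sequence went backwards");
        }
        last_committed_seq_ = seq;
    }

    std::uint64_t last_committed_seq() const { return last_committed_seq_; }

private:
    std::uint64_t last_committed_seq_ = 0;
};

// Usage example: equal or increasing sequence numbers are accepted,
// a lower one is rejected and leaves the recorded value unchanged.
inline bool rejects_backwards_commit() {
    CommitTracker t;
    t.committed_thru(5);
    t.committed_thru(5);  // equal is allowed by `>=`
    assert(t.last_committed_seq() == 5);
    try {
        t.committed_thru(3);  // lower than 5: violates the invariant
    } catch (const std::logic_error&) {
        return t.last_committed_seq() == 5;
    }
    return false;
}
```

In this sketch a crash like the one above corresponds to the sync thread reporting a commit sequence lower than one it had already reported. Why that happened here is not established by this report.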

This is on the RC branch.

I ran cdebugpack to collect data; it is stored at logger.ceph.widodh.nl:/srv/ceph/issues/osd_crash_committed_thru

Note that the logs had already been rotated before I found out the OSDs had crashed, so you should look at the *.1.gz files.


Files

osd.1.log (3.92 MB), added by Wido den Hollander, 12/06/2010 03:13 AM