Project

General

Profile

Actions

Bug #8724

closed

osd/ReplicatedPG.cc: 10278: FAILED assert(obc) in recover_backfill()

Added by Sage Weil almost 10 years ago. Updated almost 10 years ago.

Status:
Rejected
Priority:
Urgent
Assignee:
-
Category:
OSD
Target version:
-
% Done:

0%

Source:
Development
Tags:
Backport:
Regression:
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

this was on a vstart cluster. log attached.

2014-07-01 17:33:36.083978 7ffbf1ffb700 -1 osd/ReplicatedPG.cc: In function 'int ReplicatedPG::recover_backfill(int, ThreadPool::TPHandle&, bool*)' thread 7ffbf1ffb700 time 2014-07-01 17:33:36.063324
osd/ReplicatedPG.cc: 10278: FAILED assert(obc)

 ceph version 0.82-397-g31e9c34 (31e9c3427b7df9aee9a625ed9b8ab32602a8da1e)
 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x95) [0x18c3a91]
 2: (ReplicatedPG::recover_backfill(int, ThreadPool::TPHandle&, bool*)+0x1c39) [0x159d0eb]
 3: (ReplicatedPG::start_recovery_ops(int, PG::RecoveryCtx*, ThreadPool::TPHandle&, int*)+0x890) [0x159683c]
 4: (OSD::do_recovery(PG*, ThreadPool::TPHandle&)+0x798) [0x12d8c88]
 5: (OSD::RecoveryWQ::_process(PG*, ThreadPool::TPHandle&)+0x30) [0x12f8678]
 6: (ThreadPool::WorkQueue<PG>::_void_process(void*, ThreadPool::TPHandle&)+0x33) [0x138a9a3]
 7: (ThreadPool::worker(ThreadPool::WorkThread*)+0x734) [0x18b39d2]
 8: (ThreadPool::WorkThread::entry()+0x23) [0x18b79c3]
 9: (Thread::entry_wrapper()+0x79) [0x18ac145]
 10: (Thread::_entry_func(void*)+0x18) [0x18ac0c2]
 11: (()+0x8182) [0x7ffc3e350182]
 12: (clone()+0x6d) [0x7ffc3c87730d]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.


Files

osd.1.log.gz (10.9 MB) osd.1.log.gz Sage Weil, 07/02/2014 09:32 AM
Actions #1

Updated by Sage Weil almost 10 years ago

Actions #2

Updated by David Zafman almost 10 years ago

There are multiple simultaneous asserts. The objects involved in the backfill assert is benchmark_data_maetl_4884_object5176.

Earlier this error occurred.

2014-07-01 17:33:36.063286 7ffbf1ffb700 10 filestore(/home/sage/src/ceph/src/dev/osd1) error opening file /home/sage/src/ceph/src/dev/osd1/current/2.4_head/DIR_C/DIR_E/benchmark\udata\umaetl\u4884\uobject5176__head_A4C62CEC__2 with flags=2: (24) Too many open files

Actions #3

Updated by Sage Weil almost 10 years ago

  • Status changed from New to Rejected

sweet

Actions

Also available in: Atom PDF