Project

General

Profile

Actions

Bug #10327

closed

"FAILED assert(allow_eio || !m_filestore_fail_eio || got != -5)" in upgrade:giant-giant-distro-basic-multi run

Added by Yuri Weinstein over 9 years ago. Updated over 9 years ago.

Status:
Closed
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
Q/A
Tags:
Backport:
Regression:
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Assertion due to bad drive, this may not be a bug, just making sure our assertion is correct.

Drives on mira076 are bad:

ubuntu@mira076:~$ /usr/libexec/smart.pl 
Drive 8 has 522 reallocated sectors;     Drive 8 has 804 uncorrect sectors;     Drive 8 has 29 pending sectors;     Drive 8 has 29 pending sectors

Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-12-14_17:13:01-upgrade:giant-giant-distro-basic-multi/658242/

Assertion: os/FileStore.cc: 2800: FAILED assert(allow_eio || !m_filestore_fail_eio || got != -5)
ceph version 0.87-72-g6ec14b0 (6ec14b07940ff64d6a121e21a730f691a1a71546)
 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x7f) [0xaa033f]
 2: (FileStore::read(coll_t, ghobject_t const&, unsigned long, unsigned long, ceph::buffer::list&, bool)+0x94f) [0x8b39ff]
 3: (ReplicatedBackend::build_push_op(ObjectRecoveryInfo const&, ObjectRecoveryProgress const&, ObjectRecoveryProgress*, PushOp*, object_stat_sum_t*)+0x24f) [0x7eefef]
 4: (ReplicatedBackend::prep_push(std::tr1::shared_ptr<ObjectContext>, hobject_t const&, pg_shard_t, eversion_t, interval_set<unsigned long>&, std::map<hobject_t, interval_set<unsigned long>, std::less<hobject_t>, std::allocator<std::pair<hobject_t const, interval_set<unsigned long> > > >&, PushOp*)+0x283) [0x829783]
 5: (ReplicatedBackend::prep_push_to_replica(std::tr1::shared_ptr<ObjectContext>, hobject_t const&, pg_shard_t, PushOp*)+0x372) [0x829d42]
 6: (ReplicatedBackend::start_pushes(hobject_t const&, std::tr1::shared_ptr<ObjectContext>, ReplicatedBackend::RPGHandle*)+0x737) [0x82aae7]
 7: (ReplicatedBackend::recover_object(hobject_t const&, eversion_t, std::tr1::shared_ptr<ObjectContext>, std::tr1::shared_ptr<ObjectContext>, PGBackend::RecoveryHandle*)+0xf6) [0x9348f6]
 8: (ReplicatedPG::prep_object_replica_pushes(hobject_t const&, eversion_t, PGBackend::RecoveryHandle*)+0x49d) [0x7ffa6d]
 9: (ReplicatedPG::recover_replicas(int, ThreadPool::TPHandle&)+0xbc1) [0x800ce1]
 10: (ReplicatedPG::start_recovery_ops(int, PG::RecoveryCtx*, ThreadPool::TPHandle&, int*)+0x32d) [0x84351d]
 11: (OSD::do_recovery(PG*, ThreadPool::TPHandle&)+0x287) [0x64b087]
 12: (OSD::RecoveryWQ::_process(PG*, ThreadPool::TPHandle&)+0x11) [0x6a8951]
 13: (ThreadPool::worker(ThreadPool::WorkThread*)+0x48e) [0xa8ff6e]
 14: (ThreadPool::WorkThread::entry()+0x10) [0xa92d00]
 15: (()+0x7e9a) [0x7fa395176e9a]
 16: (clone()+0x6d) [0x7fa393b283fd]

Actions #1

Updated by Yuri Weinstein over 9 years ago

  • Status changed from New to Closed

Not a bug

Actions

Also available in: Atom PDF