Bug #14015
osd/PG.cc: 288: FAILED assert(info.last_epoch_started >= info.history.last_epoch_started)
Status:
Resolved
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:
0%
Source:
other
Tags:
Backport:
infernalis, hammer
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):
Description
Hit this assert after setting osd_find_best_info_ignore_history_les to bring incomplete PGs to active+clean while accepting known data loss.
Fixed in master with commit:
02a9a41f151a3d968bf8066749658659dc6e3ac4
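For reference, the assert in the title and the assert quoted in the hammer-backports log in comment #11 differ only by a leading check of the option. Below is a minimal sketch of the two conditions. The struct and function names (pg_info_sketch, pg_history_sketch, les_check_before_fix, les_check_after_fix) are hypothetical stand-ins, not the real Ceph types or the actual patch; only the boolean expressions are taken from the two assert messages.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-ins for the fields named in the assert messages.
// The real pg_info_t and pg_history_t in Ceph carry many more members.
using epoch_sketch_t = std::uint32_t;

struct pg_history_sketch {
  epoch_sketch_t last_epoch_started = 0;
};

struct pg_info_sketch {
  epoch_sketch_t last_epoch_started = 0;
  pg_history_sketch history;
};

// Condition from the assert in the title (osd/PG.cc: 288).
bool les_check_before_fix(const pg_info_sketch& info) {
  return info.last_epoch_started >= info.history.last_epoch_started;
}

// Condition from the assert quoted in comment #11 (osd/PG.cc: 303):
// the comparison is skipped when the option is set.
// ignore_history_les stands in for
// cct->_conf->osd_find_best_info_ignore_history_les.
bool les_check_after_fix(bool ignore_history_les, const pg_info_sketch& info) {
  return ignore_history_les ||
         info.last_epoch_started >= info.history.last_epoch_started;
}
```

With info.last_epoch_started behind info.history.last_epoch_started, the first condition is false regardless of the option, while the second is false only when the option is unset.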
Related issues
History
#1 Updated by Loïc Dachary over 8 years ago
- Backport set to infernalis
#2 Updated by Loïc Dachary over 8 years ago
- Copied to Backport #14018: infernalis: osd/PG.cc: 288: FAILED assert(info.last_epoch_started >= info.history.last_epoch_started) added
#3 Updated by Loïc Dachary over 8 years ago
@David, is this to be backported to infernalis only, or also to hammer?
#4 Updated by David Zafman over 8 years ago
- Backport changed from infernalis to infernalis, hammer
We need to backport to Hammer too, so I added it.
#5 Updated by Ken Dreyer over 8 years ago
- Backport changed from infernalis, hammer to jewel, infernalis, hammer
PR to master: https://github.com/ceph/ceph/pull/6793
#7 Updated by Loïc Dachary over 8 years ago
- Copied to Backport #14043: hammer: osd/PG.cc: 288: FAILED assert(info.last_epoch_started >= info.history.last_epoch_started) added
#8 Updated by Loïc Dachary over 8 years ago
- Backport changed from jewel, infernalis, hammer to infernalis, hammer
#10 Updated by Nathan Cutler about 8 years ago
- Status changed from Pending Backport to Resolved
#11 Updated by Loïc Dachary about 8 years ago
- Status changed from Resolved to New
@david: could this be a regression introduced by this fix?
http://pulpito.ceph.com/loic-2016-02-16_22:00:52-rados-hammer-backports---basic-multi/12463/
2016-02-17T00:01:11.051 INFO:tasks.ceph.osd.1.mira080.stderr:2016-02-17 08:01:11.067384 7fb677316700 -1 osd.1 68 heartbeat_check: no reply from osd.5 since back 2016-02-17 08:00:49.045566 front 2016-02-17 08:00:49.045566 (cutoff 2016-02-17 08:00:51.067383)
2016-02-17T00:01:11.232 INFO:tasks.ceph.osd.4.smithi016.stderr:2016-02-17 08:01:11.240531 7f7b5ce5b700 -1 osd.4 68 heartbeat_check: no reply from osd.5 since back 2016-02-17 08:00:46.936030 front 2016-02-17 08:00:46.936030 (cutoff 2016-02-17 08:00:51.240529)
2016-02-17T00:01:12.673 INFO:tasks.ceph.osd.1.mira080.stderr:osd/PG.cc: In function 'void PG::proc_master_log(ObjectStore::Transaction&, pg_info_t&, pg_log_t&, pg_missing_t&, pg_shard_t)' thread 7fb666996700 time 2016-02-17 08:01:12.688125
2016-02-17T00:01:12.673 INFO:tasks.ceph.osd.1.mira080.stderr:osd/PG.cc: 303: FAILED assert(cct->_conf->osd_find_best_info_ignore_history_les || info.last_epoch_started >= info.history.last_epoch_started)
2016-02-17T00:01:12.674 INFO:tasks.ceph.osd.1.mira080.stderr: ceph version 0.94.5-372-gc8e5459 (c8e54591aa45fe42f3fd53164de8f89161696097)
2016-02-17T00:01:12.674 INFO:tasks.ceph.osd.1.mira080.stderr: 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x8b) [0xbaa65b]
2016-02-17T00:01:12.674 INFO:tasks.ceph.osd.1.mira080.stderr: 2: (PG::proc_master_log(ObjectStore::Transaction&, pg_info_t&, pg_log_t&, pg_missing_t&, pg_shard_t)+0x34d) [0x7c83bd]
2016-02-17T00:01:12.674 INFO:tasks.ceph.osd.1.mira080.stderr: 3: (PG::RecoveryState::GetLog::react(PG::RecoveryState::GotLog const&)+0x1b1) [0x7e1141]
2016-02-17T00:01:12.674 INFO:tasks.ceph.osd.1.mira080.stderr: 4: (boost::statechart::simple_state<PG::RecoveryState::GetLog, PG::RecoveryState::Peering, boost::mpl::list<mpl_::
#12 Updated by Loïc Dachary about 8 years ago
- Status changed from New to Pending Backport
#13 Updated by Loïc Dachary about 8 years ago
- Status changed from Pending Backport to New
#14 Updated by Loïc Dachary over 7 years ago
- Status changed from New to Resolved