Project

General

Profile

Actions

Bug #37509

closed

require past_interval bounds mismatch due to osd oldest_map

Added by Sage Weil over 5 years ago. Updated almost 4 years ago.

Status:
Can't reproduce
Priority:
High
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
nautilus
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

2018-12-03 18:58:37.688 7f93af6a9700 10 osd.1 pg_epoch: 704 pg[1.30( empty local-lis/les=496/497 n=0 ec=496/15 lis/c 496/496 les/c/f 497/497/0 498/498/486) [7,6] r=-1 lpr=704 pi=[496,498)/1 crt=0'0 unknown NOTIFY mbc={}] check_recovery_sources no source osds () went down
2018-12-03 18:58:37.688 7f93af6a9700 -1 log_channel(cluster) log [ERR] : 1.30 required past_interval bounds are empty [515,498) but past_intervals is not: ([496,497] intervals=([496,497] acting 1,7))

note from first line that last_epoch_clean is 497, same_interval_since is 498, and pi is ([496,497] intervals=([496,497] acting 1,7)), which is all fine.

the required bound looks like [515,498) because superblock.oldest_map is 515.

//a/sage-2018-12-03_17:39:26-rados-wip-sage2-testing-2018-12-03-0942-distro-basic-smithi/3304196


Related issues 1 (0 open1 closed)

Related to RADOS - Bug #39546: Warning about past_interval bounds on deleting pgResolved04/30/2019

Actions
Actions #1

Updated by Sage Weil over 5 years ago

I don't think the superblock.oldest_map should be a factor in this calculation. I suspect it is in there to deal with skipped maps, but in those situations I think we should use a different strategy (e.g., not munge required intervals at this level) to proceed... by, say, not checking interval bounds at all. or something.

Actions #2

Updated by Neha Ojha over 5 years ago

/a/nojha-2019-01-23_02:37:14-rados:thrash-erasure-code-master-distro-basic-smithi/3494085/

Actions #3

Updated by Neha Ojha about 5 years ago

  • Backport set to nautilus

/a/yuriw-2019-04-05_00:28:05-rados-wip-yuri2-testing-2019-04-04-1953-nautilus-distro-basic-smithi/3811215/

Actions #4

Updated by Neha Ojha almost 5 years ago

  • Related to Bug #39546: Warning about past_interval bounds on deleting pg added
Actions #5

Updated by Patrick Donnelly over 4 years ago

  • Status changed from 12 to New
Actions #6

Updated by Neha Ojha almost 4 years ago

  • Status changed from New to Can't reproduce
Actions

Also available in: Atom PDF