Bug #49988: Global Recovery Event never completes - RADOS - Ceph

Bug #49988

closed

Global Recovery Event never completes

Added by Sage Weil about 3 years ago. Updated almost 3 years ago.

Status: Resolved
Priority: Urgent
Assignee: Kamoltat (Junior) Sirivadhna
Backport: pacific
Severity: 3 - minor
Pull request ID: 40480

Description

  services:
    mon: 3 daemons, quorum a,b,c (age 29m)
    mgr: x(active, since 29m)
    osd: 6 osds: 6 up (since 17m), 6 in (since 29m)

  data:
    pools:   3 pools, 160 pgs
    objects: 0 objects, 0 B
    usage:   6.1 GiB used, 600 GiB / 606 GiB avail
    pgs:     160 active+clean

  progress:
    Global Recovery Event (5m)
      [==================..........] (remaining: 2m)

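This is an OSD=6 vstart cluster that just ran qa/workunits/rados/test_python.sh, but I've seen this in other cases too. All 160 PGs are active+clean, yet the Global Recovery Event stays in progress and never completes.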

Related issues 2 (0 open — 2 closed)

Updated by Neha Ojha about 3 years ago

Assignee set to Kamoltat (Junior) Sirivadhna

Updated by Kamoltat (Junior) Sirivadhna about 3 years ago

Status changed from New to Fix Under Review
Pull request ID set to 40480

The problem was that I did not subtract the PGs I skip (those with reported_epoch_of_pg < start_epoch_of_event) from total_pg_num. As a result, active_clean_pg < total_pg_num always held, so the event could never reach completion.
The fix is simply to subtract the skipped PGs from total_pg_num.
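
The accounting described above can be illustrated with a minimal sketch. This is not the code from the mgr progress module or from the pull request. `PgStat`, `recovery_progress`, and the `subtract_skipped` flag are hypothetical names introduced only for illustration, and treating an event with no countable PGs as complete is an assumption. Only `total_pg_num`, `active_clean_pg`, and the epoch comparison come from the comment.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass
class PgStat:
    """Hypothetical, simplified view of one PG's reported state."""
    reported_epoch: int
    state: str  # e.g. "active+clean"


def recovery_progress(pgs: Iterable[PgStat], start_epoch: int,
                      subtract_skipped: bool = True) -> float:
    """Return the fraction of counted PGs that are active+clean.

    PGs whose report predates the event (reported_epoch < start_epoch)
    are skipped. With subtract_skipped=False the skipped PGs stay in the
    denominator, which reproduces the behaviour described in the comment.
    """
    pgs = list(pgs)
    total_pg_num = len(pgs)
    active_clean_pg = 0
    skipped = 0

    for pg in pgs:
        if pg.reported_epoch < start_epoch:
            # Stale report: not counted as active+clean.
            skipped += 1
            continue
        flags = set(pg.state.split("+"))
        if {"active", "clean"} <= flags:
            active_clean_pg += 1

    if subtract_skipped:
        # The fix: skipped PGs must also leave the denominator.
        total_pg_num -= skipped

    if total_pg_num == 0:
        # Assumption: nothing left to wait for counts as complete.
        return 1.0
    return active_clean_pg / total_pg_num


if __name__ == "__main__":
    # 3 PGs last reported before the event started, 7 reported after it.
    pgs = [PgStat(5, "active+clean")] * 3 + [PgStat(12, "active+clean")] * 7
    print(recovery_progress(pgs, start_epoch=10, subtract_skipped=False))
    print(recovery_progress(pgs, start_epoch=10, subtract_skipped=True))
```

Without the subtraction, the three stale PGs keep the ratio below 1 even though every PG is active+clean, which matches the stuck progress bar in the description.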

Updated by Neha Ojha almost 3 years ago

Related to Bug #50243: test_turn_off_module (tasks.mgr.test_progress.TestProgress) AssertionError: False is not true added

Updated by Kefu Chai almost 3 years ago

Related to deleted (Bug #50243: test_turn_off_module (tasks.mgr.test_progress.TestProgress) AssertionError: False is not true)

Updated by Kefu Chai almost 3 years ago

Has duplicate Bug #50243: test_turn_off_module (tasks.mgr.test_progress.TestProgress) AssertionError: False is not true added

Updated by Kefu Chai almost 3 years ago

Status changed from Fix Under Review to Resolved

Updated by Neha Ojha almost 3 years ago

Status changed from Resolved to Pending Backport
Backport set to pacific

Updated by Backport Bot almost 3 years ago

Copied to Backport #51215: pacific: Global Recovery Event never completes added

Updated by Yuri Weinstein almost 3 years ago

https://github.com/ceph/ceph/pull/41872 merged

Updated by Loïc Dachary almost 3 years ago

Status changed from Pending Backport to Resolved

While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are in status "Resolved" or "Rejected".
