Project

General

Profile

Actions

Bug #36659

closed

[rbd-mirror] forced promotion after killing remote cluster results in stuck state

Added by Jason Dillaman over 5 years ago. Updated about 5 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
Jason Dillaman
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
luminous,mimic
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

The rbd-mirror daemon detects that the image has been locally promoted and attempts to shut down, but it hangs since the remote cluster is unresponsive and skips the status update.

2018-10-31 10:45:32.341 7f32f8ff9700 20 rbd::mirror::ImageReplayer: 0x7f333800e4e0 [1/41ddd4e2-5716-4c14-9568-c1340762addd] on_stop_journal_replay: enter
2018-10-31 10:45:32.341 7f32f8ff9700 20 rbd::mirror::ImageReplayer: 0x7f333800e4e0 [1/41ddd4e2-5716-4c14-9568-c1340762addd] set_state_description: 0 force promoted
2018-10-31 10:45:32.341 7f32f8ff9700 20 rbd::mirror::ImageReplayer: 0x7f333800e4e0 [1/41ddd4e2-5716-4c14-9568-c1340762addd] update_mirror_image_status:
2018-10-31 10:45:32.341 7f32f8ff9700 20 rbd::mirror::ImageReplayer: 0x7f333800e4e0 [1/41ddd4e2-5716-4c14-9568-c1340762addd] start_mirror_image_status_update: shut down in-progress: ignoring update
2018-10-31 10:45:32.341 7f32f8ff9700 15 rbd::mirror::ImageReplayer: 0x7f333800e4e0 [1/41ddd4e2-5716-4c14-9568-c1340762addd] reschedule_update_status_task: canceling existing status update task
2018-10-31 10:45:32.341 7f32f8ff9700 15 rbd::mirror::ImageReplayer: 0x7f333800e4e0 [1/41ddd4e2-5716-4c14-9568-c1340762addd] finish_mirror_image_status_update:
2018-10-31 10:45:32.341 7f32f8ff9700 10 rbd::mirror::ImageReplayer: 0x7f333800e4e0 [1/41ddd4e2-5716-4c14-9568-c1340762addd] shut_down: r=0

Related issues 2 (0 open2 closed)

Copied to rbd - Backport #36692: luminous: [rbd-mirror] forced promotion after killing remote cluster results in stuck stateResolvedAshish SinghActions
Copied to rbd - Backport #36693: mimic: [rbd-mirror] forced promotion after killing remote cluster results in stuck stateResolvedJonathan BrielmaierActions
Actions

Also available in: Atom PDF