Project

General

Profile

Bug #37768

mon gets stuck op for failing OSDs

Added by Jonas Jelten about 5 years ago. Updated about 5 years ago.

Status:
Duplicate
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Monitor
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

6 slow ops, oldest one blocked for 736706 sec, mon.rofl has slow ops

I have several slow monitor ops that were triggered when OSDs went down. Those OSDs have come up shortly after their failure (they started listening on a different port of course), but the monitor still has ops running for them.

Attached is the dump of sudo ceph daemon mon.rofl ops | jq . > /tmp/lol

Workaround is to restart the mon service that has the slow requests.

slow_mon_ops - currently running "slow" mon ops (13.1 KB) Jonas Jelten, 12/27/2018 06:52 PM


Related issues

Duplicates RADOS - Bug #24531: Mimic MONs have slow/long running ops Resolved

History

#1 Updated by Josh Durgin about 5 years ago

  • Duplicates Bug #24531: Mimic MONs have slow/long running ops added

#2 Updated by Josh Durgin about 5 years ago

  • Status changed from New to Duplicate

Also available in: Atom PDF