Bug #43181

closed

Module 'balancer' has failed: (104,) - with Unhandled Exception

Added by Rory Shanks over 4 years ago. Updated over 4 years ago.

Status:
Duplicate
Priority:
Normal
Assignee:
-
Category:
balancer module
Target version:
-
% Done:
0%

Source:
Community (user)
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

In our production cluster we came across this issue with the Ceph balancer. The cluster was rebalancing at the time of the crash.

2019-12-06 20:18:01.031 xxxxxxxxxxxx -1 log_channel(cluster) log [ERR] : Unhandled exception from module 'balancer' while running on mgr.xxxxx: (104,)
2019-12-06 20:18:01.031 xxxxxxxxxxxx -1 balancer.serve:
2019-12-06 20:18:01.031 xxxxxxxxxxxx -1 Traceback (most recent call last):
  File "/usr/share/ceph/mgr/balancer/module.py", line 624, in serve
    r, detail = self.optimize(plan)
  File "/usr/share/ceph/mgr/balancer/module.py", line 891, in optimize
    return self.do_crush_compat(plan)
  File "/usr/share/ceph/mgr/balancer/module.py", line 1053, in do_crush_compat
    weight = best_ws[osd]
KeyError: (104,)

Unfortunately, I do not have any more verbose logging. Please let me know if you need any further information.
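
For illustration only, the failure mode in the traceback can be sketched outside Ceph. The last frame is a plain dictionary lookup, `weight = best_ws[osd]`, so a `KeyError` there suggests that the OSD id was not present in `best_ws` at that moment. The sketch below is not the balancer module's code and not the fix from #42721: the function names and the weights are hypothetical, and only `best_ws` and `osd` are taken from the traceback.

```python
# Hypothetical stand-ins for illustration; this is NOT the Ceph balancer code.

def lookup_weight(best_ws, osd):
    """Direct lookup, mirroring the traceback line `weight = best_ws[osd]`."""
    return best_ws[osd]


def lookup_weight_safe(best_ws, osd, default=None):
    """Defensive variant (an assumption, not the upstream fix):
    return a default instead of raising when the OSD id is missing."""
    return best_ws.get(osd, default)


# Made-up weights keyed by OSD id; OSD 104 is deliberately absent.
best_ws = {101: 1.0, 102: 0.95, 103: 1.05}

try:
    lookup_weight(best_ws, 104)
except KeyError as exc:
    # The exception's args tuple holds the missing key.
    missing = exc.args

fallback = lookup_weight_safe(best_ws, 104)
present = lookup_weight_safe(best_ws, 101)
```

Here `missing` is the tuple `(104,)`, which is consistent with the `(104,)` shown in the log above. Whether skipping or defaulting a missing OSD is appropriate inside the balancer is a separate question that this sketch does not answer.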


Related issues 1 (0 open, 1 closed)

Is duplicate of mgr - Bug #42721: mgr/balancer: KeyError messages in balancer module - Resolved - Sage Weil

#1

Updated by Brad Hubbard over 4 years ago

  • Project changed from Ceph to mgr
  • Category set to balancer module

#2

Updated by Lenz Grimmer over 4 years ago

  • Is duplicate of Bug #42721: mgr/balancer: KeyError messages in balancer module added

#3

Updated by Lenz Grimmer over 4 years ago

  • Status changed from New to Duplicate

This seems to be a duplicate of #42721.
