Bug #43181
closedModule 'balancer' has failed: (104,) - with Unhandled Exception
0%
Description
In our production cluster we came across this issue with the Ceph balancer. The cluster was rebalancing at the time of the crash.
2019-12-06 20:18:01.031 xxxxxxxxxxxx -1 log_channel(cluster) log [ERR] : Unhandled exception from module 'balancer' while running on mgr.xxxxx: (104,)
2019-12-06 20:18:01.031 xxxxxxxxxxxx -1 balancer.serve:
2019-12-06 20:18:01.031 xxxxxxxxxxxx -1 Traceback (most recent call last):
File "/usr/share/ceph/mgr/balancer/module.py", line 624, in serve
r, detail = self.optimize(plan)
File "/usr/share/ceph/mgr/balancer/module.py", line 891, in optimize
return self.do_crush_compat(plan)
File "/usr/share/ceph/mgr/balancer/module.py", line 1053, in do_crush_compat
weight = best_ws[osd]
KeyError: (104,)
Unfortunately I do not have any more verbose logging. Please let me know if you need any further info
Updated by Brad Hubbard over 4 years ago
- Project changed from Ceph to mgr
- Category set to balancer module
Updated by Lenz Grimmer over 4 years ago
- Is duplicate of Bug #42721: mgr/balancer: KeyError messages in balancer module added
Updated by Lenz Grimmer over 4 years ago
- Status changed from New to Duplicate
This seems to be a duplicate of #42721