Project

General

Profile

Bug #46142

Dashboard fails to load, internal server error in API

Added by Anssi Kolehmainen over 3 years ago. Updated over 1 year ago.

Status:
Resolved
Priority:
Normal
Assignee:
-
Category:
General
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
octopus
Regression:
No
Severity:
2 - major
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

We just updated rook-managed ceph cluster from 14.2.9 to 15.2.3. After the update dashboard frontpage doesn't show and only "The server encountered an unexpected condition which prevented it from fulfilling the request." box is shown.

mgr logs show that /api/health/minimal call results in

[21/Jun/2020:19:56:54] HTTP
Traceback (most recent call last):
  File "/lib/python3.6/site-packages/cherrypy/_cprequest.py", line 638, in respond
    self._do_respond(path_info)
  File "/lib/python3.6/site-packages/cherrypy/_cprequest.py", line 697, in _do_respond
    response.body = self.handler()
  File "/lib/python3.6/site-packages/cherrypy/lib/encoding.py", line 219, in __call__
    self.body = self.oldhandler(*args, **kwargs)
  File "/lib/python3.6/site-packages/cherrypy/_cpdispatch.py", line 54, in __call__
10.42.6.161 - - [21/Jun/2020:19:56:54] "GET /metrics HTTP/1.1" 500 1823 "" "Prometheus/2.16.0" 
    return self.callable(*self.args, **self.kwargs)
  File "/usr/share/ceph/mgr/prometheus/module.py", line 1047, in metrics
    return self._metrics(instance)
  File "/usr/share/ceph/mgr/prometheus/module.py", line 1062, in _metrics
    instance.collect_cache = instance.collect()
  File "/usr/share/ceph/mgr/prometheus/module.py", line 916, in collect
    self.get_pg_status()
  File "/usr/share/ceph/mgr/prometheus/module.py", line 524, in get_pg_status
    num_by_state[state] += count
KeyError: 'laggy'

I think this is caused by these two lists not being identical

failed_repair, laggy, premerge and wait are missing from PG_STATES


Related issues

Copied to mgr - Backport #48550: nautilus: Dashboard fails to load, internal server error in API Rejected
Copied to mgr - Backport #48551: octopus: Dashboard fails to load, internal server error in API Resolved

History

#1 Updated by Neha Ojha almost 3 years ago

  • Project changed from Ceph to mgr
  • Category set to 132
  • Status changed from New to Fix Under Review
  • Pull request ID set to 37909

#2 Updated by Neha Ojha almost 3 years ago

  • Backport set to nautilus,octopus

#3 Updated by Kefu Chai almost 3 years ago

  • Status changed from Fix Under Review to Pending Backport

#4 Updated by Backport Bot almost 3 years ago

  • Copied to Backport #48550: nautilus: Dashboard fails to load, internal server error in API added

#5 Updated by Backport Bot almost 3 years ago

  • Copied to Backport #48551: octopus: Dashboard fails to load, internal server error in API added

#6 Updated by Ernesto Puerta over 2 years ago

  • Project changed from mgr to Dashboard
  • Category changed from 132 to General

#7 Updated by Konstantin Shalygin over 1 year ago

  • Status changed from Pending Backport to Resolved
  • Backport changed from nautilus,octopus to octopus

Also available in: Atom PDF