Bug #46142

closed

Dashboard fails to load, internal server error in API

Added by Anssi Kolehmainen almost 4 years ago. Updated about 2 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
-
Category:
General
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
octopus
Regression:
No
Severity:
2 - major
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

We just updated a Rook-managed Ceph cluster from 14.2.9 to 15.2.3. After the update, the dashboard front page no longer loads; only an error box reading "The server encountered an unexpected condition which prevented it from fulfilling the request." is shown.

The mgr logs show that the /api/health/minimal call results in:

[21/Jun/2020:19:56:54] HTTP
Traceback (most recent call last):
  File "/lib/python3.6/site-packages/cherrypy/_cprequest.py", line 638, in respond
    self._do_respond(path_info)
  File "/lib/python3.6/site-packages/cherrypy/_cprequest.py", line 697, in _do_respond
    response.body = self.handler()
  File "/lib/python3.6/site-packages/cherrypy/lib/encoding.py", line 219, in __call__
    self.body = self.oldhandler(*args, **kwargs)
  File "/lib/python3.6/site-packages/cherrypy/_cpdispatch.py", line 54, in __call__
10.42.6.161 - - [21/Jun/2020:19:56:54] "GET /metrics HTTP/1.1" 500 1823 "" "Prometheus/2.16.0" 
    return self.callable(*self.args, **self.kwargs)
  File "/usr/share/ceph/mgr/prometheus/module.py", line 1047, in metrics
    return self._metrics(instance)
  File "/usr/share/ceph/mgr/prometheus/module.py", line 1062, in _metrics
    instance.collect_cache = instance.collect()
  File "/usr/share/ceph/mgr/prometheus/module.py", line 916, in collect
    self.get_pg_status()
  File "/usr/share/ceph/mgr/prometheus/module.py", line 524, in get_pg_status
    num_by_state[state] += count
KeyError: 'laggy'

I think this is caused by these two lists not being identical

The states failed_repair, laggy, premerge and wait are missing from PG_STATES.
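
To illustrate the suspected failure mode, here is a minimal sketch in Python. It is not the code from the prometheus module and not the fix from the linked pull request. The shortened PG_STATES list, the function names count_strict and count_tolerant, and the shape of the input (a dict mapping "+"-joined state names to PG counts) are assumptions made for illustration only.

```python
from collections import defaultdict

# Hypothetical, shortened stand-in for the module's PG_STATES list.
# Note that "laggy" is deliberately absent.
PG_STATES = ["active", "clean", "peering"]


def count_strict(pg_summary):
    """Count PGs per state using a dict pre-seeded only from PG_STATES.

    Raises KeyError as soon as the cluster reports a state that is
    not listed in PG_STATES.
    """
    num_by_state = {state: 0 for state in PG_STATES}
    for combined, count in pg_summary.items():
        for state in combined.split("+"):
            num_by_state[state] += count
    return num_by_state


def count_tolerant(pg_summary):
    """Same counting, but unknown states start at 0 instead of failing."""
    num_by_state = defaultdict(int, {state: 0 for state in PG_STATES})
    for combined, count in pg_summary.items():
        for state in combined.split("+"):
            num_by_state[state] += count
    return dict(num_by_state)


# Assumed input shape: "+"-joined state names mapped to PG counts.
sample = {"active+clean": 5, "active+clean+laggy": 2}

try:
    count_strict(sample)
except KeyError as exc:
    print("strict counting failed on state:", exc)

print(count_tolerant(sample))
```

The tolerant variant only shows one way to keep the endpoint from returning a 500 when a new state appears. Adding the missing states to PG_STATES would address the mismatch described above directly.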


Related issues 2 (0 open, 2 closed)

Copied to mgr - Backport #48550: nautilus: Dashboard fails to load, internal server error in API (Status: Rejected)
Copied to mgr - Backport #48551: octopus: Dashboard fails to load, internal server error in API (Status: Resolved, Assignee: Laura Paduano)

#1

Updated by Neha Ojha over 3 years ago

  • Project changed from Ceph to mgr
  • Category set to 132
  • Status changed from New to Fix Under Review
  • Pull request ID set to 37909
#2

Updated by Neha Ojha over 3 years ago

  • Backport set to nautilus,octopus
#3

Updated by Kefu Chai over 3 years ago

  • Status changed from Fix Under Review to Pending Backport
#4

Updated by Backport Bot over 3 years ago

  • Copied to Backport #48550: nautilus: Dashboard fails to load, internal server error in API added
#5

Updated by Backport Bot over 3 years ago

  • Copied to Backport #48551: octopus: Dashboard fails to load, internal server error in API added
#6

Updated by Ernesto Puerta about 3 years ago

  • Project changed from mgr to Dashboard
  • Category changed from 132 to General
#7

Updated by Konstantin Shalygin about 2 years ago

  • Status changed from Pending Backport to Resolved
  • Backport changed from nautilus,octopus to octopus