Bug #45096

open

telemetry.ceph.com certificate has expired

Added by Tim Bishop about 4 years ago. Updated about 4 years ago.

Status:
New
Priority:
Normal
Assignee:
-
% Done:
0%

Source:
Community (user)
Tags:
Backport:
Regression:
No
Severity:
2 - major
Reviewed:
Affected Versions:
ceph-qa-suite:
Crash signature (v1):
Crash signature (v2):

Description

The SSL certificate for https://telemetry.ceph.com/ has expired.

# ceph health detail
HEALTH_ERR Module 'telemetry' has failed: [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed (_ssl.c:590)
MGR_MODULE_ERROR Module 'telemetry' has failed: [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed (_ssl.c:590)
    Module 'telemetry' has failed: [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed (_ssl.c:590)

https://www.ssllabs.com/ssltest/analyze.html?d=telemetry.ceph.com&hideResults=on

The certificate appears to have been issued by Let's Encrypt, so I guess the renewal script isn't running?
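
For anyone who wants to confirm the problem from a client machine rather than through SSL Labs, here is a minimal Python sketch. It attempts the same kind of verified TLS handshake that fails in the error above and reports the outcome. The function names `check_tls` and `days_until_expiry` are made up for this illustration and are not part of Ceph; the sketch assumes Python 3.7+ and outbound access to port 443.

```python
import socket
import ssl
import time


def days_until_expiry(not_after: str, now: float) -> float:
    """Days between `now` (epoch seconds) and a certificate's notAfter value.

    `not_after` uses the format returned by SSLSocket.getpeercert(),
    e.g. "Apr 14 12:00:00 2020 GMT". A negative result means it has expired.
    """
    return (ssl.cert_time_to_seconds(not_after) - now) / 86400.0


def check_tls(host: str, port: int = 443, timeout: float = 10.0) -> str:
    """Attempt a verified TLS handshake and describe the outcome (hypothetical helper)."""
    context = ssl.create_default_context()  # verifies the chain and the hostname
    try:
        with socket.create_connection((host, port), timeout=timeout) as sock:
            with context.wrap_socket(sock, server_hostname=host) as tls:
                cert = tls.getpeercert()
    except ssl.SSLCertVerificationError as exc:
        # An expired certificate ends up here, like the mgr module's request did.
        return f"verification failed: {exc.verify_message}"
    days = days_until_expiry(cert["notAfter"], time.time())
    return f"valid until {cert['notAfter']} ({days:.0f} days left)"
```

Calling `check_tls("telemetry.ceph.com")` should return a "verification failed" message while the certificate is expired, and the expiry date once it has been renewed.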

Actions #1

Updated by Tobias Fischer about 4 years ago

Same here on several clusters. I had to disable the telemetry module to get back to HEALTH_OK :-(
The certificate expired on 14.4.2020.

Actions #2

Updated by Tim Bishop about 4 years ago

Tobias Fischer wrote:

Same here on several clusters. I had to disable the telemetry module to get back to HEALTH_OK :-(
The certificate expired on 14.4.2020.

Hi Tobias. How did you disable it? I tried and got:

# ceph telemetry off
Error EIO: Module 'telemetry' has experienced an error and cannot handle commands: [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed (_ssl.c:590)

This is on Nautilus 14.2.5.

Actions #3

Updated by Felix Barbeira about 4 years ago

It's renewed. You can try restarting the mgr daemon on the active monitor node. When the mgr daemon starts again, it rechecks the HTTPS certificate and the cluster turns healthy again.

Actions #4

Updated by Tim Bishop about 4 years ago

Felix Barbeira wrote:

It's renewed. You can try restarting the mgr daemon on the active monitor node. When the mgr daemon starts again, it rechecks the HTTPS certificate and the cluster turns healthy again.

Yup, confirmed, it's now working after a mgr restart.
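
To check several clusters after the restart, a small Python sketch like the following could confirm that the telemetry error has cleared. It assumes the plain-text `ceph health detail` output looks like the lines quoted in the description; the helper names `telemetry_module_failed` and `read_health_detail` are hypothetical, and other Ceph releases may format the output differently.

```python
import subprocess


def telemetry_module_failed(health_detail: str) -> bool:
    """True if `ceph health detail` text reports a failed telemetry module.

    Assumption: the MGR_MODULE_ERROR line contains the phrase
    "'telemetry' has failed", as in the output quoted in this report.
    """
    return any(
        "MGR_MODULE_ERROR" in line and "'telemetry' has failed" in line
        for line in health_detail.splitlines()
    )


def read_health_detail() -> str:
    """Run `ceph health detail` on this host and return its stdout."""
    result = subprocess.run(
        ["ceph", "health", "detail"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

On a node with the `ceph` CLI and a working keyring, `telemetry_module_failed(read_health_detail())` should be false once the mgr restart has taken effect.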
