Project

General

Profile

Actions

Bug #62650

closed

Various SSH errors are preventing many jobs from completing properly

Added by Zack Cerza 9 months ago. Updated 7 months ago.

Status:
Resolved
Priority:
Immediate
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Crash signature (v1):
Crash signature (v2):

Description

We've been having more and more issues with SSH errors recently:
https://sentry.ceph.com/organizations/ceph/issues/?end=2023-08-30T23%3A59%3A59&query=paramiko&start=2023-08-22T00%3A00%3A00&utc=true

I found a fix for the AttributeError: https://sentry.ceph.com/share/issue/e9092ab6059e4ea299350022b9b2cb52/ https://github.com/ceph/teuthology/pull/1886 - but there's clearly more going on at this point.

This issue alone has occurred over 600 times in the last 24h: https://sentry.ceph.com/share/issue/ef95cc1bf37f4e89a849c9a1c5e26a6b/

I noticed that all of the hosts it affected were CentOS 9.Stream, and I've narrowed this particular issue down to an SSH key incompatibility.

Actions

Also available in: Atom PDF