Project

General

Profile

Actions

Bug #57175

open

cephadm: don't try to write client/os tuning files to offline hosts

Added by Adam King over 1 year ago. Updated about 1 year ago.

Status:
Pending Backport
Priority:
High
Assignee:
Category:
-
Target version:
-
% Done:

0%

Source:
Tags:
backport_processed
Backport:
quincy, pacific
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

if a host is known to be offline, we shouldn't continue to try handle os tuning profiles or client files on that host. This can actually cause an exception that will block further progress in the serve loop

2022-08-17T21:46:17.371392+0000 mgr.vm-00.eudebg [DBG] Running command: ls /etc/sysctl.d
2022-08-17T21:46:17.395290+0000 mgr.vm-00.eudebg [DBG] Running command: ls /etc/sysctl.d
2022-08-17T21:46:17.417827+0000 mgr.vm-00.eudebg [DBG] Running command: ls /etc/sysctl.d
2022-08-17T21:46:17.443274+0000 mgr.vm-00.eudebg [DBG] Opening connection to root@192.168.122.80 with ssh options '-F /tmp/cephadm-conf-f351v8jc -i /tmp/cephadm-identity-nxo5q57o'
2022-08-17T21:46:20.400567+0000 mgr.vm-00.eudebg [ERR] Can't communicate with remote host `192.168.122.80`, possibly because python3 is not installed there or you are missing NOPASSWD in sudoers. [Errno 113] Connect call failed ('192.168.122.80', 22)
Traceback (most recent call last):
  File "/usr/share/ceph/mgr/cephadm/ssh.py", line 103, in redirect_log
    yield
  File "/usr/share/ceph/mgr/cephadm/ssh.py", line 82, in _remote_connection
    preferred_auth=['publickey'], options=ssh_options)
  File "/lib/python3.6/site-packages/asyncssh/connection.py", line 6804, in connect
    'Opening SSH connection to')
  File "/lib/python3.6/site-packages/asyncssh/connection.py", line 299, in _connect
    local_addr=local_addr)
  File "/lib64/python3.6/asyncio/base_events.py", line 794, in create_connection
    raise exceptions[0]
  File "/lib64/python3.6/asyncio/base_events.py", line 781, in create_connection
    yield from self.sock_connect(sock, address)
  File "/lib64/python3.6/asyncio/selector_events.py", line 439, in sock_connect
    return (yield from fut)
  File "/lib64/python3.6/asyncio/selector_events.py", line 469, in _sock_connect_cb
    raise OSError(err, 'Connect call failed %s' % (address,))
OSError: [Errno 113] Connect call failed ('192.168.122.80', 22)

Related issues 3 (1 open2 closed)

Related to Orchestrator - Bug #63756: Can't communicate with remote host, possibly because the host is not reachable or python3 is not installed on the hostNew

Actions
Copied to Orchestrator - Backport #57377: quincy: cephadm: don't try to write client/os tuning files to offline hostsResolvedAdam KingActions
Copied to Orchestrator - Backport #59649: pacific: cephadm: don't try to write client/os tuning files to offline hostsResolvedMichael FritchActions
Actions

Also available in: Atom PDF