Project

General

Profile

Actions

Bug #55774

closed

octopus: prometheus, and selftest failures due to timed-out or failed connections

Added by Ernesto Puerta almost 2 years ago. Updated almost 2 years ago.

Status:
Can't reproduce
Priority:
Immediate
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
Q/A
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Octopus runs in the teuthology rados suite are experiencing many consistent failures of this kind:

/a/yuriw-2022-05-19_14:09:24-rados-wip-yuri6-testing-2022-05-17-1603-octopus-distro-default-smithi/6841360

2022-05-19T14:38:06.416 INFO:tasks.cephfs_test_runner:======================================================================
2022-05-19T14:38:06.417 INFO:tasks.cephfs_test_runner:ERROR: test_standby (tasks.mgr.test_prometheus.TestPrometheus)
2022-05-19T14:38:06.417 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
2022-05-19T14:38:06.417 INFO:tasks.cephfs_test_runner:Traceback (most recent call last):
2022-05-19T14:38:06.417 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connection.py", line 175, in _new_conn
2022-05-19T14:38:06.418 INFO:tasks.cephfs_test_runner:    (self._dns_host, self.port), self.timeout, **extra_kw
2022-05-19T14:38:06.418 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/util/connection.py", line 72, in create_connection
2022-05-19T14:38:06.418 INFO:tasks.cephfs_test_runner:    for res in socket.getaddrinfo(host, port, family, socket.SOCK_STREAM):
2022-05-19T14:38:06.419 INFO:tasks.cephfs_test_runner:  File "/usr/lib/python3.6/socket.py", line 745, in getaddrinfo
2022-05-19T14:38:06.419 INFO:tasks.cephfs_test_runner:    for res in _socket.getaddrinfo(host, port, family, type, proto, flags):
2022-05-19T14:38:06.419 INFO:tasks.cephfs_test_runner:socket.gaierror: [Errno -2] Name or service not known
2022-05-19T14:38:06.420 INFO:tasks.cephfs_test_runner:
2022-05-19T14:38:06.420 INFO:tasks.cephfs_test_runner:During handling of the above exception, another exception occurred:
2022-05-19T14:38:06.421 INFO:tasks.cephfs_test_runner:
2022-05-19T14:38:06.421 INFO:tasks.cephfs_test_runner:Traceback (most recent call last):
2022-05-19T14:38:06.421 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connectionpool.py", line 710, in urlopen
2022-05-19T14:38:06.422 INFO:tasks.cephfs_test_runner:    chunked=chunked,
2022-05-19T14:38:06.422 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connectionpool.py", line 398, in _make_request
2022-05-19T14:38:06.423 INFO:tasks.cephfs_test_runner:    conn.request(method, url, **httplib_request_kw)
2022-05-19T14:38:06.423 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connection.py", line 239, in request
2022-05-19T14:38:06.424 INFO:tasks.cephfs_test_runner:    super(HTTPConnection, self).request(method, url, body=body, headers=headers)
2022-05-19T14:38:06.424 INFO:tasks.cephfs_test_runner:  File "/usr/lib/python3.6/http/client.py", line 1285, in request
2022-05-19T14:38:06.424 INFO:tasks.cephfs_test_runner:    self._send_request(method, url, body, headers, encode_chunked)
2022-05-19T14:38:06.424 INFO:tasks.cephfs_test_runner:  File "/usr/lib/python3.6/http/client.py", line 1331, in _send_request
2022-05-19T14:38:06.425 INFO:tasks.cephfs_test_runner:    self.endheaders(body, encode_chunked=encode_chunked)
2022-05-19T14:38:06.425 INFO:tasks.cephfs_test_runner:  File "/usr/lib/python3.6/http/client.py", line 1280, in endheaders
2022-05-19T14:38:06.425 INFO:tasks.cephfs_test_runner:    self._send_output(message_body, encode_chunked=encode_chunked)
2022-05-19T14:38:06.426 INFO:tasks.cephfs_test_runner:  File "/usr/lib/python3.6/http/client.py", line 1046, in _send_output
2022-05-19T14:38:06.426 INFO:tasks.cephfs_test_runner:    self.send(msg)
2022-05-19T14:38:06.426 INFO:tasks.cephfs_test_runner:  File "/usr/lib/python3.6/http/client.py", line 984, in send
2022-05-19T14:38:06.426 INFO:tasks.cephfs_test_runner:    self.connect()
2022-05-19T14:38:06.427 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connection.py", line 205, in connect
2022-05-19T14:38:06.427 INFO:tasks.cephfs_test_runner:    conn = self._new_conn()
2022-05-19T14:38:06.427 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connection.py", line 187, in _new_conn
2022-05-19T14:38:06.428 INFO:tasks.cephfs_test_runner:    self, "Failed to establish a new connection: %s" % e
2022-05-19T14:38:06.428 INFO:tasks.cephfs_test_runner:urllib3.exceptions.NewConnectionError: <urllib3.connection.HTTPConnection object at 0x7f76c52785c0>: Failed to establish a new connection: [Errno -2] Name or service not known

2022-05-19T14:58:10.334 INFO:tasks.cephfs_test_runner:======================================================================
2022-05-19T14:58:10.335 INFO:tasks.cephfs_test_runner:ERROR: test_selftest_command_spam (tasks.mgr.test_module_selftest.TestModuleSelftest)
2022-05-19T14:58:10.335 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
2022-05-19T14:58:10.335 INFO:tasks.cephfs_test_runner:Traceback (most recent call last):
2022-05-19T14:58:10.336 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connection.py", line 175, in _new_conn
2022-05-19T14:58:10.336 INFO:tasks.cephfs_test_runner:    (self._dns_host, self.port), self.timeout, **extra_kw
2022-05-19T14:58:10.336 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/util/connection.py", line 72, in create_connection
2022-05-19T14:58:10.337 INFO:tasks.cephfs_test_runner:    for res in socket.getaddrinfo(host, port, family, socket.SOCK_STREAM):
2022-05-19T14:58:10.337 INFO:tasks.cephfs_test_runner:  File "/usr/lib/python3.6/socket.py", line 745, in getaddrinfo
2022-05-19T14:58:10.337 INFO:tasks.cephfs_test_runner:    for res in _socket.getaddrinfo(host, port, family, type, proto, flags):
2022-05-19T14:58:10.337 INFO:tasks.cephfs_test_runner:socket.gaierror: [Errno -2] Name or service not known
2022-05-19T14:58:10.338 INFO:tasks.cephfs_test_runner:
2022-05-19T14:58:10.338 INFO:tasks.cephfs_test_runner:During handling of the above exception, another exception occurred:
2022-05-19T14:58:10.338 INFO:tasks.cephfs_test_runner:
2022-05-19T14:58:10.339 INFO:tasks.cephfs_test_runner:Traceback (most recent call last):
2022-05-19T14:58:10.339 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connectionpool.py", line 710, in urlopen
2022-05-19T14:58:10.339 INFO:tasks.cephfs_test_runner:    chunked=chunked,
2022-05-19T14:58:10.340 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connectionpool.py", line 386, in _make_request
2022-05-19T14:58:10.340 INFO:tasks.cephfs_test_runner:    self._validate_conn(conn)
2022-05-19T14:58:10.340 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connectionpool.py", line 1040, in _validate_conn
2022-05-19T14:58:10.341 INFO:tasks.cephfs_test_runner:    conn.connect()
2022-05-19T14:58:10.341 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connection.py", line 358, in connect
2022-05-19T14:58:10.341 INFO:tasks.cephfs_test_runner:    self.sock = conn = self._new_conn()
2022-05-19T14:58:10.342 INFO:tasks.cephfs_test_runner:  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connection.py", line 187, in _new_conn
2022-05-19T14:58:10.342 INFO:tasks.cephfs_test_runner:    self, "Failed to establish a new connection: %s" % e
2022-05-19T14:58:10.342 INFO:tasks.cephfs_test_runner:urllib3.exceptions.NewConnectionError: <urllib3.connection.HTTPSConnection object at 0x7f4e65e69320>: Failed to establish a new connection: [Errno -2] Name or service not known

Other examples can be found in this run: /a/yuriw-2022-05-16_21:54:48-rados-wip-yuri3-testing-2022-05-12-1609-octopus-distro-default-smithi/

Note: Filing under mgr project, but it could well go under sepia or teuthology.

During handling of the above exception, another exception occurred:

Traceback (most recent call last):             
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connectionpool.py", line 710, in urlope
    chunked=chunked,                           
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connectionpool.py", line 398, in _make_
    conn.request(method, url, **httplib_request_kw)
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connection.py", line 239, in request
    super(HTTPConnection, self).request(method, url, body=body, headers=headers)
  File "/usr/lib/python3.6/http/client.py", line 1285, in request
    self._send_request(method, url, body, headers, encode_chunked)
  File "/usr/lib/python3.6/http/client.py", line 1331, in _send_request
    self.endheaders(body, encode_chunked=encode_chunked)
  File "/usr/lib/python3.6/http/client.py", line 1280, in endheaders
    self._send_output(message_body, encode_chunked=encode_chunked)
  File "/usr/lib/python3.6/http/client.py", line 1046, in _send_output
    self.send(msg)                             
  File "/usr/lib/python3.6/http/client.py", line 984, in send
    self.connect()                             
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connection.py", line 205, in connect
    conn = self._new_conn()                    
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connection.py", line 187, in _new_conn
    self, "Failed to establish a new connection: %s" % e
urllib3.exceptions.NewConnectionError: <urllib3.connection.HTTPConnection object at 0x7f76c52785c0>: Failed to establish a new connection: [Errno -2] Name or service not known

During handling of the above exception, another exception occurred:

Traceback (most recent call last):             
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/requests/adapters.py", line 450, in send
    timeout=timeout                            
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connectionpool.py", line 786, in urlope
    method, url, error=e, _pool=self, _stacktrace=sys.exc_info()[2]
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/util/retry.py", line 592, in increment
    raise MaxRetryError(_pool, url, error or ResponseError(cause))
urllib3.exceptions.MaxRetryError: HTTPConnectionPool(host='smithi134', port=7789): Max retries exceeded with url: / (Caused by NewConnectionError('<urllib3.connection.HTTPConnection obj

During handling of the above exception, another exception occurred:

Traceback (most recent call last):             
  File "/home/teuthworker/src/github.com_ceph_ceph-c_9dfe5561e7f8bbf1095613ed99b58dd72943d57a/qa/tasks/mgr/test_prometheus.py", line 47, in test_standby
    r = requests.get(original_uri, allow_redirects=False)
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/requests/api.py", line 75, in get
    return request('get', url, params=params, **kwargs)
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/requests/api.py", line 61, in request
    return session.request(method=method, url=url, **kwargs)
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/requests/sessions.py", line 529, in request
    resp = self.send(prep, **send_kwargs)      
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/requests/sessions.py", line 645, in send
    r = adapter.send(request, **kwargs)        
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/requests/adapters.py", line 519, in send
    raise ConnectionError(e, request=request)  
requests.exceptions.ConnectionError: HTTPConnectionPool(host='smithi134', port=7789): Max retries exceeded with url: / (Caused by NewConnectionError('<urllib3.connection.HTTPConnection 

----------------------------------------------------------------------
Ran 2 tests in 159.362s                                                                                                                                                                  

FAILED (errors=1)                            

Related issues 1 (1 open0 closed)

Copied from Dashboard - Bug #55723: octopus: dashboard failures due to timed-out or failed connectionsTriagedAvan Thakkar

Actions
Actions #1

Updated by Ernesto Puerta almost 2 years ago

  • Copied from Bug #55723: octopus: dashboard failures due to timed-out or failed connections added
Actions #2

Updated by Laura Flores almost 2 years ago

  • Status changed from Triaged to Can't reproduce

Marking this as "Can't reproduce" since it is not coming up in octopus runs anymore. Please re-open if it does.

Actions

Also available in: Atom PDF