Actions
Bug #55774
closedoctopus: prometheus, and selftest failures due to timed-out or failed connections
Status:
Can't reproduce
Priority:
Immediate
Assignee:
-
Category:
-
Target version:
-
% Done:
0%
Source:
Q/A
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):
Description
Octopus runs in the teuthology rados suite are experiencing many consistent failures of this kind:
/a/yuriw-2022-05-19_14:09:24-rados-wip-yuri6-testing-2022-05-17-1603-octopus-distro-default-smithi/6841360
2022-05-19T14:38:06.416 INFO:tasks.cephfs_test_runner:======================================================================
2022-05-19T14:38:06.417 INFO:tasks.cephfs_test_runner:ERROR: test_standby (tasks.mgr.test_prometheus.TestPrometheus)
2022-05-19T14:38:06.417 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
2022-05-19T14:38:06.417 INFO:tasks.cephfs_test_runner:Traceback (most recent call last):
2022-05-19T14:38:06.417 INFO:tasks.cephfs_test_runner: File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connection.py", line 175, in _new_conn
2022-05-19T14:38:06.418 INFO:tasks.cephfs_test_runner: (self._dns_host, self.port), self.timeout, **extra_kw
2022-05-19T14:38:06.418 INFO:tasks.cephfs_test_runner: File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/util/connection.py", line 72, in create_connection
2022-05-19T14:38:06.418 INFO:tasks.cephfs_test_runner: for res in socket.getaddrinfo(host, port, family, socket.SOCK_STREAM):
2022-05-19T14:38:06.419 INFO:tasks.cephfs_test_runner: File "/usr/lib/python3.6/socket.py", line 745, in getaddrinfo
2022-05-19T14:38:06.419 INFO:tasks.cephfs_test_runner: for res in _socket.getaddrinfo(host, port, family, type, proto, flags):
2022-05-19T14:38:06.419 INFO:tasks.cephfs_test_runner:socket.gaierror: [Errno -2] Name or service not known
2022-05-19T14:38:06.420 INFO:tasks.cephfs_test_runner:
2022-05-19T14:38:06.420 INFO:tasks.cephfs_test_runner:During handling of the above exception, another exception occurred:
2022-05-19T14:38:06.421 INFO:tasks.cephfs_test_runner:
2022-05-19T14:38:06.421 INFO:tasks.cephfs_test_runner:Traceback (most recent call last):
2022-05-19T14:38:06.421 INFO:tasks.cephfs_test_runner: File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connectionpool.py", line 710, in urlopen
2022-05-19T14:38:06.422 INFO:tasks.cephfs_test_runner: chunked=chunked,
2022-05-19T14:38:06.422 INFO:tasks.cephfs_test_runner: File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connectionpool.py", line 398, in _make_request
2022-05-19T14:38:06.423 INFO:tasks.cephfs_test_runner: conn.request(method, url, **httplib_request_kw)
2022-05-19T14:38:06.423 INFO:tasks.cephfs_test_runner: File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connection.py", line 239, in request
2022-05-19T14:38:06.424 INFO:tasks.cephfs_test_runner: super(HTTPConnection, self).request(method, url, body=body, headers=headers)
2022-05-19T14:38:06.424 INFO:tasks.cephfs_test_runner: File "/usr/lib/python3.6/http/client.py", line 1285, in request
2022-05-19T14:38:06.424 INFO:tasks.cephfs_test_runner: self._send_request(method, url, body, headers, encode_chunked)
2022-05-19T14:38:06.424 INFO:tasks.cephfs_test_runner: File "/usr/lib/python3.6/http/client.py", line 1331, in _send_request
2022-05-19T14:38:06.425 INFO:tasks.cephfs_test_runner: self.endheaders(body, encode_chunked=encode_chunked)
2022-05-19T14:38:06.425 INFO:tasks.cephfs_test_runner: File "/usr/lib/python3.6/http/client.py", line 1280, in endheaders
2022-05-19T14:38:06.425 INFO:tasks.cephfs_test_runner: self._send_output(message_body, encode_chunked=encode_chunked)
2022-05-19T14:38:06.426 INFO:tasks.cephfs_test_runner: File "/usr/lib/python3.6/http/client.py", line 1046, in _send_output
2022-05-19T14:38:06.426 INFO:tasks.cephfs_test_runner: self.send(msg)
2022-05-19T14:38:06.426 INFO:tasks.cephfs_test_runner: File "/usr/lib/python3.6/http/client.py", line 984, in send
2022-05-19T14:38:06.426 INFO:tasks.cephfs_test_runner: self.connect()
2022-05-19T14:38:06.427 INFO:tasks.cephfs_test_runner: File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connection.py", line 205, in connect
2022-05-19T14:38:06.427 INFO:tasks.cephfs_test_runner: conn = self._new_conn()
2022-05-19T14:38:06.427 INFO:tasks.cephfs_test_runner: File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connection.py", line 187, in _new_conn
2022-05-19T14:38:06.428 INFO:tasks.cephfs_test_runner: self, "Failed to establish a new connection: %s" % e
2022-05-19T14:38:06.428 INFO:tasks.cephfs_test_runner:urllib3.exceptions.NewConnectionError: <urllib3.connection.HTTPConnection object at 0x7f76c52785c0>: Failed to establish a new connection: [Errno -2] Name or service not known
2022-05-19T14:58:10.334 INFO:tasks.cephfs_test_runner:======================================================================
2022-05-19T14:58:10.335 INFO:tasks.cephfs_test_runner:ERROR: test_selftest_command_spam (tasks.mgr.test_module_selftest.TestModuleSelftest)
2022-05-19T14:58:10.335 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
2022-05-19T14:58:10.335 INFO:tasks.cephfs_test_runner:Traceback (most recent call last):
2022-05-19T14:58:10.336 INFO:tasks.cephfs_test_runner: File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connection.py", line 175, in _new_conn
2022-05-19T14:58:10.336 INFO:tasks.cephfs_test_runner: (self._dns_host, self.port), self.timeout, **extra_kw
2022-05-19T14:58:10.336 INFO:tasks.cephfs_test_runner: File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/util/connection.py", line 72, in create_connection
2022-05-19T14:58:10.337 INFO:tasks.cephfs_test_runner: for res in socket.getaddrinfo(host, port, family, socket.SOCK_STREAM):
2022-05-19T14:58:10.337 INFO:tasks.cephfs_test_runner: File "/usr/lib/python3.6/socket.py", line 745, in getaddrinfo
2022-05-19T14:58:10.337 INFO:tasks.cephfs_test_runner: for res in _socket.getaddrinfo(host, port, family, type, proto, flags):
2022-05-19T14:58:10.337 INFO:tasks.cephfs_test_runner:socket.gaierror: [Errno -2] Name or service not known
2022-05-19T14:58:10.338 INFO:tasks.cephfs_test_runner:
2022-05-19T14:58:10.338 INFO:tasks.cephfs_test_runner:During handling of the above exception, another exception occurred:
2022-05-19T14:58:10.338 INFO:tasks.cephfs_test_runner:
2022-05-19T14:58:10.339 INFO:tasks.cephfs_test_runner:Traceback (most recent call last):
2022-05-19T14:58:10.339 INFO:tasks.cephfs_test_runner: File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connectionpool.py", line 710, in urlopen
2022-05-19T14:58:10.339 INFO:tasks.cephfs_test_runner: chunked=chunked,
2022-05-19T14:58:10.340 INFO:tasks.cephfs_test_runner: File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connectionpool.py", line 386, in _make_request
2022-05-19T14:58:10.340 INFO:tasks.cephfs_test_runner: self._validate_conn(conn)
2022-05-19T14:58:10.340 INFO:tasks.cephfs_test_runner: File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connectionpool.py", line 1040, in _validate_conn
2022-05-19T14:58:10.341 INFO:tasks.cephfs_test_runner: conn.connect()
2022-05-19T14:58:10.341 INFO:tasks.cephfs_test_runner: File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connection.py", line 358, in connect
2022-05-19T14:58:10.341 INFO:tasks.cephfs_test_runner: self.sock = conn = self._new_conn()
2022-05-19T14:58:10.342 INFO:tasks.cephfs_test_runner: File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connection.py", line 187, in _new_conn
2022-05-19T14:58:10.342 INFO:tasks.cephfs_test_runner: self, "Failed to establish a new connection: %s" % e
2022-05-19T14:58:10.342 INFO:tasks.cephfs_test_runner:urllib3.exceptions.NewConnectionError: <urllib3.connection.HTTPSConnection object at 0x7f4e65e69320>: Failed to establish a new connection: [Errno -2] Name or service not known
Other examples can be found in this run: /a/yuriw-2022-05-16_21:54:48-rados-wip-yuri3-testing-2022-05-12-1609-octopus-distro-default-smithi/
Note: Filing under mgr project, but it could well go under sepia or teuthology.
During handling of the above exception, another exception occurred: Traceback (most recent call last): File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connectionpool.py", line 710, in urlope chunked=chunked, File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connectionpool.py", line 398, in _make_ conn.request(method, url, **httplib_request_kw) File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connection.py", line 239, in request super(HTTPConnection, self).request(method, url, body=body, headers=headers) File "/usr/lib/python3.6/http/client.py", line 1285, in request self._send_request(method, url, body, headers, encode_chunked) File "/usr/lib/python3.6/http/client.py", line 1331, in _send_request self.endheaders(body, encode_chunked=encode_chunked) File "/usr/lib/python3.6/http/client.py", line 1280, in endheaders self._send_output(message_body, encode_chunked=encode_chunked) File "/usr/lib/python3.6/http/client.py", line 1046, in _send_output self.send(msg) File "/usr/lib/python3.6/http/client.py", line 984, in send self.connect() File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connection.py", line 205, in connect conn = self._new_conn() File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connection.py", line 187, in _new_conn self, "Failed to establish a new connection: %s" % e urllib3.exceptions.NewConnectionError: <urllib3.connection.HTTPConnection object at 0x7f76c52785c0>: Failed to establish a new connection: [Errno -2] Name or service not known During handling of the above exception, another exception occurred: Traceback (most recent call last): File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/requests/adapters.py", line 450, in send timeout=timeout File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/connectionpool.py", line 786, in urlope method, url, error=e, _pool=self, _stacktrace=sys.exc_info()[2] File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/urllib3/util/retry.py", line 592, in increment raise MaxRetryError(_pool, url, error or ResponseError(cause)) urllib3.exceptions.MaxRetryError: HTTPConnectionPool(host='smithi134', port=7789): Max retries exceeded with url: / (Caused by NewConnectionError('<urllib3.connection.HTTPConnection obj During handling of the above exception, another exception occurred: Traceback (most recent call last): File "/home/teuthworker/src/github.com_ceph_ceph-c_9dfe5561e7f8bbf1095613ed99b58dd72943d57a/qa/tasks/mgr/test_prometheus.py", line 47, in test_standby r = requests.get(original_uri, allow_redirects=False) File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/requests/api.py", line 75, in get return request('get', url, params=params, **kwargs) File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/requests/api.py", line 61, in request return session.request(method=method, url=url, **kwargs) File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/requests/sessions.py", line 529, in request resp = self.send(prep, **send_kwargs) File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/requests/sessions.py", line 645, in send r = adapter.send(request, **kwargs) File "/home/teuthworker/src/git.ceph.com_git_teuthology_b018fa018d3f885cbc018f839b42536477653f84/virtualenv/lib/python3.6/site-packages/requests/adapters.py", line 519, in send raise ConnectionError(e, request=request) requests.exceptions.ConnectionError: HTTPConnectionPool(host='smithi134', port=7789): Max retries exceeded with url: / (Caused by NewConnectionError('<urllib3.connection.HTTPConnection ---------------------------------------------------------------------- Ran 2 tests in 159.362s FAILED (errors=1)
Updated by Ernesto Puerta almost 2 years ago
- Copied from Bug #55723: octopus: dashboard failures due to timed-out or failed connections added
Updated by Laura Flores almost 2 years ago
- Status changed from Triaged to Can't reproduce
Marking this as "Can't reproduce" since it is not coming up in octopus runs anymore. Please re-open if it does.
Actions