Bug #50536
"Command failed (workunit test rados/test.sh)" - rados/test.sh times out on master.
Status:
New
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:
0%
Source:
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):
Description
/a/sseshasa-2021-04-23_18:11:53-rados-wip-sseshasa-testing-2021-04-23-2212-distro-basic-smithi/6068991
Noticed a test failure shown below:
2021-04-24T02:57:01.237 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp: [==========] Running 4 tests from 1 test suite.
2021-04-24T02:57:01.237 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp: [----------] Global test environment set-up.
2021-04-24T02:57:01.237 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp: [----------] 4 tests from LibRadosServicePP
2021-04-24T02:57:01.238 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp: [ RUN ] LibRadosServicePP.RegisterEarly
2021-04-24T02:57:01.238 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp: [ OK ] LibRadosServicePP.RegisterEarly (5076 ms)
2021-04-24T02:57:01.238 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp: [ RUN ] LibRadosServicePP.RegisterLate
2021-04-24T02:57:01.238 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp: [ OK ] LibRadosServicePP.RegisterLate (50 ms)
2021-04-24T02:57:01.238 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp: [ RUN ] LibRadosServicePP.Status
2021-04-24T02:57:01.238 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp: [ OK ] LibRadosServicePP.Status (20021 ms)
2021-04-24T02:57:01.238 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp: [ RUN ] LibRadosServicePP.Close
2021-04-24T02:57:01.238 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp: attempt 0 of 20
2021-04-24T02:57:01.238 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp: /home/jenkins-build/build/workspace/ceph-dev-new-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/17.0.0-3552-g2a676895/rpm/el8/BUILD/ceph-17.0.0-3552-g2a676895/src/test/librados/service_cxx.cc:73: Failure
2021-04-24T02:57:01.239 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp: Expected equality of these values:
2021-04-24T02:57:01.239 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp: 0
2021-04-24T02:57:01.239 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp: cluster.connect()
2021-04-24T02:57:01.239 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp: Which is: -110
2021-04-24T02:57:01.239 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp: [ FAILED ] LibRadosServicePP.Close (3000021 ms)
2021-04-24T02:57:01.239 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp: [----------] 4 tests from LibRadosServicePP (3025168 ms total)
2021-04-24T02:57:01.239 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp:
2021-04-24T02:57:01.240 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp: [----------] Global test environment tear-down
2021-04-24T02:57:01.240 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp: [==========] 4 tests from 1 test suite ran. (3025168 ms total)
2021-04-24T02:57:01.240 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp: [ PASSED ] 3 tests.
2021-04-24T02:57:01.240 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp: [ FAILED ] 1 test, listed below:
2021-04-24T02:57:01.240 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp: [ FAILED ] LibRadosServicePP.Close
2021-04-24T02:57:01.240 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp:
2021-04-24T02:57:01.240 INFO:tasks.workunit.client.0.smithi003.stdout: api_service_pp: 1 FAILED TEST
Much later the test times out:
2021-04-24T05:06:35.852 INFO:tasks.workunit.client.0.smithi003.stderr:++ cleanup
2021-04-24T05:06:35.852 INFO:tasks.workunit.client.0.smithi003.stderr:++ pkill -P 40307
2021-04-24T05:06:35.863 INFO:teuthology.orchestra.run.smithi037.stderr:admin_socket: exception getting command descriptions: [Errno 111] Connection refused
2021-04-24T05:06:35.864 DEBUG:teuthology.orchestra.run:got remote process result: 22
2021-04-24T05:06:35.865 DEBUG:teuthology.orchestra.run.smithi037:> sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 30 ceph --cluster ceph --admin-daemon /var/run/ceph/ceph-osd.7.asok dump_blocked_ops
2021-04-24T05:06:35.870 INFO:tasks.workunit.client.0.smithi003.stderr:/home/ubuntu/cephtest/clone.client.0/qa/workunits/rados/test.sh: line 10: 40312 Terminated bash -o pipefail -exc "ceph_test_rados_$f $color 2>&1 | tee ceph_test_rados_$ff.log | sed \"s/^/$r: /\""
2021-04-24T05:06:35.870 INFO:tasks.workunit.client.0.smithi003.stderr:/home/ubuntu/cephtest/clone.client.0/qa/workunits/rados/test.sh: line 10: 40320 Terminated bash -o pipefail -exc "ceph_test_rados_$f $color 2>&1 | tee ceph_test_rados_$ff.log | sed \"s/^/$r: /\""
2021-04-24T05:06:35.871 INFO:tasks.workunit.client.0.smithi003.stderr:/home/ubuntu/cephtest/clone.client.0/qa/workunits/rados/test.sh: line 10: 40352 Terminated bash -o pipefail -exc "ceph_test_rados_$f $color 2>&1 | tee ceph_test_rados_$ff.log | sed \"s/^/$r: /\""
2021-04-24T05:06:35.871 INFO:tasks.workunit.client.0.smithi003.stderr:/home/ubuntu/cephtest/clone.client.0/qa/workunits/rados/test.sh: line 10: 40364 Terminated bash -o pipefail -exc "ceph_test_rados_$f $color 2>&1 | tee ceph_test_rados_$ff.log | sed \"s/^/$r: /\""
2021-04-24T05:06:35.871 INFO:tasks.workunit.client.0.smithi003.stderr:/home/ubuntu/cephtest/clone.client.0/qa/workunits/rados/test.sh: line 10: 40436 Terminated bash -o pipefail -exc "ceph_test_rados_$f $color 2>&1 | tee ceph_test_rados_$ff.log | sed \"s/^/$r: /\""
2021-04-24T05:06:35.871 INFO:tasks.workunit.client.0.smithi003.stderr:/home/ubuntu/cephtest/clone.client.0/qa/workunits/rados/test.sh: line 10: 40465 Terminated bash -o pipefail -exc "ceph_test_rados_$f $color 2>&1 | tee ceph_test_rados_$ff.log | sed \"s/^/$r: /\""
2021-04-24T05:06:35.871 INFO:tasks.workunit.client.0.smithi003.stderr:/home/ubuntu/cephtest/clone.client.0/qa/workunits/rados/test.sh: line 10: 40480 Terminated bash -o pipefail -exc "ceph_test_rados_$f $color 2>&1 | tee ceph_test_rados_$ff.log | sed \"s/^/$r: /\""
2021-04-24T05:06:35.871 INFO:tasks.workunit.client.0.smithi003.stderr:/home/ubuntu/cephtest/clone.client.0/qa/workunits/rados/test.sh: line 10: 40501 Terminated bash -o pipefail -exc "ceph_test_rados_$f $color 2>&1 | tee ceph_test_rados_$ff.log | sed \"s/^/$r: /\""
2021-04-24T05:06:35.872 INFO:tasks.workunit.client.0.smithi003.stderr:/home/ubuntu/cephtest/clone.client.0/qa/workunits/rados/test.sh: line 10: 40524 Terminated bash -o pipefail -exc "ceph_test_rados_$f $color 2>&1 | tee ceph_test_rados_$ff.log | sed \"s/^/$r: /\""
2021-04-24T05:06:35.872 INFO:tasks.workunit.client.0.smithi003.stderr:/home/ubuntu/cephtest/clone.client.0/qa/workunits/rados/test.sh: line 10: 40551 Terminated bash -o pipefail -exc "ceph_test_rados_$f $color 2>&1 | tee ceph_test_rados_$ff.log | sed \"s/^/$r: /\""
2021-04-24T05:06:35.872 INFO:tasks.workunit.client.0.smithi003.stderr:/home/ubuntu/cephtest/clone.client.0/qa/workunits/rados/test.sh: line 10: 40599 Terminated bash -o pipefail -exc "ceph_test_rados_$f $color 2>&1 | tee ceph_test_rados_$ff.log | sed \"s/^/$r: /\""
2021-04-24T05:06:35.872 INFO:tasks.workunit.client.0.smithi003.stderr:/home/ubuntu/cephtest/clone.client.0/qa/workunits/rados/test.sh: line 10: 40629 Terminated bash -o pipefail -exc "ceph_test_rados_$f $color 2>&1 | tee ceph_test_rados_$ff.log | sed \"s/^/$r: /\""
2021-04-24T05:06:35.872 INFO:tasks.workunit.client.0.smithi003.stderr:/home/ubuntu/cephtest/clone.client.0/qa/workunits/rados/test.sh: line 10: 40758 Terminated bash -o pipefail -exc "ceph_test_rados_$f $color 2>&1 | tee ceph_test_rados_$ff.log | sed \"s/^/$r: /\""
2021-04-24T05:06:35.873 DEBUG:teuthology.orchestra.run:got remote process result: 124
2021-04-24T05:06:35.874 INFO:tasks.workunit.client.0.smithi003.stderr:++ true
2021-04-24T05:06:35.874 INFO:tasks.workunit:Stopping ['rados/test.sh'] on client.0...
2021-04-24T05:06:35.874 DEBUG:teuthology.orchestra.run.smithi003:> sudo rm -rf -- /home/ubuntu/cephtest/workunits.list.client.0 /home/ubuntu/cephtest/clone.client.0
2021-04-24T05:06:35.890 ERROR:teuthology.orchestra.daemon.state:Failed to send signal 1: None
Traceback (most recent call last):
File "/home/teuthworker/src/git.ceph.com_git_teuthology_2713a3cd31b17738a50039eaa9d859b5dc39fb8a/teuthology/orchestra/daemon/state.py", line 108, in signal
self.proc.stdin.write(struct.pack('!b', sig))
File "/home/teuthworker/src/git.ceph.com_git_teuthology_2713a3cd31b17738a50039eaa9d859b5dc39fb8a/virtualenv/lib/python3.6/site-packages/paramiko/file.py", line 405, in write
self._write_all(data)
File "/home/teuthworker/src/git.ceph.com_git_teuthology_2713a3cd31b17738a50039eaa9d859b5dc39fb8a/virtualenv/lib/python3.6/site-packages/paramiko/file.py", line 522, in _write_all
count = self._write(data)
File "/home/teuthworker/src/git.ceph.com_git_teuthology_2713a3cd31b17738a50039eaa9d859b5dc39fb8a/virtualenv/lib/python3.6/site-packages/paramiko/channel.py", line 1364, in _write
self.channel.sendall(data)
File "/home/teuthworker/src/git.ceph.com_git_teuthology_2713a3cd31b17738a50039eaa9d859b5dc39fb8a/virtualenv/lib/python3.6/site-packages/paramiko/channel.py", line 846, in sendall
sent = self.send(s)
File "/home/teuthworker/src/git.ceph.com_git_teuthology_2713a3cd31b17738a50039eaa9d859b5dc39fb8a/virtualenv/lib/python3.6/site-packages/paramiko/channel.py", line 801, in send
return self._send(s, m)
File "/home/teuthworker/src/git.ceph.com_git_teuthology_2713a3cd31b17738a50039eaa9d859b5dc39fb8a/virtualenv/lib/python3.6/site-packages/paramiko/channel.py", line 1198, in _send
raise socket.error("Socket is closed")
OSError: Socket is closed
History
#1 Updated by Greg Farnum over 2 years ago
- Project changed from Ceph to RADOS
#2 Updated by Matan Breizman about 1 year ago
OSError: Socket is closed
/a/yuriw-2022-08-15_17:54:08-rados-wip-yuri2-testing-2022-08-15-0848-quincy-distro-default-smithi/6973901
#3 Updated by Matan Breizman about 1 year ago
/a/yuriw-2022-08-22_21:19:34-rados-wip-yuri4-testing-2022-08-18-1020-pacific-distro-default-smithi/6986471
#4 Updated by Radoslaw Zarzynski about 1 year ago
It think the reoccurrences are about failure in a different place – at least in the latest one the LibRadosServicePP.Close
is green.
rzarzynski@teuthology:/a/yuriw-2022-08-22_21:19:34-rados-wip-yuri4-testing-2022-08-18-1020-pacific-distro-default-smithi/6986471$ less teuthology.log ... 2022-08-22T23:30:52.573 INFO:tasks.workunit.client.0.smithi114.stdout: api_service_pp: [----------] Global test environment set-up. 2022-08-22T23:30:52.573 INFO:tasks.workunit.client.0.smithi114.stdout: api_service_pp: [----------] 4 tests from LibRadosServicePP 2022-08-22T23:30:52.573 INFO:tasks.workunit.client.0.smithi114.stdout: api_service_pp: [ RUN ] LibRadosServicePP.RegisterEarly 2022-08-22T23:30:52.573 INFO:tasks.workunit.client.0.smithi114.stdout: api_service_pp: [ OK ] LibRadosServicePP.RegisterEarly (5394 ms) 2022-08-22T23:30:52.573 INFO:tasks.workunit.client.0.smithi114.stdout: api_service_pp: [ RUN ] LibRadosServicePP.RegisterLate 2022-08-22T23:30:52.574 INFO:tasks.workunit.client.0.smithi114.stdout: api_service_pp: [ OK ] LibRadosServicePP.RegisterLate (638 ms) 2022-08-22T23:30:52.574 INFO:tasks.workunit.client.0.smithi114.stdout: api_service_pp: [ RUN ] LibRadosServicePP.Status 2022-08-22T23:30:52.574 INFO:tasks.workunit.client.0.smithi114.stdout: api_service_pp: [ OK ] LibRadosServicePP.Status (20578 ms) 2022-08-22T23:30:52.574 INFO:tasks.workunit.client.0.smithi114.stdout: api_service_pp: [ RUN ] LibRadosServicePP.Close 2022-08-22T23:30:52.575 INFO:tasks.workunit.client.0.smithi114.stdout: api_service_pp: attempt 0 of 20 2022-08-22T23:30:52.575 INFO:tasks.workunit.client.0.smithi114.stdout: api_service_pp: [ OK ] LibRadosServicePP.Close (5381 ms) 2022-08-22T23:30:52.575 INFO:tasks.workunit.client.0.smithi114.stdout: api_service_pp: [----------] 4 tests from LibRadosServicePP (31991 ms total) 2022-08-22T23:30:52.575 INFO:tasks.workunit.client.0.smithi114.stdout: api_service_pp: 2022-08-22T23:30:52.576 INFO:tasks.workunit.client.0.smithi114.stdout: api_service_pp: [----------] Global test environment tear-down 2022-08-22T23:30:52.576 INFO:tasks.workunit.client.0.smithi114.stdout: api_service_pp: [==========] 4 tests from 1 test suite ran. (31991 ms total) 2022-08-22T23:30:52.577 INFO:tasks.workunit.client.0.smithi114.stdout: api_service_pp: [ PASSED ] 4 tests.