Bug #10869

closed

Workunit task should not use the same filename to store the list of workunits

Added by Yuri Weinstein about 9 years ago. Updated about 9 years ago.

Status:
Resolved
Priority:
Normal
Category:
-
Target version:
% Done:

0%

Source:
Q/A
Tags:
Backport:
dumpling
Regression:
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Crash signature (v1):
Crash signature (v2):

Description

Release giant v0.87.1

Run: http://pulpito.front.sepia.ceph.com/teuthology-2015-02-11_15:40:51-upgrade:firefly-x-giant-distro-basic-multi/
Jobs: 750897, 750899
Logs: http://qa-proxy.ceph.com/teuthology/teuthology-2015-02-11_15:40:51-upgrade:firefly-x-giant-distro-basic-multi/750897/teuthology.log

2015-02-11T21:34:42.357 INFO:tasks.workunit:Stopping ['rbd/test_librbd_python.sh'] on client.4...
2015-02-11T21:34:42.357 INFO:teuthology.orchestra.run.plana17:Running: 'rm -rf -- /home/ubuntu/cephtest/workunits.list /home/ubuntu/cephtest/workunit.client.4'
2015-02-11T21:34:42.361 INFO:tasks.workunit.client.2.plana17.stdout:make[1]: Leaving directory `/home/ubuntu/cephtest/workunit.client.2/fs'
2015-02-11T21:34:42.373 ERROR:teuthology.parallel:Exception in parallel execution
Traceback (most recent call last):
  File "/home/teuthworker/src/teuthology_master/teuthology/parallel.py", line 82, in __exit__
    for result in self:
  File "/home/teuthworker/src/teuthology_master/teuthology/parallel.py", line 101, in next
    resurrect_traceback(result)
  File "/home/teuthworker/src/teuthology_master/teuthology/parallel.py", line 19, in capture_traceback
    return func(*args, **kwargs)
  File "/var/lib/teuthworker/src/ceph-qa-suite_giant/tasks/workunit.py", line 317, in _run_tests
    '{tdir}/workunits.list'.format(tdir=testdir)).split('\0'))
  File "/home/teuthworker/src/teuthology_master/teuthology/misc.py", line 699, in get_file
    args.extend([
  File "/home/teuthworker/src/teuthology_master/teuthology/orchestra/remote.py", line 233, in get_file
    self._sftp_get_file(path, local_temp_path)
  File "/home/teuthworker/src/teuthology_master/teuthology/orchestra/remote.py", line 192, in _sftp_get_file
    sftp.get(remote_path, local_path)
  File "/usr/lib/python2.7/dist-packages/paramiko/sftp_client.py", line 606, in get
    fr = self.file(remotepath, 'rb')
  File "/usr/lib/python2.7/dist-packages/paramiko/sftp_client.py", line 245, in open
    t, msg = self._request(CMD_OPEN, filename, imode, attrblock)
  File "/usr/lib/python2.7/dist-packages/paramiko/sftp_client.py", line 635, in _request
    return self._read_response(num)
  File "/usr/lib/python2.7/dist-packages/paramiko/sftp_client.py", line 682, in _read_response
    self._convert_status(msg)
  File "/usr/lib/python2.7/dist-packages/paramiko/sftp_client.py", line 708, in _convert_status
    raise IOError(errno.ENOENT, text)
IOError: [Errno 2] No such file

Notice:

2015-02-11T21:34:42.357 INFO:teuthology.orchestra.run.plana17:Running: 'rm -rf -- /home/ubuntu/cephtest/workunits.list /home/ubuntu/cephtest/workunit.client.4'

and then:

  File "/var/lib/teuthworker/src/ceph-qa-suite_giant/tasks/workunit.py", line 317, in _run_tests
    '{tdir}/workunits.list'.format(tdir=testdir)).split('\0'))

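So in upgrade tests, and in other tests where we run workloads in parallel, we "sometimes" see these failures, apparently because one client's cleanup removes the shared workunits.list before another client has read it.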
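
The interleaving above can be replayed locally. The following is a minimal sketch, not the teuthology code: the helper names (`shared_list_path`, `per_role_list_path`, `simulate`), the per-role filename pattern, and the workunit name written to the list are assumptions made for illustration. It shows that with a single shared `workunits.list`, cleanup by one client removes the file another client still needs, while a filename that includes the role avoids the collision.

```python
import os
import tempfile


def shared_list_path(testdir):
    # Path shape seen in the traceback: one file shared by every client.
    return os.path.join(testdir, "workunits.list")


def per_role_list_path(testdir, role):
    # Hypothetical alternative: include the role so each client owns its file.
    return os.path.join(testdir, "workunits.list.{role}".format(role=role))


def simulate(path_for_role):
    """Replay the logged order: client.4 cleans up before client.2 reads.

    Returns True if client.2 can still read its list of workunits.
    """
    with tempfile.TemporaryDirectory() as testdir:
        # Both clients write their NUL-separated list of workunits.
        for role in ("client.2", "client.4"):
            with open(path_for_role(testdir, role), "w") as f:
                f.write("example/workunit.sh\0")  # hypothetical workunit name

        # client.4 finishes first and removes "its" list file.
        os.remove(path_for_role(testdir, "client.4"))

        # client.2 now tries to fetch its list.
        try:
            with open(path_for_role(testdir, "client.2")) as f:
                f.read().split("\0")
            return True
        except FileNotFoundError:
            return False


# The shared name ignores the role, so both clients touch the same file.
shared_ok = simulate(lambda testdir, role: shared_list_path(testdir))
per_role_ok = simulate(per_role_list_path)
print("shared filename readable:", shared_ok)
print("per-role filename readable:", per_role_ok)
```

In the shared case the read fails with the local equivalent of the `IOError: [Errno 2] No such file` shown in the traceback; in the per-role case each client only ever removes its own file.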

Actions #1

Updated by Yuri Weinstein about 9 years ago

  • Description updated (diff)
Actions #2

Updated by Yuri Weinstein about 9 years ago

  • Status changed from New to In Progress
  • Assignee set to Yuri Weinstein
Actions #3

Updated by Yuri Weinstein about 9 years ago

  • Status changed from In Progress to Resolved