Project

General

Profile

Actions

Bug #11035

open

paramiko: Teuthology shouldn't have to time-out when getting a socket error copying a file

Added by David Zafman about 9 years ago. Updated over 8 years ago.

Status:
In Progress
Priority:
Normal
Assignee:
Category:
-
% Done:

0%

Source:
other
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Crash signature (v1):
Crash signature (v2):

Description

The result is a job takes over 2 hours to terminate and get marked "dead."

2015-03-04T19:17:00.172 INFO:tasks.thrashosds.thrasher:Moving pg 3.4s1 from osd.2 to osd.3
2015-03-04T19:17:00.172 INFO:tasks.thrashosds.thrasher:Transfer export file from ubuntu@burnupi04.front.sepia.ceph.com to ubuntu@burnupi17.front.sepia.ceph.com
2015-03-04T19:17:04.679 INFO:tasks.radosbench.radosbench.0.burnupi04.stderr:2015-03-04 19:17:01.928951 7fe61c257700  0 -- 10.214.133.12:0/1008759 >> 10.214.133.12:6810/7659 pipe(0x7fe60c02a390 sd=12 :36542 s=2 pgs=74 cs=1 l=1 c=0x7fe60c019cc0).injecting socket failure
2015-03-04T19:18:29.749 ERROR:paramiko.transport:Exception: Remote transport is ignoring rekey requests
2015-03-04T19:18:29.768 ERROR:paramiko.transport:Traceback (most recent call last):
2015-03-04T19:18:29.768 ERROR:paramiko.transport:  File "/usr/lib/python2.7/dist-packages/paramiko/transport.py", line 1528, in run
2015-03-04T19:18:29.768 ERROR:paramiko.transport:    ptype, m = self.packetizer.read_message()
2015-03-04T19:18:29.768 ERROR:paramiko.transport:  File "/usr/lib/python2.7/dist-packages/paramiko/packet.py", line 378, in read_message
2015-03-04T19:18:29.768 ERROR:paramiko.transport:    raise SSHException('Remote transport is ignoring rekey requests')
2015-03-04T19:18:29.769 ERROR:paramiko.transport:SSHException: Remote transport is ignoring rekey requests
2015-03-04T19:18:29.769 ERROR:paramiko.transport:
2015-03-04T19:18:29.785 INFO:tasks.radosbench.radosbench.0.burnupi04.stderr:2015-03-04 19:18:29.788649 7fe617fff700  0 monclient: hunting for new mon
2015-03-04T19:18:29.798 INFO:tasks.radosbench.radosbench.0.burnupi04.stderr:2015-03-04 19:18:29.801231 7fe603fe2700  0 -- 10.214.133.12:0/1008759 >> 10.214.134.18:6805/56832 pipe(0x7fe60c01b150 sd=10 :0 s=1 pgs=0 cs=0 l=1 c=0x7fe60c019e60).fault
2015-03-04T19:18:29.798 INFO:tasks.radosbench.radosbench.0.burnupi04.stderr:2015-03-04 19:18:29.801454 7fe61c156700  0 -- 10.214.133.12:0/1008759 >> 10.214.134.18:6810/57820 pipe(0x7fe60c022870 sd=12 :0 s=1 pgs=0 cs=0 l=1 c=0x7fe60c00b300).fault
2015-03-04T21:08:23.570 ERROR:teuthology.run_tasks:Manager failed: radosbench
Traceback (most recent call last):
  File "/home/teuthworker/src/teuthology_master/teuthology/run_tasks.py", line 125, in run_tasks
    suppress = manager.__exit__(*exc_info)
  File "/usr/lib/python2.7/contextlib.py", line 24, in __exit__
    self.gen.next()
  File "/var/lib/teuthworker/src/ceph-qa-suite_wip-cot-firefly/tasks/radosbench.py", line 80, in task
    run.wait(radosbench.itervalues(), timeout=timeout)
  File "/home/teuthworker/src/teuthology_master/teuthology/orchestra/run.py", line 395, in wait
    check_time()
  File "/home/teuthworker/src/teuthology_master/teuthology/contextutil.py", line 134, in __call__
    raise MaxWhileTries(error_msg)
MaxWhileTries: reached maximum tries (1500) after waiting for 9000 seconds
2015-03-04T21:08:23.701 INFO:tasks.thrashosds:joining thrashosds
2015-03-04T21:08:23.702 ERROR:teuthology.run_tasks:Manager failed: thrashosds
Traceback (most recent call last):
  File "/home/teuthworker/src/teuthology_master/teuthology/run_tasks.py", line 125, in run_tasks
    suppress = manager.__exit__(*exc_info)
  File "/usr/lib/python2.7/contextlib.py", line 35, in __exit__
    self.gen.throw(type, value, traceback)
  File "/var/lib/teuthworker/src/ceph-qa-suite_wip-cot-firefly/tasks/thrashosds.py", line 183, in task
    thrash_proc.do_join()
  File "/var/lib/teuthworker/src/ceph-qa-suite_wip-cot-firefly/tasks/ceph_manager.py", line 353, in do_join
    self.thread.get()
  File "/usr/lib/python2.7/dist-packages/gevent/greenlet.py", line 308, in get
    raise self._exception
EOFError


Related issues 1 (0 open1 closed)

Related to teuthology - Bug #10776: paramiko: "Exception: Remote transport is ignoring rekey requests" in smoke-master-distro-basic-multi runCan't reproduceDan Mick02/05/2015

Actions
Actions #1

Updated by Loïc Dachary almost 9 years ago

  • Project changed from Ceph to teuthology
  • Category deleted (teuthology)
  • Regression set to No
Actions #2

Updated by Zack Cerza almost 9 years ago

  • Description updated (diff)

Fixing description formatting...

Actions #3

Updated by Zack Cerza almost 9 years ago

  • Subject changed from Teuthology shouldn't have to time-out when getting a socket error copying a file to paramiko: Teuthology shouldn't have to time-out when getting a socket error copying a file

This might be https://github.com/paramiko/paramiko/issues/49

We need to find a way to upgrade paramiko

Actions #4

Updated by Dan Mick over 8 years ago

  • Status changed from New to In Progress
Actions #5

Updated by Dan Mick over 8 years ago

  • Assignee set to Zack Cerza
Actions

Also available in: Atom PDF