Project

General

Profile

Actions

Bug #42026

closed

test: LibCephFS.ShutdownRace segfaults (msgr v2 related part)

Added by Patrick Donnelly over 4 years ago. Updated about 4 years ago.

Status:
Resolved
Priority:
Urgent
Category:
-
Target version:
% Done:

0%

Source:
Development
Tags:
Backport:
nautilus
Regression:
No
Severity:
2 - major
Reviewed:
Affected Versions:
ceph-qa-suite:
fs
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

2019-09-24T10:04:18.579 INFO:tasks.workunit.client.0.smithi026.stdout:[ RUN      ] LibCephFS.ShutdownRace
2019-09-24T10:04:18.935 INFO:teuthology.orchestra.run.smithi026:Running:
2019-09-24T10:04:18.935 INFO:teuthology.orchestra.run.smithi026:> sudo logrotate /etc/logrotate.d/ceph-test.conf
2019-09-24T10:04:18.939 INFO:teuthology.orchestra.run.smithi033:Running:
2019-09-24T10:04:18.939 INFO:teuthology.orchestra.run.smithi033:> sudo logrotate /etc/logrotate.d/ceph-test.conf
2019-09-24T10:04:37.483 DEBUG:teuthology.orchestra.run:got remote process result: 141
2019-09-24T10:04:37.483 INFO:tasks.workunit:Stopping ['libcephfs/test.sh'] on client.0...
2019-09-24T10:04:37.483 INFO:teuthology.orchestra.run.smithi026:Running:
2019-09-24T10:04:37.484 INFO:teuthology.orchestra.run.smithi026:> sudo rm -rf -- /home/ubuntu/cephtest/workunits.list.client.0 /home/ubuntu/cephtest/clone.client.0
2019-09-24T10:04:37.701 ERROR:teuthology.run_tasks:Saw exception from tasks.
Traceback (most recent call last):
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_master/teuthology/run_tasks.py", line 86, in run_tasks
    manager = run_one_task(taskname, ctx=ctx, config=config)
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_master/teuthology/run_tasks.py", line 65, in run_one_task
    return task(**kwargs)
  File "/home/teuthworker/src/github.com_ceph_ceph_master/qa/tasks/workunit.py", line 123, in task
    timeout=timeout,cleanup=cleanup)
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_master/teuthology/parallel.py", line 85, in __exit__
    for result in self:
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_master/teuthology/parallel.py", line 99, in next
    resurrect_traceback(result)
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_master/teuthology/parallel.py", line 22, in capture_traceback
    return func(*args, **kwargs)
  File "/home/teuthworker/src/github.com_ceph_ceph_master/qa/tasks/workunit.py", line 409, in _run_tests
    label="workunit test {workunit}".format(workunit=workunit)
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_master/teuthology/orchestra/remote.py", line 205, in run
    r = self._runner(client=self.ssh, name=self.shortname, **kwargs)
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_master/teuthology/orchestra/run.py", line 437, in run
    r.wait()
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_master/teuthology/orchestra/run.py", line 162, in wait
    self._raise_for_status()
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_master/teuthology/orchestra/run.py", line 184, in _raise_for_status
    node=self.hostname, label=self.label
CommandFailedError: Command failed (workunit test libcephfs/test.sh) on smithi026 with status 141: 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=e5b766ed46c976b9ad65d1ddccb3a3c03cb4823e TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin CEPH_BASE=/home/ubuntu/cephtest/clone.client.0 CEPH_ROOT=/home/ubuntu/cephtest/clone.client.0 adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/clone.client.0/qa/workunits/libcephfs/test.sh' 

From: /ceph/teuthology-archive/pdonnell-2019-09-24_08:40:13-fs-master-distro-basic-smithi/4330830/teuthology.log

also a similar glibc corruption found:

2019-09-22T21:30:49.800 INFO:tasks.workunit.client.0.smithi026.stdout:[ RUN      ] LibCephFS.ShutdownRace
2019-09-22T21:30:52.068 INFO:teuthology.orchestra.run.smithi026:Running:
2019-09-22T21:30:52.068 INFO:teuthology.orchestra.run.smithi026:> sudo logrotate /etc/logrotate.d/ceph-test.conf
2019-09-22T21:30:52.089 INFO:teuthology.orchestra.run.smithi096:Running:
2019-09-22T21:30:52.089 INFO:teuthology.orchestra.run.smithi096:> sudo logrotate /etc/logrotate.d/ceph-test.conf
2019-09-22T21:31:14.724 INFO:tasks.workunit.client.0.smithi026.stderr:*** Error in `ceph_test_libcephfs': corrupted size vs. prev_size: 0x00007f440cacc230 ***
2019-09-22T21:31:14.733 INFO:tasks.workunit.client.0.smithi026.stderr:======= Backtrace: =========
2019-09-22T21:31:14.733 INFO:tasks.workunit.client.0.smithi026.stderr:/lib64/libc.so.6(+0x80fc7)[0x7f44301bdfc7]
2019-09-22T21:31:14.734 INFO:tasks.workunit.client.0.smithi026.stderr:/lib64/libc.so.6(+0x8159e)[0x7f44301be59e]
2019-09-22T21:31:14.734 INFO:tasks.workunit.client.0.smithi026.stderr:/lib64/libcrypto.so.10(CRYPTO_free+0x1d)[0x7f44315f480d]
2019-09-22T21:31:14.734 INFO:tasks.workunit.client.0.smithi026.stderr:/lib64/libcrypto.so.10(lh_free+0x57)[0x7f44316ac637]
2019-09-22T21:31:14.734 INFO:tasks.workunit.client.0.smithi026.stderr:/lib64/libcrypto.so.10(+0x128c60)[0x7f44316aec60]
2019-09-22T21:31:14.735 INFO:tasks.workunit.client.0.smithi026.stderr:/usr/lib64/ceph/libceph-common.so.0(_ZN4ceph6crypto8shutdownEb+0x91)[0x7f4432f89891]
2019-09-22T21:31:14.736 INFO:tasks.workunit.client.0.smithi026.stderr:/usr/lib64/ceph/libceph-common.so.0(_ZN11CephContextD1Ev+0x25f)[0x7f4432f8121f]
2019-09-22T21:31:14.736 INFO:tasks.workunit.client.0.smithi026.stderr:/usr/lib64/ceph/libceph-common.so.0(_ZN11CephContext3putEv+0x19)[0x7f4432f81419]
2019-09-22T21:31:14.736 INFO:tasks.workunit.client.0.smithi026.stderr:/lib64/libcephfs.so.2(ceph_shutdown+0xd1)[0x7f44328f7031]
2019-09-22T21:31:14.736 INFO:tasks.workunit.client.0.smithi026.stderr:ceph_test_libcephfs(+0x73946)[0x55c72bf5b946]
2019-09-22T21:31:14.736 INFO:tasks.workunit.client.0.smithi026.stderr:/usr/lib64/ceph/libceph-common.so.0(+0x81643f)[0x7f443341543f]
2019-09-22T21:31:14.736 INFO:tasks.workunit.client.0.smithi026.stderr:/lib64/libpthread.so.0(+0x7ea5)[0x7f44326b9ea5]
2019-09-22T21:31:14.736 INFO:tasks.workunit.client.0.smithi026.stderr:/lib64/libc.so.6(clone+0x6d)[0x7f443023b8cd]

From: /ceph/teuthology-archive/pdonnell-2019-09-22_10:05:30-fs-wip-pdonnell-testing-20190922.042910-distro-basic-smithi/4326224/teuthology.log


Files

shutdownrace-valgrind.out (75 KB) shutdownrace-valgrind.out Jeff Layton, 09/24/2019 12:19 PM
tsan.out.gz (46.6 KB) tsan.out.gz TSAN output Jeff Layton, 09/30/2019 06:26 PM

Related issues 4 (0 open4 closed)

Related to Messengers - Bug #43070: CephxSessionHandler::_calc_signature segvResolvedRadoslaw Zarzynski

Actions
Has duplicate Messengers - Bug #40026: segfaults in ProtocolV2::write_message()DuplicateBrad Hubbard

Actions
Has duplicate Ceph - Bug #45523: mgr and mon crashedDuplicateBrad Hubbard

Actions
Copied to Ceph - Backport #42856: nautilus: test: LibCephFS.ShutdownRace segfaults (msgr v2 related part)ResolvedRadoslaw ZarzynskiActions
Actions

Also available in: Atom PDF