Project

General

Profile

Actions

Bug #4595

closed

lockdep error in rados_aio_flush_async()

Added by Sage Weil about 11 years ago. Updated about 11 years ago.

Status:
Resolved
Priority:
Immediate
Assignee:
Category:
librados
Target version:
-
% Done:

0%

Source:
Q/A
Tags:
Backport:
Regression:
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

2013-03-30T02:22:57.575 INFO:teuthology.task.workunit.client.0.err:common/lockdep.cc: In function 'int lockdep_will_lock(const char*, int)' thread 7fe4ddee5780 time 2013-03-30 02:22:47.152870
2013-03-30T02:22:57.575 INFO:teuthology.task.workunit.client.0.err:common/lockdep.cc: 212: FAILED assert(0)
2013-03-30T02:22:57.576 INFO:teuthology.task.workunit.client.0.err: ceph version 0.59-426-g95c4a81 (95c4a81be1af193786d0483fcbe81104d3da7c40)
2013-03-30T02:22:57.576 INFO:teuthology.task.workunit.client.0.err: 1: (lockdep_will_lock(char const*, int)+0xcea) [0x7fe4dd2558da]
2013-03-30T02:22:57.576 INFO:teuthology.task.workunit.client.0.err: 2: (Mutex::Lock(bool)+0x104) [0x7fe4dd0faac4]
2013-03-30T02:22:57.576 INFO:teuthology.task.workunit.client.0.err: 3: (librados::IoCtxImpl::flush_aio_writes_async(librados::AioCompletionImpl*)+0xea) [0x7fe4dd0a0f4a]
2013-03-30T02:22:57.576 INFO:teuthology.task.workunit.client.0.err: 4: (rados_aio_flush_async()+0x9) [0x7fe4dd080279]
2013-03-30T02:22:57.576 INFO:teuthology.task.workunit.client.0.err: 5: (LibRadosAio_FlushAsync_Test::TestBody()+0x565) [0x429fe5]
2013-03-30T02:22:57.576 INFO:teuthology.task.workunit.client.0.err: 6: (testing::Test::Run()+0xaa) [0x45479a]
2013-03-30T02:22:57.576 INFO:teuthology.task.workunit.client.0.err: 7: (testing::internal::TestInfoImpl::Run()+0x100) [0x4548a0]
2013-03-30T02:22:57.576 INFO:teuthology.task.workunit.client.0.err: 8: (testing::TestCase::Run()+0xbd) [0x45496d]
2013-03-30T02:22:57.577 INFO:teuthology.task.workunit.client.0.err: 9: (testing::internal::UnitTestImpl::RunAllTests()+0x217) [0x454bd7]
2013-03-30T02:22:57.577 INFO:teuthology.task.workunit.client.0.err: 10: (main()+0x35) [0x41ece5]
2013-03-30T02:22:57.577 INFO:teuthology.task.workunit.client.0.err: 11: (__libc_start_main()+0xed) [0x7fe4dc4a076d]
2013-03-30T02:22:57.577 INFO:teuthology.task.workunit.client.0.err: 12: ceph_test_rados_api_aio() [0x41ed41]
2013-03-30T02:22:57.577 INFO:teuthology.task.workunit.client.0.err: NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
2013-03-30T02:22:57.577 INFO:teuthology.task.workunit.client.0.err:terminate called after throwing an instance of 'ceph::FailedAssertion'
2013-03-30T02:22:57.625 INFO:teuthology.task.workunit.client.0.err:Aborted (core dumped)
2013-03-30T02:22:57.626 INFO:teuthology.task.workunit:Stopping rados/test.sh on client.0...
2013-03-30T02:22:57.626 DEBUG:teuthology.orchestra.run:Running [10.214.132.8]: 'rm -rf -- /home/ubuntu/cephtest/workunits.list /home/ubuntu/cephtest/workunit.client.0'

job was
ubuntu@teuthology:/a/teuthology-2013-03-30_01:00:05-rados-next-testing-basic/6309$ cat orig.config.yaml 
kernel:
  kdb: true
  sha1: c658410c172bcd72d4c08ca034901c180278cd49
nuke-on-error: true
overrides:
  ceph:
    conf:
      global:
        lockdep: true
        ms inject socket failures: 5000
      osd:
        osd op thread timeout: 60
    fs: btrfs
    log-whitelist:
    - slow request
    sha1: 95c4a81be1af193786d0483fcbe81104d3da7c40
  s3tests:
    branch: next
  workunit:
    sha1: 95c4a81be1af193786d0483fcbe81104d3da7c40
roles:
- - mon.a
  - mon.c
  - osd.0
  - osd.1
  - osd.2
- - mon.b
  - mds.a
  - osd.3
  - osd.4
  - osd.5
  - client.0
tasks:
- chef: null
- clock: null
- install: null
- ceph: null
- ceph-fuse: null
- workunit:
    clients:
      client.0:
      - rados/test.sh
Actions #1

Updated by Josh Durgin about 11 years ago

  • Status changed from 12 to Fix Under Review

This is a false positive since lockdep wasn't tracking individual instances of locks. Different AioCompletion locks were treated as the same by lockdep, hence the error. A fix to improve this for lockdep is in wip-4595.

Actions #2

Updated by Josh Durgin about 11 years ago

  • Status changed from Fix Under Review to Resolved
Actions

Also available in: Atom PDF