Bug #10765

closed

LibRadosWatchNotify.WatchNotify2Timeout segfault

Added by Samuel Just about 9 years ago. Updated about 9 years ago.

Status: Resolved
Priority: Urgent
Assignee:
Category: -
Target version: -
% Done: 0%
Source: Q/A
Tags:
Backport:
Regression:
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

2015-02-05T02:18:33.620 INFO:tasks.workunit.client.0.mira002.stdout:watch_notify2_test_cb from 4758 notify_id 3728031612928 cookie 40099104
2015-02-05T02:18:33.622 INFO:teuthology.orchestra.run.mira002:Running: 'adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage ceph osd reweight 1 1'
2015-02-05T02:18:33.664 INFO:tasks.workunit.client.0.mira002.stderr:Segmentation fault (core dumped)
2015-02-05T02:18:33.666 INFO:tasks.workunit:Stopping ['rados/test.sh'] on client.0...
2015-02-05T02:18:33.666 INFO:teuthology.orchestra.run.mira002:Running: 'rm rf - /home/ubuntu/cephtest/workunits.list /home/ubuntu/cephtest/workunit.client.0'
2015-02-05T02:18:33.681 ERROR:teuthology.parallel:Exception in parallel execution
ubuntu@teuthology:/a/samuelj-2015-02-04_22:13:07-rados-wip-sam-working-wip-testing-vanilla-fixes-basic-multi/740170

Actions #1

Updated by Sage Weil about 9 years ago

  • Assignee set to Sage Weil
  • Source changed from other to Q/A
Actions #2

Updated by Sage Weil about 9 years ago

  • Status changed from New to In Progress

I think the problem is that _linger_commit and friends are called as normal completions of the register event, which happens under the completion_lock but not under any of the session lock stuff. These functions, and any fields they modify, probably need to go under LingerOp::watch_lock.
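
To illustrate the kind of fix suggested here, the following is a minimal sketch of guarding a linger op's state with a per-op mutex, so that a completion callback and a session-side path cannot modify the same fields at once. It is not the Objecter code: `LingerOpSketch`, `linger_commit_sketch`, `linger_reset_sketch`, `run_sketch`, and all field names other than `watch_lock` are hypothetical stand-ins chosen for illustration.

```cpp
#include <cassert>
#include <cstdint>
#include <mutex>
#include <thread>
#include <vector>

// Hypothetical stand-in for a linger (watch) op; NOT Ceph's real LingerOp.
struct LingerOpSketch {
  std::mutex watch_lock;           // assumption: guards every field below
  bool registered = false;         // hypothetical field
  int last_error = 0;              // hypothetical field
  std::uint64_t commits_seen = 0;  // hypothetical field
};

// Completion-style callback: may run on whichever thread finishes the
// register event, so it takes watch_lock before touching any field.
void linger_commit_sketch(LingerOpSketch& op, int result) {
  std::lock_guard<std::mutex> guard(op.watch_lock);
  op.registered = (result == 0);
  op.last_error = result;
  ++op.commits_seen;
}

// Session-side path (for example a reconnect) that touches the same fields
// and therefore takes the same lock.
void linger_reset_sketch(LingerOpSketch& op) {
  std::lock_guard<std::mutex> guard(op.watch_lock);
  op.registered = false;
}

// Runs both paths concurrently from several threads and returns the number
// of commits recorded once all threads have finished.
std::uint64_t run_sketch(int threads, int iterations) {
  LingerOpSketch op;
  std::vector<std::thread> workers;
  for (int t = 0; t < threads; ++t) {
    workers.emplace_back([&op, iterations] {
      for (int i = 0; i < iterations; ++i) {
        linger_commit_sketch(op, 0);
        linger_reset_sketch(op);
      }
    });
  }
  for (auto& w : workers) {
    w.join();
  }
  std::lock_guard<std::mutex> guard(op.watch_lock);
  return op.commits_seen;
}
```

Because every reader and writer takes the same `watch_lock`, the counter returned by `run_sketch` always equals `threads * iterations`; without the lock, the increments and flag updates would race.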

Actions #3

Updated by Sage Weil about 9 years ago

  • Status changed from In Progress to Fix Under Review
Actions #4

Updated by Sage Weil about 9 years ago

  • Status changed from Fix Under Review to Resolved