Bug #1111

closed

file lock requests in wait queue not getting cleaned up after process exit

Added by Brian Chrisman almost 13 years ago. Updated over 7 years ago.

Status: Resolved
Priority: Normal
Assignee:
Category: -
Target version: -
% Done: 0%

Description

Our interpretation of events:
1) proc1 requests lock
2) proc1 receives lock
3) proc2 requests lock
4) proc2 gets on wait list
5) proc2 exits
6) proc1 releases lock
7) the MDS assigns the lock to proc2 even though that process is gone
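
The sequence above can be replayed against a small in-memory model. The following Python sketch is purely illustrative: `ToyLockState`, `replay`, and the `drop_waiters_on_exit` flag are hypothetical names, and the model does not mirror the actual MDS or kernel client code. It only shows why a queued request that outlives its process ends up holding the lock, and how dropping the request on exit would avoid that.

```python
from collections import deque


class ToyLockState:
    """Toy model of one file's exclusive lock plus its FIFO wait queue.

    Hypothetical illustration only; not the real MDS data structure.
    """

    def __init__(self, drop_waiters_on_exit):
        self.holder = None          # process currently holding the lock
        self.waiters = deque()      # processes queued for the lock
        self.drop_waiters_on_exit = drop_waiters_on_exit

    def request(self, proc):
        """Grant the lock if it is free, otherwise queue the request."""
        if self.holder is None:
            self.holder = proc
            return True
        self.waiters.append(proc)
        return False

    def process_exit(self, proc):
        """Called when a process goes away while it may still be queued."""
        if self.drop_waiters_on_exit:
            # Cleanup step: forget queued requests from the dead process.
            self.waiters = deque(w for w in self.waiters if w != proc)

    def release(self, proc):
        """Release the lock and hand it to the next waiter, if any."""
        if self.holder != proc:
            raise ValueError(f"{proc} does not hold the lock")
        self.holder = self.waiters.popleft() if self.waiters else None
        return self.holder


def replay(drop_waiters_on_exit):
    """Replay steps 1-7 from the description and return the final holder."""
    state = ToyLockState(drop_waiters_on_exit)
    state.request("proc1")       # steps 1-2: proc1 requests and receives lock
    state.request("proc2")       # steps 3-4: proc2 requests and is queued
    state.process_exit("proc2")  # step 5: proc2 exits
    state.release("proc1")       # step 6: proc1 releases the lock
    return state.holder          # step 7: who holds the lock now?
```

With `drop_waiters_on_exit=False` the replay ends with proc2 as the holder, which matches step 7; with `True` the lock ends up free.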

I will add our test code here soon


Files

mds.scale-192-168-98-112.log (604 KB), Brian Chrisman, 05/25/2011 11:36 AM
sclockandhold.cpp (2.37 KB), Brian Chrisman, 05/25/2011 12:12 PM
Actions #1

Updated by Brian Chrisman almost 13 years ago

Here are the notes on our bug and the related PIDs:

NODE 1: 192.168.98.112 (client 4113)
- gets the lock and holds for 10 seconds
NODE 2: 192.168.98.113 (client 4120)
- attempts to get the lock and is killed before it gets it

PID 9432 is on NODE1
PID 14644 is on NODE2

console output:

[root@scale-192-168-98-112 ~]# ./createlock /fsscale0/createlock.test 10
0 of 10
Trying to get lock on /fsscale0/createlock.test
Got lock on /fsscale0/createlock.test
Trying to unlock on /fsscale0/createlock.test
Unlocked /fsscale0/createlock.test
1 of 10
Trying to get lock on /fsscale0/createlock.test
Got lock on /fsscale0/createlock.test
Trying to unlock on /fsscale0/createlock.test
Unlocked /fsscale0/createlock.test
2 of 10
Trying to get lock on /fsscale0/createlock.test

[root@scale-192-168-98-113 ~]# ./createlock /fsscale0/createlock.test 10
0 of 10
Trying to get lock on /fsscale0/createlock.test
^C
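
For readers without the attachment, a lock-and-release loop of the kind the console output suggests might look like the Python sketch below. This is an assumption-laden stand-in, not the attached test program: the source of `createlock` is not shown here, so the choice of `flock` (rather than `fcntl` record locks), the function name `lock_loop`, and the separate `hold_seconds` parameter are all hypothetical. Only the message strings are taken from the output above.

```python
import fcntl
import os
import time


def lock_loop(path, iterations, hold_seconds=0.0, log=print):
    """Repeatedly take and release an exclusive advisory lock on path.

    Hypothetical stand-in for the test program; assumes flock semantics.
    """
    fd = os.open(path, os.O_RDWR | os.O_CREAT, 0o644)
    try:
        for i in range(iterations):
            log(f"{i} of {iterations}")
            log(f"Trying to get lock on {path}")
            # Blocks here while another process holds the lock.
            fcntl.flock(fd, fcntl.LOCK_EX)
            log(f"Got lock on {path}")
            time.sleep(hold_seconds)
            log(f"Trying to unlock on {path}")
            fcntl.flock(fd, fcntl.LOCK_UN)
            log(f"Unlocked {path}")
    finally:
        os.close(fd)
```

Running this against the same file from two nodes and interrupting the second one while it blocks in `flock` would approximate the scenario in the notes: the first node later hangs at "Trying to get lock" if the dead waiter is still granted the lock.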

Actions #2

Updated by Brian Chrisman almost 13 years ago

Adding our test program.

Actions #3

Updated by Sage Weil almost 13 years ago

  • Category set to 1
  • Status changed from New to In Progress
  • Assignee set to Sage Weil
  • Target version set to v0.29
Actions #4

Updated by Sage Weil almost 13 years ago

Pushed the kclient fix to the ceph-client.git master branch and the MDS fix to the ceph.git master branch.

Actions #6

Updated by Sage Weil almost 13 years ago

  • Target version changed from v0.29 to v0.30
Actions #8

Updated by Sage Weil almost 13 years ago

  • Status changed from In Progress to Resolved
Actions #10

Updated by John Spray over 7 years ago

  • Project changed from Ceph to CephFS
  • Category deleted (1)
  • Target version deleted (v0.30)

Bulk updating project=ceph category=mds bugs so that I can remove the MDS category from the Ceph project to avoid confusion.
