Project

General

Profile

Bug #18306

segfault in handle_client_caps

Added by John Spray 7 months ago. Updated 3 months ago.

Status:
Resolved
Priority:
Urgent
Assignee:
-
Category:
Correctness/Safety
Target version:
-
Start date:
12/20/2016
Due date:
% Done:

0%

Source:
Tags:
Backport:
jewel, kraken
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Release:
Component(FS):
MDS
Needs Doc:
No

Description

http://pulpito.ceph.com/jspray-2016-12-19_21:05:25-fs-master-distro-basic-smithi/648247

2016-12-19T23:01:22.549 INFO:tasks.ceph.mds.b.smithi027.stderr:*** Caught signal (Segmentation fault) **
2016-12-19T23:01:22.549 INFO:tasks.ceph.mds.b.smithi027.stderr: in thread 7fb83df6f700 thread_name:mds_rank_progr
2016-12-19T23:01:22.551 INFO:tasks.ceph.mds.b.smithi027.stderr: ceph version 11.1.0-6101-g20f936e (20f936e841ef8e60ba5ee5b8c42b1e739845e45d)
2016-12-19T23:01:22.551 INFO:tasks.ceph.mds.b.smithi027.stderr: 1: (()+0x537ada) [0x7fb849558ada]
2016-12-19T23:01:22.552 INFO:tasks.ceph.mds.b.smithi027.stderr: 2: (()+0xf100) [0x7fb847324100]
2016-12-19T23:01:22.552 INFO:tasks.ceph.mds.b.smithi027.stderr: 3: (Locker::handle_client_caps(MClientCaps*)+0x185) [0x7fb849417245]
2016-12-19T23:01:22.552 INFO:tasks.ceph.mds.b.smithi027.stderr: 4: (Locker::dispatch(Message*)+0xa5) [0x7fb849419a55]
2016-12-19T23:01:22.552 INFO:tasks.ceph.mds.b.smithi027.stderr: 5: (MDSRank::handle_deferrable_message(Message*)+0xbb4) [0x7fb849292e54]
2016-12-19T23:01:22.552 INFO:tasks.ceph.mds.b.smithi027.stderr: 6: (MDSRank::_dispatch(Message*, bool)+0x207) [0x7fb84929c067]
2016-12-19T23:01:22.552 INFO:tasks.ceph.mds.b.smithi027.stderr: 7: (MDSRank::retry_dispatch(Message*)+0x12) [0x7fb84929d1e2]
2016-12-19T23:01:22.552 INFO:tasks.ceph.mds.b.smithi027.stderr: 8: (MDSInternalContextBase::complete(int)+0x1eb) [0x7fb8494e5cab]
2016-12-19T23:01:22.552 INFO:tasks.ceph.mds.b.smithi027.stderr: 9: (MDSRank::_advance_queues()+0x4a5) [0x7fb84929b875]
2016-12-19T23:01:22.552 INFO:tasks.ceph.mds.b.smithi027.stderr: 10: (MDSRank::ProgressThread::entry()+0x4a) [0x7fb84929bcea]
2016-12-19T23:01:22.552 INFO:tasks.ceph.mds.b.smithi027.stderr: 11: (()+0x7dc5) [0x7fb84731cdc5]
2016-12-19T23:01:22.553 INFO:tasks.ceph.mds.b.smithi027.stderr: 12: (clone()+0x6d) [0x7fb846407ced]
2016-12-19T23:01:22.553 INFO:tasks.ceph.mds.b.smithi027.stderr:2016-12-19 23:01:22.566959 7fb83df6f700 -1 *** Caught signal (Segmentation fault) **
2016-12-19T23:01:22.553 INFO:tasks.ceph.mds.b.smithi027.stderr: in thread 7fb83df6f700 thread_name:mds_rank_progr

Related issues

Copied to fs - Backport #18615: jewel: segfault in handle_client_caps Resolved
Copied to fs - Backport #18616: kraken: segfault in handle_client_caps Resolved

History

#1 Updated by Zheng Yan 7 months ago

http://qa-proxy.ceph.com/teuthology/zyan-2017-01-05_11:43:04-fs-wip-zyan-testing-testing-basic-mira/692034/teuthology.log

Following sequence events cause the crash

. Locker::handle_client_caps found inode's parent dirfrag is freezing/frozen, Add the message to unfreeze waiter
. client close session
. MDSDaemon::ms_handle_reset reset connection's priv
. dirfrag unfreeze, retry cap message. Locker::handle_client_caps dereference null pointer session

#2 Updated by Zheng Yan 7 months ago

  • Status changed from New to Verified

#3 Updated by Zheng Yan 7 months ago

  • Status changed from Verified to Need Review

#4 Updated by John Spray 6 months ago

  • Status changed from Need Review to Pending Backport
  • Tags set to jewel kraken

#5 Updated by Nathan Cutler 6 months ago

  • Tags deleted (jewel kraken)
  • Backport set to jewel, kraken

#6 Updated by Nathan Cutler 6 months ago

#7 Updated by Nathan Cutler 6 months ago

#8 Updated by Nathan Cutler 3 months ago

  • Status changed from Pending Backport to Resolved

Also available in: Atom PDF