Project

General

Profile

Actions

Bug #17275

closed

MDS long-time blocked ops. ceph-fuse locks up with getattr of file

Added by Henrik Korkuc over 7 years ago. Updated over 7 years ago.

Status:
Resolved
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
other
Tags:
Backport:
jewel
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS):
MDS, ceph-fuse
Labels (FS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

In 10.2.1 cluster we are having some blocked ceph-fuse (tested with 10.2.2) metadata accesses to files.

I cannot reproduce the issue but it happens from time to time.

mds dump_blocked_ops show some amount of long-running ops. Logs show some amount of slow ops, e.g. " 6 slow requests, 1 included below; oldest blocked for > 83360.920878 secs". Attaching some log files.
ceph-mds_log with debug_mds* set to 20/20 and greped with "10000328fca\|client.7397637"
dump_blocked_ops
mds_cache.gz

Problem is not Jewel specific, had similar problems on hammer cluster/client too


Files

ceph-mds_log (18.1 KB) ceph-mds_log Henrik Korkuc, 09/14/2016 12:53 PM
dump_blocked_ops (6.88 KB) dump_blocked_ops Henrik Korkuc, 09/14/2016 12:54 PM
mds_cache.xz (553 KB) mds_cache.xz Henrik Korkuc, 09/14/2016 12:55 PM
dump_cache (13.2 KB) dump_cache Henrik Korkuc, 09/14/2016 02:36 PM
mds_requests (592 Bytes) mds_requests Henrik Korkuc, 09/14/2016 02:36 PM
objecter_requests (131 Bytes) objecter_requests Henrik Korkuc, 09/14/2016 02:36 PM
dump_cache_2816210 (20.1 KB) dump_cache_2816210 Henrik Korkuc, 09/15/2016 06:40 AM
gdb.txt (594 KB) gdb.txt Henrik Korkuc, 09/16/2016 07:40 AM
client.27841966_gdb.txt (118 KB) client.27841966_gdb.txt Henrik Korkuc, 09/26/2016 02:50 PM
client.27841966_objecter_requests (3.22 KB) client.27841966_objecter_requests Henrik Korkuc, 09/26/2016 02:50 PM
mds-dump_blocked_ops (318 KB) mds-dump_blocked_ops Henrik Korkuc, 09/26/2016 02:51 PM

Related issues 1 (0 open1 closed)

Copied to CephFS - Backport #17697: jewel: MDS long-time blocked ops. ceph-fuse locks up with getattr of fileResolvedLoïc DacharyActions
Actions

Also available in: Atom PDF