Project

General

Profile

Actions

Bug #4385

closed

mds: refusing connections with high open socket count

Added by Noah Watkins about 11 years ago. Updated about 11 years ago.

Status:
Resolved
Priority:
High
Assignee:
Category:
-
Target version:
-
% Done:

0%

Source:
Development
Tags:
Backport:
Regression:
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS):
Labels (FS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

My MDS has become unresponsive after a long period of map-reduce jobs. The MDS process is idle, but is eating up 16 GB of virt memory. It also has over 8000 open socket file descriptors.

New clients have a lot of connection refused messages:

2013-03-07 18:00:11.169075 7f0080131700 10 -- 192.168.141.127:0/3067547546 >> 192.168.141.144:6800/8274 pipe(0x7f008437bb70 sd=63 :0 s=1 pgs=0 cs=0 l=0).connecting to 192.168.141.144:6800/8274
2013-03-07 18:00:11.169166 7f0080131700  2 -- 192.168.141.127:0/3067547546 >> 192.168.141.144:6800/8274 pipe(0x7f008437bb70 sd=63 :0 s=1 pgs=0 cs=0 l=0).connect error 192.168.141.144:6800/8274, 111: Connection refused
2013-03-07 18:00:11.169193 7f0080131700  2 -- 192.168.141.127:0/3067547546 >> 192.168.141.144:6800/8274 pipe(0x7f008437bb70 sd=63 :0 s=1 pgs=0 cs=0 l=0).fault 111: Connection refused
2013-03-07 18:00:11.169205 7f0080131700 10 -- 192.168.141.127:0/3067547546 >> 192.168.141.144:6800/8274 pipe(0x7f008437bb70 sd=63 :0 s=1 pgs=0 cs=0 l=0).fault waiting 15.000000

Related issues 1 (0 open1 closed)

Related to CephFS - Fix #3630: mds: broken closed connection cleanupResolvedSage Weil12/16/2012

Actions
Actions

Also available in: Atom PDF