Bug #65658: mds: MetricAggregator::ms_can_fast_dispatch2 acquires locks - CephFS - Ceph

Bug #65658

There was a lot of discussion surrounding this in 

 https://github.com/ceph/ceph/pull/26004/ 

 but circling back we have since seen evidence this is causing significant problems: after a long up:replay recovery, the MDS can be flooded with metrics messages by clients and the lock contention in fast_dispatch is preventing the MDS from sending beacons to the monitors. This then leads to undesirable MDS failovers. 

 We should convert this to using regular dispatch (and optimize later if needed).

Back

Project

General

Profile

Ceph » CephFS

Bug #65658