Project

General

Profile

Feature #63945

Updated by Jos Collin 4 months ago

https://jsw.ibm.com/browse/ISCE-49: 

 Introduce metrics that will be consumed by the OCP/ODF Dashboard to provide monitoring of Geo Replication in the OCP and ACM dashboard and elsewhere. This would generate the progress of cephfs_mirror syncing and thus provide the monitoring capability. 

 Metrics should enable monitoring logic to generate the following alerts: 

 * Secondary cluster disconnected 
 * Replication started/ended 
 * Resync started/ended 
 * Promotion/Demotion event (= failover or fallback initiated) 
 * Snapshot transfer failed or interrupted. 
 * Failed to complete the snapshot transfer before the next scheduled transfer. 
 * Replication status Monitoring: Alerts on policy non-compliance. 

 This would provide information like replication status, replication schedules, resync status - per volumes belonging to a storage class, namespace, label via OCP/ODF dashboards. 

Back