Feature #63945
Updated by Jos Collin 4 months ago
https://jsw.ibm.com/browse/ISCE-49: Introduce metrics to be consumed by the OCP/ODF Dashboard, so that Geo Replication can be monitored in the OCP and ACM dashboards and elsewhere. The metrics would report the progress of cephfs_mirror syncing and thus provide the monitoring capability.

Metrics should enable monitoring logic to generate the following alerts:

* Secondary cluster disconnected
* Replication started/ended
* Resync started/ended
* Promotion/Demotion event (= failover or failback initiated)
* Snapshot transfer failed or interrupted
* Failed to complete the snapshot transfer before the next scheduled transfer
* Replication status monitoring: alerts on policy non-compliance. This would provide information such as replication status, replication schedules, and resync status per volume belonging to a storage class, namespace, or label, via the OCP/ODF dashboards.
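
To illustrate how monitoring logic might turn such metrics into some of the alerts listed above, here is a minimal sketch. It is not the cephfs_mirror implementation: the `MirrorSample` fields, the `derive_alerts` helper, and the alert names are all hypothetical, chosen only to show the shape of the evaluation for three of the alerts (secondary disconnected, transfer failed, transfer not finished before the next scheduled one).

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class MirrorSample:
    """One scrape of per-directory mirror state.

    Every field name here is an assumption made for illustration,
    not an actual cephfs_mirror metric name.
    """
    peer_connected: bool               # is the secondary cluster reachable?
    sync_started_at: Optional[float]   # epoch seconds; None if no transfer yet
    sync_ended_at: Optional[float]     # epoch seconds; None while still running
    sync_failed: bool                  # last transfer failed or was interrupted
    schedule_interval_s: float         # seconds between scheduled snapshots


def derive_alerts(sample: MirrorSample, now: float) -> List[str]:
    """Return the (hypothetical) alert names that apply to this sample."""
    alerts: List[str] = []
    if not sample.peer_connected:
        alerts.append("SecondaryClusterDisconnected")
    if sample.sync_failed:
        alerts.append("SnapshotTransferFailed")
    if sample.sync_started_at is not None:
        # For a running transfer, measure elapsed time up to "now".
        finished = sample.sync_ended_at if sample.sync_ended_at is not None else now
        if finished - sample.sync_started_at > sample.schedule_interval_s:
            # The transfer took (or is taking) longer than one schedule period.
            alerts.append("SnapshotTransferOverran")
    return alerts
```

In a real deployment these conditions would more likely be written as alerting rules over the exported metrics; the function above only makes the intended comparisons explicit.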