Feature #40640
closedNetwork ping monitoring
0%
Description
The simplest version of this would be to see warnings if heartbeat ping response time exceeds certain thresholds.
Updated by David Zafman almost 5 years ago
See also https://pad.ceph.com/p/Network_ping_monitoring
Examples, with warning threshold set to 1 microsecond.
Summary status example
SLOW_PING_TIME_BACK Long heartbeat ping times on back interface seen, longest is 1488 msec, SLOW_PING_TIME_FRONT Long heartbeat ping times on front interface seen, longest is 1805 msec
Detail status example
SLOW_PING_TIME_BACK Long heartbeat ping times on back interface seen, longest is 1488 msec
Slow heartbeat ping on back interface from osd.1 to osd.2 1488 msec
Slow heartbeat ping on back interface from osd.0 to osd.2 1412 msec
Slow heartbeat ping on back interface from osd.2 to osd.1 1364 msec
Slow heartbeat ping on back interface from osd.2 to osd.0 1346 msec
Slow heartbeat ping on back interface from osd.0 to osd.1 1310 msec
Truncated long network list. Use ceph daemon osd.# dump_network for more information
SLOW_PING_TIME_FRONT Long heartbeat ping times on front interface seen, longest is 1805 msec
Slow heartbeat ping on front interface from osd.2 to osd.1 1805 msec
Slow heartbeat ping on front interface from osd.0 to osd.2 1648 msec
Slow heartbeat ping on front interface from osd.1 to osd.2 1495 msec
Slow heartbeat ping on front interface from osd.2 to osd.0 1461 msec
Slow heartbeat ping on front interface from osd.1 to osd.0 1448 msec
Truncated long network list. Use ceph daemon osd.# dump_network for more information
Updated by Neha Ojha over 4 years ago
- Status changed from New to Fix Under Review
Updated by David Zafman over 4 years ago
- Copied to Feature #41563: Add connection reset tracking to Network ping monitoring added
Updated by David Zafman over 4 years ago
- Status changed from Fix Under Review to Pending Backport
- Backport set to luminous, mimic, nautilus
Updated by David Zafman over 4 years ago
- Related to Bug #41689: Network ping test fails in TEST_network_ping_test2 added
Updated by Nathan Cutler over 4 years ago
- Copied to Backport #41695: nautilus: Network ping monitoring added
Updated by Nathan Cutler over 4 years ago
- Copied to Backport #41696: mimic: Network ping monitoring added
Updated by Nathan Cutler over 4 years ago
- Copied to Backport #41697: luminous: Network ping monitoring added
Updated by David Zafman over 4 years ago
- Related to Bug #41743: Long heartbeat ping times on front interface seen, longest is 2237.999 msec (OSD_SLOW_PING_TIME_FRONT) added
Updated by David Zafman over 4 years ago
- Related to Bug #42570: mgr: qa: upgrade mimic-master "src/osd/osd_types.h: 2313: FAILED ceph_assert(pos <= end)" added
Updated by David Zafman over 4 years ago
- Status changed from Pending Backport to Resolved