Feature #709

closed

nagios plugin

Added by Sage Weil over 13 years ago. Updated almost 13 years ago.

Status: Resolved
Priority: Normal
Category: -
Target version: -
% Done: 0%

Source:
Tags:
Backport:
Reviewed:
Affected Versions:
Pull request ID:

Description

Make a generic nagios plugin (for widespread use) that uses 'ceph health' to check status.

It should let you configure multiple monitors, and pass them to the ceph util so that it behaves when some monitors are down.

See #430

(Separately, we need to switch to this new version internally, and make sure our nagios config generation does the right thing.)
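To make the request concrete, here is a minimal sketch of what such a plugin could look like. It is not the plugin that came out of this ticket. It assumes that `ceph -m` accepts a comma-separated monitor list, that `--keyring` points at the key file, and that `ceph health` prints a line starting with HEALTH_OK, HEALTH_WARN, or HEALTH_ERR. The function names and option names are made up for illustration; only the Nagios exit codes (0, 1, 2, 3) are standard.

```python
#!/usr/bin/env python3
"""Sketch of a Nagios check that wraps `ceph health` (illustrative only)."""
import argparse
import subprocess
import sys

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3

# Assumed mapping from the leading token of `ceph health` output.
HEALTH_TO_NAGIOS = {
    "HEALTH_OK": OK,
    "HEALTH_WARN": WARNING,
    "HEALTH_ERR": CRITICAL,
}


def build_command(monitors, keyring=None, ceph_bin="ceph"):
    """Build the argv for `ceph health`, passing every monitor after -m.

    Assumption: the ceph tool tries the listed monitors in turn, so the
    check keeps working while some of them are down.
    """
    cmd = [ceph_bin]
    if monitors:
        cmd += ["-m", ",".join(monitors)]
    if keyring:
        cmd += ["--keyring", keyring]
    return cmd + ["health"]


def parse_health(output):
    """Map `ceph health` output to (exit_code, one-line message)."""
    lines = output.strip().splitlines()
    line = lines[0] if lines else ""
    words = line.split()
    token = words[0] if words else ""
    # Anything we do not recognize is reported as UNKNOWN, not OK.
    code = HEALTH_TO_NAGIOS.get(token, UNKNOWN)
    return code, line or "no output from ceph health"


def main(argv=None):
    parser = argparse.ArgumentParser(description=__doc__)
    parser.add_argument("-m", "--monitor", action="append", default=[],
                        help="monitor address; repeat for each monitor")
    parser.add_argument("-k", "--keyring", help="path to the keyring file")
    parser.add_argument("-t", "--timeout", type=int, default=10,
                        help="seconds to wait for ceph before giving up")
    args = parser.parse_args(argv)
    try:
        proc = subprocess.run(build_command(args.monitor, args.keyring),
                              capture_output=True, text=True,
                              timeout=args.timeout)
    except (OSError, subprocess.TimeoutExpired) as exc:
        print(f"UNKNOWN: could not run ceph health: {exc}")
        return UNKNOWN
    code, message = parse_health(proc.stdout)
    print(message)
    return code


if __name__ == "__main__":
    sys.exit(main())
```

Saved under a hypothetical name such as `check_ceph_health`, it could be invoked as `check_ceph_health -m 10.0.0.1 -m 10.0.0.2 -k /etc/ceph/keyring`. The timeout matters because a check that hangs when every monitor is unreachable would never report the failure.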

Actions #1

Updated by Dallas Kashuba about 13 years ago

  • Status changed from New to In Progress

I've started on this. I think it will look something like this...

A new "cephmonitor" service entry on a dummy machine object with an IP of 127.0.0.1. The cephmonitor service will have params for all of the monitor IPs and the secret key. The netsaint service will be updated to understand how to monitor the cephmonitor service. A new nagios plugin will be developed to use "ceph health" directly on a nagios machine. Nagios machines need to have a dependency added for the ceph debian package.
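As a rough illustration of the config-generation side, the sketch below renders a stock Nagios service object for a logical cluster host. It uses plain Nagios object syntax, not the internal netsaint service described above. The host name, the `generic-service` template, the `check_ceph_health` command name, and the choice to pass a keyring path rather than the secret itself are all assumptions made for the example.

```python
def render_service(host_name, monitors, keyring, plugin="check_ceph_health"):
    """Render one Nagios service object; every name here is a placeholder."""
    # Nagios separates check_command arguments with "!"; they arrive in the
    # command definition as $ARG1$ (monitor list) and $ARG2$ (keyring path).
    args = "!".join([",".join(monitors), keyring])
    lines = [
        "define service {",
        "    use                 generic-service",
        f"    host_name           {host_name}",
        "    service_description Ceph health",
        f"    check_command       {plugin}!{args}",
        "}",
    ]
    return "\n".join(lines)


if __name__ == "__main__":
    print(render_service("ceph-cluster",
                         ["10.0.0.1", "10.0.0.2", "10.0.0.3"],
                         "/etc/ceph/keyring"))
```

A matching `define command` entry would still be needed to map `$ARG1$` and `$ARG2$` onto the plugin's options.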

Actions #2

Updated by Sage Weil about 13 years ago

FWIW, I've been creating a 'logical cluster' machine in the dh db; see ceph-playground-fs, for example. The point there was to have something for the mount mapping (for a home_dir of ceph-playground-fs:/some/path), but it should work here too.

Actions #3

Updated by Dallas Kashuba about 13 years ago

The nagios machines are still etch so I think I'm going to push to get them updated to lenny before proceeding with that part of this. I should be able to get a preliminary version of the nagios plugin put together before that, though.

Actions #4

Updated by Dallas Kashuba about 13 years ago

The issue for the nagios machine upgrade: https://dev.newdream.net/issues/8978

Actions #5

Updated by Sage Weil almost 13 years ago

  • Status changed from In Progress to Resolved