Help reviewing current configuration

Alex mysqlstudent at gmail.com
Wed May 8 06:16:36 CEST 2013

Previous message: Variables for determining time before first alert
Next message: High Availabilty with Nagios
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

Hi all,

I've worked hours and hours on trying to get these notifications
working properly and I'm still confused. I really hoped someone could
review my configuration to determine where I might be doing something
wrong, and provide any suggestions for improvement. I have a handful
of critical servers that I'd like to be notified by within two or
three minutes of there being a problem, such as the web server having
been stopped.

It's my understand that I first define this as a service, while in
parallel define varying types of escalations - a regular escalation
and a critical escalation. Both of these rely on checking the host for
service issues max_check_attempts times, and when that number is
reached, an event trigger occurs and the service is escalated. Perhaps
I don't understand the link between how a service is in a failed state
and a serviceescalation directive is initiated. How are they connected
or linked?

If I have the first notification for an escalated service set to 1,
and the last set to 10, with a notification_interval set to 2, there
would effectively be two minutes between each notification, correct?

I've included a few macros that I've built below, along with the
services to which they belong, and hoped someone could review them to
see where I might be going wrong.

With the configuration below, it seems I'm only notified at the 10m
mark, which I guess is the notification_interval set in the
standard-service-24x7 macro. The service never seems to escalate to
critical-services, where I should receive an alert after every second
check of the host.

define service {
        name                            standard-service-24x7
        active_checks_enabled           1
        passive_checks_enabled          1
        parallelize_check               1
        obsess_over_service             1
        check_freshness                 0
        notifications_enabled           1
        event_handler_enabled           1
        flap_detection_enabled          1
        process_perf_data               1
        retain_status_information       1
        retain_nonstatus_information    1
        max_check_attempts              4
        check_interval                  1
        retry_interval                  1
        is_volatile                     0
        check_period                    24x7
        contact_groups                  admins
        notification_interval           10
        notification_period             24x7
        notification_options            w,u,c,r
        register                        0
}

# service for standard-service-24x7 macro above
define service {
        use                     standard-service-24x7
        host_name               dedicatedtech
        service_description     SMTP
        check_command           check_smtp
}

# escalation definition for a critical service
# admins-escal has several email addresses defined in it
define serviceescalation {
        name                    critical-services
        first_notification      1
        last_notification       10
        contact_groups          admins-escal
        notification_interval   2
        register                0
}

# test critical services
define serviceescalation {
        use                     critical-services
        host_name               beast
        service_description     PROCS-qemu
}

Is there other information I should provide to better troubleshoot this?

Any help greatly appreciated.
Thanks,
Alex

------------------------------------------------------------------------------
Learn Graph Databases - Download FREE O'Reilly Book
"Graph Databases" is the definitive new guide to graph databases and 
their applications. This 200-page book is written by three acclaimed 
leaders in the field. The early access version is available now. 
Download your free book today! http://p.sf.net/sfu/neotech_d2d_may
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

Previous message: Variables for determining time before first alert
Next message: High Availabilty with Nagios
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

More information about the Users mailing list