Help reviewing current configuration
Alex
mysqlstudent at gmail.com
Wed May 8 06:16:36 CEST 2013
Hi all,
I've worked hours and hours on trying to get these notifications
working properly and I'm still confused. I really hoped someone could
review my configuration to determine where I might be doing something
wrong, and provide any suggestions for improvement. I have a handful
of critical servers for which I'd like to be notified within two or
three minutes of a problem, such as the web server having been
stopped.
It's my understanding that I first define this as a service, while in
parallel define varying types of escalations - a regular escalation
and a critical escalation. Both of these rely on checking the host for
service issues max_check_attempts times, and when that number is
reached, an event trigger occurs and the service is escalated. Perhaps
I don't understand how a service entering a failed state causes a
serviceescalation definition to take effect. How are they connected
or linked?
If I have the first notification for an escalated service set to 1,
and the last set to 10, with a notification_interval set to 2, there
would effectively be two minutes between each notification, correct?
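To make the timing I'm expecting concrete, here is a rough Python sketch
of the arithmetic. It is only my assumption of how the values combine,
not anything taken from Nagios itself: it assumes the default
interval_length of 60 seconds (so every value is in minutes), that the
first notification goes out as soon as the hard state is reached, and
that the escalation's interval applies from notification 1 onward. The
function name is made up for illustration.

```python
# Sketch of the notification timeline I expect (my assumption, not Nagios code).
# Assumes interval_length is the default 60 seconds, so all values are minutes.

def expected_notification_minutes(max_check_attempts, retry_interval,
                                  notification_interval,
                                  first_notification, last_notification):
    """Minutes after the first failed check at which I expect each
    escalated notification to be sent."""
    # The first failed check counts as attempt 1; each further attempt
    # comes one retry_interval later, so the hard state is reached after
    # (max_check_attempts - 1) retries.
    hard_state_at = (max_check_attempts - 1) * retry_interval
    # Assumption: notification 1 is sent at the hard state, and each later
    # one follows notification_interval minutes after the previous one.
    return [
        hard_state_at + (n - 1) * notification_interval
        for n in range(first_notification, last_notification + 1)
    ]

timeline = expected_notification_minutes(
    max_check_attempts=4, retry_interval=1,
    notification_interval=2,
    first_notification=1, last_notification=10)
print(timeline)
```

With the values from my configuration this gives a first alert about
three minutes after the first failed check and one every two minutes
after that, which is what I was hoping to see.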
I've included a few templates that I've built below, along with the
services that use them, and hoped someone could review them to
see where I might be going wrong.
With the configuration below, it seems I'm only notified at the
10-minute mark, which I guess is the notification_interval set in the
standard-service-24x7 template. The service never seems to escalate to
critical-services, where I should receive an alert after every second
check of the host.
define service {
    name                          standard-service-24x7
    active_checks_enabled         1
    passive_checks_enabled        1
    parallelize_check             1
    obsess_over_service           1
    check_freshness               0
    notifications_enabled         1
    event_handler_enabled         1
    flap_detection_enabled        1
    process_perf_data             1
    retain_status_information     1
    retain_nonstatus_information  1
    max_check_attempts            4
    check_interval                1
    retry_interval                1
    is_volatile                   0
    check_period                  24x7
    contact_groups                admins
    notification_interval         10
    notification_period           24x7
    notification_options          w,u,c,r
    register                      0
}
# service using the standard-service-24x7 template above
define service {
    use                  standard-service-24x7
    host_name            dedicatedtech
    service_description  SMTP
    check_command        check_smtp
}
# escalation definition for a critical service
# admins-escal has several email addresses defined in it
define serviceescalation {
    name                   critical-services
    first_notification     1
    last_notification      10
    contact_groups         admins-escal
    notification_interval  2
    register               0
}
# test critical services
define serviceescalation {
    use                  critical-services
    host_name            beast
    service_description  PROCS-qemu
}
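For what it's worth, here is how I currently read the docs on matching,
written as a small Python sketch. This is my assumption, not Nagios
source code: I'm assuming an escalation is tied to a service purely by
its host_name and service_description pair, and that it applies only
while the notification number is between first_notification and
last_notification (with 0 meaning no upper limit). The function and
dictionary names are made up for illustration.

```python
# Sketch of how I understand escalation matching (an assumption based on
# my reading of the docs, not Nagios source code).

def escalation_applies(escalation, host_name, service_description,
                       notification_number):
    """True if the escalation covers this service and notification number."""
    same_service = (
        escalation["host_name"] == host_name
        and escalation["service_description"] == service_description
    )
    first = escalation["first_notification"]
    last = escalation["last_notification"]
    # A last_notification of 0 is documented as "keep escalating forever".
    in_range = notification_number >= first and (
        last == 0 or notification_number <= last
    )
    return same_service and in_range

# Values copied from the serviceescalation definitions above.
critical = {
    "host_name": "beast",
    "service_description": "PROCS-qemu",
    "first_notification": 1,
    "last_notification": 10,
}

print(escalation_applies(critical, "beast", "PROCS-qemu", 1))
print(escalation_applies(critical, "dedicatedtech", "SMTP", 1))
```

If that reading is right, the escalation above would only ever affect
PROCS-qemu on beast, and the SMTP service on dedicatedtech would keep
using the 10-minute interval from its template. I'd appreciate
confirmation of whether this is actually how the two are linked.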
Is there other information I should provide to better troubleshoot this?
Any help greatly appreciated.
Thanks,
Alex
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users