No host, or service notifications received from Nagios 2.9 for critical states
Evgeny Stepanov
enst at mail.ru
Wed Aug 22 10:10:06 CEST 2007
It seems you don't have contact_groups in your host and service
definitions. Check templates or host and service definitions to have
contact_groups clause, like
define host{
use windows-servers ; Name of host template to use
; This host definition will inherit all variables that are defin$
; in (or inherited by) the windows-server host template definiti$
host_name server03
alias Label Server
address 192.168.10.127
contact_groups admins
}
the latter line can be in host or service template definition also.
Mark Nassy пишет:
> no notifications are being received from nagios when a service is
> down. i power off server03 and did not receive a notification from
> nagios. the log shows no record of an attempt to send a notification.
> i can manually send notifications successfully. any ideas why?
>
> here is the log with no record of a notification attempt.
>
> $ cat /opt/local/var/nagios/nagios.log
> [1187737502] HOST ALERT: server03;DOWN;SOFT;9;CRITICAL - Plugin timed
> out after 10 seconds
> [1187737512] HOST ALERT: server03;DOWN;HARD;10;CRITICAL - Plugin
> timed out after 10 seconds
> [1187737512] SERVICE ALERT: server03;PING;CRITICAL;HARD;1;CRITICAL -
> 192.168.10.127: rta nan, lost 100%
> [1187738002] HOST ALERT: server03;UP;HARD;1;PING OK - Packet loss =
> 0%, RTA = 0.45 ms
> [1187738002] SERVICE ALERT: server03;PING;OK;HARD;1;OK -
> 192.168.10.127: rta 0.589ms, lost 0%
> [1187739982] Auto-save of retention data completed successfully.
> ....
> 1187741712] HOST ALERT: server03;DOWN;HARD;10;CRITICAL - Plugin timed
> out after 10 seconds
> [1187741712] SERVICE ALERT: server03;PING;CRITICAL;HARD;1;CRITICAL -
> 192.168.10.127: rta nan, lost 100%
> [1187743582] Auto-save of retention data completed successfully.
> [1187745607] Caught SIGEXIT, shutting down...
> [1187745607] Successfully shutdown... (PID=3492)
> [1187745618] Nagios 2.9 starting... (PID=5949)
> [1187745618] LOG VERSION: 2.0
> [1187745618] Finished daemonizing... (New PID=5950)
> [1187749218] Auto-save of retention data completed successfully.
>
>
>
> file system permissions review looks ok (to me).
> $ ls -l
> ....
> -r-sr-xr-x 2 root admin 46644 Aug 20 11:46 check_icmp
> ....
> -rwxr-xr-x 2 root admin 42496 Aug 20 11:46 check_ping
>
>
>
> using the check_ping command returns the expected result for a host
> or service that is down.
> $ sudo -u nagios ./check_ping -H server03 -w 100.0,20% -c 500.0,60%
> CRITICAL - Plugin timed out after 10 seconds
>
>
> manually sending an email using the code from the notification
> command works. i receive the email.
> $ sudo -u nagios /usr/bin/printf "%b" "***** Nagios 2.9 *****\n
> \nNotification Type: CRITICAL\n\nService: PING\nHost: server03
> \nAddress: 192.168.10.127\nState: down\n\nDate/Time: Today Aug 22nd\n
> \nAdditional Info:\n\nTimeout" | /usr/bin/mail -s "** CRITICAL alert
> - server03/PING is down **" it at intranet.com
>
>
> config directory set and notifications turned on.
> $ cat /opt/local/etc/nagios/nagios.cfg
> ....
> cfg_dir=/opt/local/etc/nagios/ny
> ....
> log_notifications=1
>
>
>
> $ cat /opt/local/etc/nagios/ny/contacts.cfg
> ....
> define contact{
> contact_name nagios-admin
> alias Nagios Admin
> service_notification_period 24x7
> host_notification_period 24x7
> service_notification_options w,u,c,r
> host_notification_options d,r
> service_notification_commands notify-by-email
> host_notification_commands host-notify-by-email
> email it at intranet.com
> }
>
>
>
>
> $ cat /opt/local/etc/nagios/ny/contactgroups.cfg
> ....
> define contactgroup{
> contactgroup_name admins
> alias Nagios Administrators
> members nagios-admin
> }
>
>
>
> note: the ping command actually runs check_icmp in the command line.
> $ cat /opt/local/etc/nagios/ny/commands.cfg
> ....
>
> # 'check_ping' command definition
> define command{
> command_name check_ping
> command_line $USER1$/check_icmp -H $HOSTADDRESS$ -w $ARG1
> $ -c $ARG2$ -p 5
> }
> ....
> # 'host-notify-by-email' command definition
> define command{
> command_name host-notify-by-email
> command_line /usr/bin/printf "%b" "***** Nagios 2.9 *****
> \n\nNotification Type: $NOTIFICATIONTYPE$\nHost: $HOSTNAME$\nState:
> $HOSTSTATE$\nAddress: $HOSTADDRESS$\nInfo: $HOSTOUTPUT$\n\nDate/Time:
> $LONGDATETIME$\n" | /usr/bin/mail -s "Host $HOSTSTATE$ alert for
> $HOSTNAME$!" $CONTACTEMAIL$
> }
>
> # 'notify-by-email' command definition
> define command{
> command_name notify-by-email
> command_line /usr/bin/printf "%b" "***** Nagios 2.9 *****
> \n\nNotification Type: $NOTIFICATIONTYPE$\n\nService: $SERVICEDESC$
> \nHost: $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState: $SERVICESTATE$\n
> \nDate/Time: $LONGDATETIME$\n\nAdditional Info:\n\n$SERVICEOUTPUT$"
> | /usr/bin/mail -s "** $NOTIFICATIONTYPE$ alert - $HOSTALIAS$/
> $SERVICEDESC$ is $SERVICESTATE$ **" $CONTACTEMAIL$
> }
>
>
>
>
> $ cat /opt/local/etc/nagios/ny/hostgroups.cfg
> ....
> define hostgroup{
> hostgroup_name windows-servers
> alias Windows Servers
> members server01,server02,server03
> }
>
>
>
>
> $ cat /opt/local/etc/nagios/ny/hosts.cfg
> ....
> define host{
> use windows-servers ; Name of
> host template to use
> ; This host
> definition will inherit all variables that are defin$
> ; in (or
> inherited by) the windows-server host template definiti$
> host_name server03
> alias Label Server
> address 192.168.10.127
> }
>
>
>
> $ cat /opt/local/etc/nagios/ny/services.cfg
> ....
> define service{
> use remote-service ;
> Name of service template to use
> hostgroup windows-servers
> service_description PING
> check_command check_ping!100.0,20%!500.0,60%
> }
>
>
>
>
> -------------------------------------------------------------------------
> This SF.net email is sponsored by: Splunk Inc.
> Still grepping through log files to find problems? Stop.
> Now Search log events and configuration files using AJAX and a browser.
> Download your FREE copy of Splunk now >> http://get.splunk.com/
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>
>
>
--
С уважением,
Степанов Е.Н.
enst at mail.ru
icq 2899514
+7-985-2267458
-------------------------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems? Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
More information about the Users
mailing list