nagios does its checks, but doesn't notify
Marco Herrn
ml at mherrn.de
Fri Feb 10 11:06:34 CET 2006
Hi there,
I have nagios (1.3) running on one server to monitor two other servers. On
one of these two servers the nagios nrpe-server is running to allow some
additional checks.
These checks seem to be executed correctly, as the nrpe-server is
regularly logging them:
----/----
Feb 8 16:47:32 dsxx-xxx-xxx-xx nrpe[15233]: Connection from xxx.xxx.xxx.xxx port 53758
Feb 8 16:47:32 dsxx-xxx-xxx-xx nrpe[15233]: Host address checks out ok
Feb 8 16:47:32 dsxx-xxx-xxx-xx nrpe[15233]: Handling the connection...
Feb 8 16:47:32 dsxx-xxx-xxx-xx nrpe[15233]: Host is asking for command 'check_mysql' to be run...
Feb 8 16:47:32 dsxx-xxx-xxx-xx nrpe[15233]: Running command: /usr/lib/nagios/plugins/check_mysql --check-slave
Feb 8 16:47:32 dsxx-xxx-xxx-xx nrpe[15233]: Command completed with return code 2 and output: Access denied for user: 'nagios@localhost' (Using password: NO)
Feb 8 16:47:32 dsxx-xxx-xxx-xx nrpe[15233]: Return Code: 2, Output: Access denied for user: 'nagios@localhost' (Using password: NO)
Feb 8 16:47:32 dsxx-xxx-xxx-xx nrpe[15233]: Connection from xxx.xxx.xxx.xxx closed.
----/----
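As far as I understand the usual Nagios plugin convention, the exit code decides the service state (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN), so the return code 2 above should mean check_mysql is reporting CRITICAL. A minimal sketch of that mapping in Python; the names PLUGIN_STATES and state_for are made up for illustration and are not part of nagios or nrpe:

```python
# Conventional Nagios plugin exit codes and the service state each one signals.
PLUGIN_STATES = {0: "OK", 1: "WARNING", 2: "CRITICAL", 3: "UNKNOWN"}


def state_for(return_code: int) -> str:
    """Translate a plugin exit code into a state name."""
    # Assumption of this sketch: any code outside 0-3 is reported as UNKNOWN.
    return PLUGIN_STATES.get(return_code, "UNKNOWN")


# The nrpe log above shows "return code 2" for check_mysql.
print(state_for(2))
```

So the check itself runs and fails with a CRITICAL result; the open question is what the nagios host does with that result.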
But on the actual nagios host, nearly nothing gets logged:
----/----
[1139409698] Nagios 1.3 starting... (PID=31188)
----/----
And no notifications are sent, even when a service is not available.
At times the nagios host did log the following:
----/----
[1139402923] SERVICE ALERT: myhost;total_procs;CRITICAL;SOFT;2;PROCS CRITICAL: 209 processes
[1139402983] SERVICE ALERT: myhost;total_procs;CRITICAL;SOFT;3;PROCS CRITICAL: 204 processes
[1139403043] SERVICE ALERT: myhost;total_procs;CRITICAL;HARD;4;PROCS CRITICAL: 206 processes
[1139403253] SERVICE ALERT: myhost;http;CRITICAL;SOFT;1;CRITICAL - Socket timeout after 10 seconds
[1139403313] SERVICE ALERT: myhost;http;CRITICAL;SOFT;2;CRITICAL - Socket timeout after 10 seconds
[1139403343] SERVICE ALERT: myhost;total_procs;WARNING;HARD;4;PROCS WARNING: 185 processes
[1139403373] SERVICE ALERT: myhost;http;CRITICAL;SOFT;3;CRITICAL - Socket timeout after 10 seconds
[1139403433] SERVICE ALERT: myhost;http;CRITICAL;HARD;4;CRITICAL - Socket timeout after 10 seconds
----/----
These entries did not appear regularly, and I see no pattern in when they
were logged. In these cases, too, no notifications were sent.
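To look for a pattern, it may help to turn the epoch timestamps in the SERVICE ALERT lines into readable times and split out the fields. A rough sketch in Python, assuming the lines always have the shape shown above (timestamp in brackets, then host;service;state;state type;attempt;output); parse_alert is a hypothetical helper of my own, not a nagios tool:

```python
from datetime import datetime, timezone


def parse_alert(line: str) -> dict:
    """Split one 'SERVICE ALERT' log line into its fields."""
    # "[1139402923] SERVICE ALERT: host;..." -> "[1139402923" and the rest
    stamp, rest = line.split("] ", 1)
    when = datetime.fromtimestamp(int(stamp.lstrip("[")), tz=timezone.utc)
    # Drop the "SERVICE ALERT" label, keep the semicolon-separated payload.
    _label, payload = rest.split(": ", 1)
    # Limit the split to 5 so semicolons in the plugin output stay intact.
    host, service, state, state_type, attempt, output = payload.split(";", 5)
    return {
        "time": when,
        "host": host,
        "service": service,
        "state": state,
        "type": state_type,
        "attempt": int(attempt),
        "output": output,
    }


lines = [
    "[1139402923] SERVICE ALERT: myhost;total_procs;CRITICAL;SOFT;2;PROCS CRITICAL: 209 processes",
    "[1139403043] SERVICE ALERT: myhost;total_procs;CRITICAL;HARD;4;PROCS CRITICAL: 206 processes",
]
for entry in map(parse_alert, lines):
    print(entry["time"].isoformat(), entry["service"], entry["state"], entry["type"])
```

The times are printed in UTC. As far as I know, notifications are only considered for HARD state changes, so those are the entries to compare against the mail log.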
Now, how can I find out what is going wrong here? Is there a debug
option for nagios to get more verbose logging?
Or any other ideas?
Regards
Marco