Multiple Nagios processes
Brian Sudis
SudisB at crlcorp.com
Thu Sep 15 15:34:33 CEST 2005
I've had Nagios 1.2 running for over a year now and have never seen this
problem before.
Currently running on RH ES 3 (2.4.21-32.0.1ELsmp).
Sometime in during the night the web interface started reports the
warning that monitoring processes may not be running.
Process info page reports the process status as warning and check
command output states
"Nagios problem: located 4 processes, status log updated 1126790135
seconds ago"
I have reviewed F0021, and F0123 and neither seem to apply.
I've validated the config (nagios -v nagios.cfg) and it passes
correctly.
The number of located processes changes.
Here is a quick look at nagios owned processes. Obviously there is more
than one nagios daemon running, which is a bad thing. I've stopped,
check that they stopped, and started nagios and it reverts back to this
each time. It looks like a new nagios daemon is being spawned for each
attempted check.
Here is thee successive looks at process status.
nagios 10720 1 0 08:02 ? 00:00:03 /usr/bin/nagios -d
/etc/nagios/nagios.cfg
nagios 13898 1 0 08:20 ? 00:00:00 /usr/bin/nagios -d
/etc/nagios/nagios.cfg
nagios 13899 13898 0 08:20 ? 00:00:00
/usr/libexec/nagios/check_ping -H sassvr1.crlcorp.com -w 100.0,20% -c
500.0,60% -p 5
nagios 13900 13899 0 08:20 ? 00:00:00 /bin/ping -n -U -c 5
sassvr1.crlcorp.com
Next process check.
nagios 10720 1 0 08:02 ? 00:00:03 /usr/bin/nagios -d
/etc/nagios/nagios.cfg
nagios 14101 1 0 08:21 ? 00:00:00 /usr/bin/nagios -d
/etc/nagios/nagios.cfg
nagios 14102 14101 0 08:21 ? 00:00:00
/usr/libexec/nagios/check_ping -H devsys.crlcorp.com -w 100.0,20% -c
500.0,60% -p 5
nagios 14103 14102 0 08:21 ? 00:00:00 /bin/ping -n -U -c 5
devsys.crlcorp.com
Next process check.
nagios 10720 1 0 08:02 ? 00:00:03 /usr/bin/nagios -d
/etc/nagios/nagios.cfg
nagios 14148 1 0 08:21 ? 00:00:00 /usr/bin/nagios -d
/etc/nagios/nagios.cfg
nagios 14149 14148 0 08:21 ? 00:00:00
/usr/libexec/nagios/check_ping -H ghostsvr.crlcorp.com -w 100.0,20% -c
500.0,60% -p 5
nagios 14150 14149 0 08:21 ? 00:00:00 /bin/ping -n -U -c 5
ghostsvr.crlcorp.com
The nagios.log file appears to be updating correctly. (At least it is
updating.)
One item of intrigue seems to be that on the program info page it shows
program start time as 12-31-1969 18:00:00!
System date and time are correct and ntp is running.
Any suggestions?
Thanks,
Brian
-------------------------------------------------------
SF.Net email is sponsored by:
Tame your development challenges with Apache's Geronimo App Server.
Download it for free - -and be entered to win a 42" plasma tv or your very
own Sony(tm)PSP. Click here to play: http://sourceforge.net/geronimo.php
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
More information about the Users
mailing list