nagios core dump and restart when check_nrpe
Kelvin Xu
kyoxu at hotmail.com
Fri Jan 4 05:00:34 CET 2008
Hi all,
I have just installed Nagios 3.0rc1 on a Solaris 10 machine. Everything works fine except when I run a check_nrpe check against a remote host or localhost. I checked /var/adm/messages; below is a section of the output:
Jan 4 10:16:39 pnsgsit1gw1 nagios[263]: [ID 702911 user.info] Caught SIGTERM, shutting down...
Jan 4 10:16:39 pnsgsit1gw1 nagios[263]: [ID 702911 user.info] Successfully shutdown... (PID=263)
Jan 4 10:16:39 pnsgsit1gw1 nagios[290]: [ID 702911 user.info] Nagios 3.0rc1 starting... (PID=290)
Jan 4 10:16:39 pnsgsit1gw1 nagios[290]: [ID 702911 user.info] Local time is Fri Jan 04 10:16:39 SGT 2008
Jan 4 10:16:39 pnsgsit1gw1 nagios[290]: [ID 702911 user.info] LOG VERSION: 2.0
Jan 4 10:16:39 pnsgsit1gw1 nagios[291]: [ID 702911 user.info] Finished daemonizing... (New PID=291)
Jan 4 10:17:53 pnsgsit1gw1 genunix: [ID 603404 kern.notice] NOTICE: core_log: nagios[302] setid process, core not dumped: /var/core/core.nagios.302.pnsgsit1gw1.210033.65541.1199413073
Jan 4 10:17:53 pnsgsit1gw1 nagios[291]: [ID 702911 user.info] Caught SIGTERM, shutting down...
Jan 4 10:17:53 pnsgsit1gw1 nagios[291]: [ID 702911 user.info] Successfully shutdown... (PID=291)
Jan 4 10:17:53 pnsgsit1gw1 nagios[305]: [ID 702911 user.info] Nagios 3.0rc1 starting... (PID=305)
Jan 4 10:17:53 pnsgsit1gw1 nagios[305]: [ID 702911 user.info] Local time is Fri Jan 04 10:17:53 SGT 2008
Jan 4 10:17:53 pnsgsit1gw1 nagios[305]: [ID 702911 user.info] LOG VERSION: 2.0
Jan 4 10:17:53 pnsgsit1gw1 nagios[306]: [ID 702911 user.info] Finished daemonizing... (New PID=306)
This repeats every few minutes, and it does not occur when I remove the NRPE service check from the configuration.
I also tried running /usr/local/nagios/libexec/check_nrpe -H pnsgsit1gw2 -c check_load by hand. The output seems fine, except that some additional characters are appended to the end:
OK - load average: 0.00, 0.00, 0.00|load1=0.000;15.000;30.000;0; load5=0.000;10.000;25.000;0; load15=0.000;5.000;20.000;0;ÿ¿àpÿ:
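To see exactly which bytes follow the performance data, the raw plugin output could be captured and split at the first byte outside printable ASCII. The following is only a minimal sketch, assuming a Python 3 interpreter is available on the Nagios host; the helper names split_plugin_output and run_and_split are hypothetical and not part of Nagios or NRPE.

```python
import subprocess


def split_plugin_output(raw: bytes):
    """Split raw plugin output into (printable ASCII text, leftover bytes).

    Everything from the first byte that is not printable ASCII
    (or tab / newline / carriage return) onwards is returned as leftover.
    """
    for i, b in enumerate(raw):
        if not (32 <= b <= 126 or b in (9, 10, 13)):
            return raw[:i].decode("ascii"), raw[i:]
    return raw.decode("ascii"), b""


def run_and_split(cmd):
    """Run a plugin command (list of arguments) and split its stdout."""
    result = subprocess.run(cmd, stdout=subprocess.PIPE)
    return split_plugin_output(result.stdout)
```

Calling run_and_split(["/usr/local/nagios/libexec/check_nrpe", "-H", "pnsgsit1gw2", "-c", "check_load"]) and printing the second element with .hex() would show the trailing bytes in hexadecimal, which is easier to compare between runs than the characters a terminal happens to display.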
Below is my host configuration:
define host{
        use                     linux-server
        host_name               pnsgsit1gw2     ; The name we're giving to this server
        alias                   pnsgsit1gw2     ; A longer name for the server
        address                 10.106.127.106  ; IP address of the server
        }

define service{
        use                     local-service
        host_name               pnsgsit1gw2
        service_description     NRPE
        check_command           check_nrpe!check_load
        }

define service{
        use                     local-service   ; Name of service template to use
        host_name               pnsgsit1gw2
        service_description     HTTP
        check_command           check_http
        notifications_enabled   0
        }
Does anyone have any idea what the problem could be?
Thanks
Regards,
Kelvin Xu