Nagios process segfaulting, wedging
Dustin J. Mitchell
dustin at v.igoro.us
Thu Oct 19 17:12:43 CEST 2006
I'm afraid I don't have a lot of detail on this problem yet. On two occasions
about a week apart, my nagios process has wedged itself. I'm using nagios-2.5
on amd64, with a Gentoo install. Its log contains (sanitized; these were all
different hosts/services):
Oct 6 00:00:00 [nagios] CURRENT SERVICE STATE:
some-host;some-service;OK;HARD;1;HTTP OK HTTP/1.1 200 OK - 0.568 second
response time_
Oct 6 00:00:00 [nagios] Caught SIGSEGV, shutting down..._
in one case (this was at midnight, directly after the logfile rotation), and
[1160714430] EXTERNAL COMMAND:
PROCESS_SERVICE_CHECK_RESULT;some-host;some-service;0;Blah blah
[1160714480] EXTERNAL COMMAND:
PROCESS_SERVICE_CHECK_RESULT;some-host;some-service;0;Blah blah
[1160714481] Caught SIGSEGV, shutting down...
(this was *not* at midnight, so no fair blaming the logfile rotation)
This seemed to start after I implemented a number of service checks for
cronjobs; these are implemented as "OK" reports delivered via NSCA on
successful completion of the cronjob, and a freshness check set to some small
multiple of the cronjob frequency. Some of these cronjobs run every fifteen
minutes on a half-dozen hosts, so it's a small but non-trivial amount of traffic.
Based on mailing list archives, I tried bombarding nagios with nsca requests
and watching its memory consumption -- it hovered at a reasonable number. I
put nagios in debugging mode, which unfortunately makes it fairly unusable for
actual monitoring, so I can't leave it in that mode for a week. I could not
replicate the crash.
My questions, then, are:
* is there a known bug that could be causing this?
* is there anything I can do to help track down what might be causing this
next time it happens?
Thanks!
Dustin
-------------------------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
More information about the Users
mailing list