FW: Nagios 3.0.5 problem

Rick Mangus rick.mangus+nagios at gmail.com
Mon Feb 1 16:00:13 CET 2010


Oops, sent that email to you directly, and not the list.  The answers to
your questions should be plainly visible below; my apologies to anyone
reading on nagios-users.

As regards question 1, I am certain we are not running ndo2db.  I'm sorry if
my first answer seemed ambiguous, perhaps I should have stated outright that
I am not using ndoutils.  I instead attempted to let you know that we were
running some software that similarly stored data in a database and had
already ruled out large/slow database queries as the source of my problem.

Your response to number 2 is quite intriguing!  Further detail on that might
be helpful, though.  I'll check to see if anyone is using this server for
anything else that could interfere.  I don't think a nagios check can be
blocking, as 99% of our checks are passive checks passed through nsca and I
don't think any of our active checks besides host checks (check_icmp)
actually contact another computer.

As to the pre-existing administrator, he was just rebooting the server every
day as he did not know what was causing the problem. If I am convinced that
an upgrade will fix the problem, I will email him.  Otherwise, until there
is some change, there's not much point in bothering him during his
sabbatical.

Finally, the checkresults thing.  It seemed odd to me, but it looked like
several normal files concatenated.  I can't actually look at the file again
until the server is booted and on a Monday morning that could be a while.
;)

Thanks for your help!

--Rick

On Mon, Feb 1, 2010 at 8:21 AM, <jonathan.wheeler at stfc.ac.uk> wrote:

> From: rickmangus at gmail.com On Behalf Of Rick Mangus
> Sent: 01 February 2010 13:30
>
> > Thank you for the response.  In quick succession:
>
> I am forwarding my replies to the Nagios list as well.
>
> > 1. I do use perfparse, and one of our suspicions involved a mysql delete
> to prune old data
> > that took multiple hours every night.  I removed all jobs that I could
> find that could
> > possibly interfere.
>
> You can determine that you are running ndoutils by issuing a command like
> "ps -fu naguos" (to list all processes owned by username nagios; if you are
> running ndoutils, there will be a process named ndo2db
>
> > 2. No, or if we do, it's well-hidden.  ;)
>
> In our case, the process causing the main problem was running on another
> server, but was holding up nagios because it blocked the nagios process in
> part of the code that was single-threaded.
>
> > 3. I am walking into a pre-existing install, and trying to slowly take
> over the management
> > duties.  To hasten the learning process, the only person in the office
> who knows anything
> > about it went to Hawaii once I'd been working here a few weeks.  I don't
> know that I should
> > attempt any major changes without his blessing, and he will not return
> until
> > March.  Though, if we determine that is the only/best fix, I'll do it.
>
> Can he be contacted for his advice ?  If not, what has been changed since
> you took over (probably asking the obvious questions !)
>
> > One additional data point:  I found on Saturday night, as I logged in to
> restart Nagios and
> > prevent the machine dying, that the one file in
> /ramdisk_nagios/checkresults/ was over
> > 1MB.  Every other time I have checked, the files in there are sub-4kB.
> If that tells
> > anyone here anything, please share with the peanut gallery (me!).  :)
>
> 1MB sounds very large; if you see it again, try finding out which check
> generated this file (core or debug dump from check code ?).
>
> Jonathan Wheeler
> e-Science Centre
> Rutherford Appleton Laboratory
>
>
>
> --
> Scanned by iCritical.
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.monitoring-lists.org/archive/users/attachments/20100201/4923da48/attachment.html>
-------------- next part --------------
------------------------------------------------------------------------------
The Planet: dedicated and managed hosting, cloud storage, colocation
Stay online with enterprise data centers and the best network in the business
Choose flexible plans and management services without long-term contracts
Personal 24x7 support from experience hosting pros just a phone call away.
http://p.sf.net/sfu/theplanet-com
-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null


More information about the Users mailing list