Nagios just stopped running
Rimbert Rivera
rrivera at comtex.com
Wed Jan 5 22:29:49 CET 2005
It looks like we are using performance data, but I was able to rename
the current one and nagios created a new one and continues working
without having to restart nagios. I did the same with
host-perfdata.out. We'll add this to our periodic maintenance
procedure.
- Rim
Rimbert Rivera
Manager, Information Technology
COMTEX News Network
rrivera at comtex.com
(703) 820-2000
Discover more about COMTEX at: http://www.comtex.com/
This e-mail is intended solely for the person or entity to which it is
addressed and may contain confidential and/or privileged information.
Any review, dissemination, copying, printing or other use of this e-mail
by persons or entities other than the addressee is prohibited. If you
have received this e-mail in error, please contact the sender
immediately and delete the material from any computer.
-----Original Message-----
From: Stephan Janosch [mailto:stephan.janosch at interface-business.de]
Sent: Wednesday, January 05, 2005 3:45 AM
To: Rimbert Rivera
Cc: nagios-users at lists.sourceforge.net
Subject: Re: [Nagios-users] Nagios just stopped running
Rimbert Rivera wrote:
> I have a cron job that runs the check_nagios plugin and e-mails us the
> output. Earlier today, we started getting:
> "Nagios problem: located 3 processes, status log updated 1565 seconds
ago"
>
> Everytime it ran, it was the same output with a longer time that it
> wasn't updated. This was working fine before where the status log
> would be updated usually no longer than 8 seconds ago. I checked the
> status.log and status.sav and confirmed that they hadn't updated. I
> restarted nagios but I still had the same problem. Even though none
> of the partitions were running out of space, I deleted archived logs
> and restarted nagios but same problem. I did some more
> troubleshooting without any luck. Long story short, I rebooted the
> RH9 box it was running on and nagios started running again.
>
> Anyone have an idea of what could've happened and things I could
check?
> This is the first time this has ever happened as far as I know. The
> recent changes we made were just setting up one new host to monitor so
> we edited hosts.cfg, hostgroups.cfg and services.cfg but nagios
> restarted without error. I even took out those changes and restarted
> nagios but still had the same problem. One thing I noticed was our
> service-perfdata.out is 790 MB. Can I delete this and nagios will
> create a new one? I'm not sure it's the problem since it's still that
> big and nagios is running now but it doesn't seem like I want that
> file to get that big.
To your service-perfdata.out. If you don't need any performance data,
you can switch perfmormance data logging of. Depending on your
performance command definition, you can simply delete it. I don't know,
if you have changed that command. Look into misccomands.cfg, there it is
located by standard.
>
> What kind of maintenance should I be performing on nagios? We've had
> it running for over a year and we haven't really did any kind of
cleanup on it.
>
> Your help to this newbie would be greatly appreciated.
Stephan
-------------------------------------------------------
The SF.Net email is sponsored by: Beat the post-holiday blues
Get a FREE limited edition SourceForge.net t-shirt from ThinkGeek.
It's fun and FREE -- well, almost....http://www.thinkgeek.com/sfshirt
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
More information about the Users
mailing list