Antwort: log errors in nagios.log
Harry de Grote
harry at cc.kuleuven.be
Tue Aug 23 15:57:19 CEST 2005
On Tuesday 23 August 2005 15:20, srunschke at abit.de wrote:
> I fail to see what you mean?
> What's wrong with those errors?
> It shows that you IMAP Service seems to have been unavailable
> for a few minutes every few weeks. That tends to happen when
> you work on your servers.
>
> Where do you see the actual problem?
> It might have been a good idea to post some more information
> what we should look for or what you consider suspicious.
hi all,
sorry, i sent another mail, but i made a little mistake... this is the first
one:
<quote>
hey all,
i have a huge problem with nagios 2.0b4 (and all the other ones)
we're running a nagios 1.2 (before that, the 1.1) for almost 2 years now.
i want to create availability reports for the hosts and services (to give to
the management guys). now here's the problem...
there are a lot of services in an unknown state for a very long time. although
we KNOW they were online, and nagios even said they were online.
after a long search, i found the following:
when you add a service check, and restart nagios, it says: status ok, but it
doesn't write anything in the logs ==> no hard ok ==> still unknown in the
availability report. even when it gets a soft critical, and becomes ok again,
it still writes soft OK in the logs... still not enough to show ok in the
availability report. the only way to get it to show up, is to get it hard
critical, and then ok again... this will put a hard critical and hard ok in
the logs, which will result in ok status in the reports.
here the logs of my tests (explanation inserted in the log):
!!! restart because i added a webserver check on host damien.kotnet.org !!!
[1124788029] Caught SIGHUP, restarting...
[1124788029] Nagios 2.0b4 starting... (PID=22971)
[1124788029] LOG VERSION: 2.0
[1124788114] Caught SIGTERM, shutting down...
[1124788114] Successfully shutdown... (PID=22971)
[1124788114] Nagios 2.0b4 starting... (PID=8575)
[1124788114] LOG VERSION: 2.0
[1124788114] Finished daemonizing... (New PID=892)
!!! in availability report: 100% undetermined, while it says, service up !!!
!!! i shut down the webserver on host damien.kotnet.org and forced a http
check !!!
SCHEDULE_FORCED_SVC_CHECK;damien.kotnet.org;web;1124788671
[1124788674] SERVICE ALERT: damien.kotnet.org;web;CRITICAL;SOFT;1;Connection
refused
!!! service critical, but only soft error... still 100% undetermined!!!
!!! restarted webserver before the service became hard critical !!!
[1124788824] SERVICE ALERT: damien.kotnet.org;web;OK;SOFT;2;HTTP OK HTTP/1.1
200 OK - 1205 bytes in 0.031 seconds
!!! recover from a soft critical ==> soft ok ==> still 100% undetermined !!!
[1124791524] SERVICE ALERT: damien.kotnet.org;web;CRITICAL;SOFT;1;Connection
refused
[1124791584] SERVICE ALERT: damien.kotnet.org;web;CRITICAL;SOFT;2;Connection
refused
[1124791644] SERVICE ALERT: damien.kotnet.org;web;CRITICAL;SOFT;3;Connection
refused
[1124791704] SERVICE ALERT: damien.kotnet.org;web;CRITICAL;HARD;4;Connection
refused
!!! shutdown of webserver again, waiting for a hard critical !!!
!!! ==> undetermined BEFORE this event, critical after !!!
[1124792004] SERVICE ALERT: damien.kotnet.org;web;OK;HARD;4;HTTP OK HTTP/1.1
200 OK - 1205 bytes in 0.037 seconds
!!! only NOW, the service is ok for the first time in the availability log !!!
Is this normal behaviour?? because, i want my services to be shown as up, when
they are up, and down when they are down
this behaviour makes it very hard to get an accurate availability figure.
can i do something about this? maybe write more logs? (server is powerfull
enough, and there is enough storage ;))
can anyone help? maybe inform me if i do something wrong?
thanx in advance,
</quote>
anyway... the problem with these logs is, it gives availability reports as
follows:
http://harry.ulyssis.org/report.jpg
service looks down for half a year, while it was not...
again, sorry for the spam...
greetings,
--
harry
aka Rik Bobbaers
K.U.Leuven - LUDIT -=- Tel: +32 485 52 71 50
Rik.Bobbaers at cc.kuleuven.be -=- http://harry.ulyssis.org
Disclaimer:
By sending an email to ANY of my addresses you are agreeing that:
1. I am by definition, "the intended recipient"
2. All information in the email is mine to do with as I see fit and make
such financial profit, political mileage, or good joke as it lends itself to.
In particular, I may quote it on usenet.
3. I may take the contents as representing the views of your company.
4. This overrides any disclaimer or statement of confidentiality that may be
included on your message.
-------------------------------------------------------
SF.Net email is Sponsored by the Better Software Conference & EXPO
September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices
Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA
Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
More information about the Users
mailing list