Nagios-users digest, Vol 1 #2754 - 16 msgs
Arif Snort
arif.snort at gmail.com
Mon Aug 15 01:30:30 CEST 2005
i am sorry..i am a newbe make linux and nagios...can you help me, how
to install and compile nagios with mysql ?
ok thanks before..
On 8/12/05, nagios-users-request at lists.sourceforge.net
<nagios-users-request at lists.sourceforge.net> wrote:
> Send Nagios-users mailing list submissions to
> nagios-users at lists.sourceforge.net
>
> To subscribe or unsubscribe via the World Wide Web, visit
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> or, via email, send a message with subject or body 'help' to
> nagios-users-request at lists.sourceforge.net
>
> You can reach the person managing the list at
> nagios-users-admin at lists.sourceforge.net
>
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of Nagios-users digest..."
>
>
> Today's Topics:
>
> 1. False Negatives (David Johnson)
> 2. RE: False Negatives (Hosey, Chester)
> 3. RE: False Negatives (David Johnson)
> 4. RE: False Negatives (David Johnson)
> 5. Service Went Down, No notification sent... (John McGowan)
> 6. RE: False Negatives (Marc Powell)
> 7. Re: Service Went Down, No notification sent... (Andreas Ericsson)
> 8. Re: why can't a servicegroup have no members? (Kevin Hanser)
> 9. RE: why can't a servicegroup have no members? (Marc Powell)
> 10. Re: Service Went Down, No notification sent... (John McGowan)
> 11. missing var/objects.cache (naguser at bhodisoft.com)
> 12. RE: missing var/objects.cache (Marc Powell)
> 13. RE: missing var/objects.cache (naguser at bhodisoft.com)
> 14. Disable pager alerts for Acknowledgements ? (Michael J McCafferty)
> 15. external commands audit trail (Marc Martinez)
> 16. =?ISO-8859-1?Q?Multiple_orphaned_nagios_processes?= (Brian Murphy)
>
> --__--__--
>
> Message: 1
> Reply-To: <djohnson at jsatech.com>
> From: "David Johnson" <djohnson at jsatech.com>
> To: <nagios-users at lists.sourceforge.net>
> Date: Thu, 11 Aug 2005 10:20:15 -0500
> Subject: [Nagios-users] False Negatives
>
> Long time user, first time emailer...
>
> We have been loving nagios and we've even submitted some source code =
> improvements. However, we have a problem with false negatives. We have =
> 2 remote servers we monitor in which the SSH service is being reported =
> as down, while the host is being reported as up. Both of these servers =
> are operating properly. In addition, the host check command and service =
> check commands are identical. We have 40 other servers which operate in =
> exactly the same way, yet the SSH service on these 2 is being reported =
> as down. Exact same versions of SSH on all servers also. The only =
> difference is location. =20
>
>
>
> --__--__--
>
> Message: 2
> Subject: RE: [Nagios-users] False Negatives
> Date: Thu, 11 Aug 2005 11:23:27 -0400
> From: "Hosey, Chester" <Chester.Hosey at gianteagle.com>
> To: <djohnson at jsatech.com>,
> <nagios-users at lists.sourceforge.net>
>
> Have you run the service check plugins manually to see what it returns?
> Is it just an occasional failure?
>
> -----Original Message-----
> From: nagios-users-admin at lists.sourceforge.net
> [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of David
> Johnson
> Sent: Thursday, August 11, 2005 11:20 AM
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] False Negatives
>
>
> Long time user, first time emailer...
>
> We have been loving nagios and we've even submitted some source code
> improvements. However, we have a problem with false negatives. We have
> 2 remote servers we monitor in which the SSH service is being reported
> as down, while the host is being reported as up. Both of these servers
> are operating properly. In addition, the host check command and service
> check commands are identical. We have 40 other servers which operate in
> exactly the same way, yet the SSH service on these 2 is being reported
> as down. Exact same versions of SSH on all servers also. The only
> difference is location. =20
>
>
>
> -------------------------------------------------------
> SF.Net email is Sponsored by the Better Software Conference & EXPO
> September 19-22, 2005 * San Francisco, CA * Development Lifecycle
> Practices Agile & Plan-Driven Development * Managing Projects & Teams *
> Testing & QA Security * Process Improvement & Measurement *
> http://www.sqe.com/bsce5sf
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when
> reporting any issue.=20
> ::: Messages without supporting info will risk being sent to /dev/null
>
>
> --__--__--
>
> Message: 3
> Reply-To: <djohnson at jsatech.com>
> From: "David Johnson" <djohnson at jsatech.com>
> To: "Hosey, Chester" <Chester.Hosey at gianteagle.com>,
> <nagios-users at lists.sourceforge.net>
> Subject: RE: [Nagios-users] False Negatives
> Date: Thu, 11 Aug 2005 10:36:33 -0500
>
> I've run them manually both through NRPE and directly on the host that =
> runs them. All returns okay. Exactly the same as all other hosts. I =
> thought perhaps it might be time delay, so I set the timeout on my =
> check_ssh command to 30 seconds (from the default of 10). I did the =
> same with NRPE.
>
>
> -----Original Message-----
> From: Hosey, Chester [mailto:Chester.Hosey at gianteagle.com]
> Sent: Thursday, August 11, 2005 10:23 AM
> To: djohnson at jsatech.com; nagios-users at lists.sourceforge.net
> Subject: RE: [Nagios-users] False Negatives
>
>
> Have you run the service check plugins manually to see what it returns?
> Is it just an occasional failure?
>
> -----Original Message-----
> From: nagios-users-admin at lists.sourceforge.net
> [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of David
> Johnson
> Sent: Thursday, August 11, 2005 11:20 AM
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] False Negatives
>
>
> Long time user, first time emailer...
>
> We have been loving nagios and we've even submitted some source code
> improvements. However, we have a problem with false negatives. We have
> 2 remote servers we monitor in which the SSH service is being reported
> as down, while the host is being reported as up. Both of these servers
> are operating properly. In addition, the host check command and service
> check commands are identical. We have 40 other servers which operate in
> exactly the same way, yet the SSH service on these 2 is being reported
> as down. Exact same versions of SSH on all servers also. The only
> difference is location. =20
>
>
>
> -------------------------------------------------------
> SF.Net email is Sponsored by the Better Software Conference & EXPO
> September 19-22, 2005 * San Francisco, CA * Development Lifecycle
> Practices Agile & Plan-Driven Development * Managing Projects & Teams *
> Testing & QA Security * Process Improvement & Measurement *
> http://www.sqe.com/bsce5sf
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when
> reporting any issue.=20
> ::: Messages without supporting info will risk being sent to /dev/null
>
>
>
> --__--__--
>
> Message: 4
> Reply-To: <djohnson at jsatech.com>
> From: "David Johnson" <djohnson at jsatech.com>
> To: "Hosey, Chester" <Chester.Hosey at gianteagle.com>,
> <nagios-users at lists.sourceforge.net>
> Subject: RE: [Nagios-users] False Negatives
> Date: Thu, 11 Aug 2005 10:36:33 -0500
>
> The manual running of the plugins succeeds every time. The nagios =
> running fails every time. I must say that I get two different failures.
>
> 1. One falsely failing server returns: "Server answer:"
> 2. The other falsely failing server returns a CHECK_NRPE timeout of 10 =
> seconds (which is odd because I set the timeout to 30 and did a hard =
> restart). =20
>
>
> -----Original Message-----
> From: nagios-users-admin at lists.sourceforge.net
> [mailto:nagios-users-admin at lists.sourceforge.net]On Behalf Of Hosey,
> Chester
> Sent: Thursday, August 11, 2005 10:23 AM
> To: djohnson at jsatech.com; nagios-users at lists.sourceforge.net
> Subject: RE: [Nagios-users] False Negatives
>
>
> Have you run the service check plugins manually to see what it returns?
> Is it just an occasional failure?
>
> -----Original Message-----
> From: nagios-users-admin at lists.sourceforge.net
> [mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of David
> Johnson
> Sent: Thursday, August 11, 2005 11:20 AM
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] False Negatives
>
>
> Long time user, first time emailer...
>
> We have been loving nagios and we've even submitted some source code
> improvements. However, we have a problem with false negatives. We have
> 2 remote servers we monitor in which the SSH service is being reported
> as down, while the host is being reported as up. Both of these servers
> are operating properly. In addition, the host check command and service
> check commands are identical. We have 40 other servers which operate in
> exactly the same way, yet the SSH service on these 2 is being reported
> as down. Exact same versions of SSH on all servers also. The only
> difference is location. =20
>
>
>
> -------------------------------------------------------
> SF.Net email is Sponsored by the Better Software Conference & EXPO
> September 19-22, 2005 * San Francisco, CA * Development Lifecycle
> Practices Agile & Plan-Driven Development * Managing Projects & Teams *
> Testing & QA Security * Process Improvement & Measurement *
> http://www.sqe.com/bsce5sf
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when
> reporting any issue.=20
> ::: Messages without supporting info will risk being sent to /dev/null
>
>
> -------------------------------------------------------
> SF.Net email is Sponsored by the Better Software Conference & EXPO
> September 19-22, 2005 * San Francisco, CA * Development Lifecycle =
> Practices
> Agile & Plan-Driven Development * Managing Projects & Teams * Testing & =
> QA
> Security * Process Improvement & Measurement * =
> http://www.sqe.com/bsce5sf
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when =
> reporting any issue.=20
> ::: Messages without supporting info will risk being sent to /dev/null
>
>
>
> --__--__--
>
> Message: 5
> Date: Thu, 11 Aug 2005 10:36:42 -0500
> From: John McGowan <mcgowan at lynch2.com>
> To: Nagios Users List <nagios-users at lists.sourceforge.net>
> Subject: [Nagios-users] Service Went Down, No notification sent...
>
> I'm not sure exactly where to start on this.... last night i checked on
> my services and noticed a service that was down and had been down for
> about 20 minutes. No notification was ever sent out... this is what I
> saw in the log...
>
> [08-10-2005 22:07:36] SERVICE ALERT: tessweb;Tessitura
> SeatServer;CRITICAL;HARD;1;CRITICAL - Socket timeout after 10 seconds
>
> When I ran a test this morning this is what I saw in the event log.
>
> [08-11-2005 10:17:39] SERVICE ALERT: tessweb;Tessitura
> SeatServer;CRITICAL;SOFT;1;No route to host
> [08-11-2005 10:18:08] SERVICE ALERT: tessweb;Tessitura
> SeatServer;CRITICAL;SOFT;2;No route to host
> [08-11-2005 10:18:38] SERVICE ALERT: tessweb;Tessitura
> SeatServer;CRITICAL;SOFT;3;No route to host
> [08-11-2005 10:19:08] SERVICE ALERT: tessweb;Tessitura
> SeatServer;CRITICAL;SOFT;4;No route to host
> [08-11-2005 10:19:38] SERVICE ALERT: tessweb;Tessitura
> SeatServer;CRITICAL;HARD;5;No route to host
>
> The first thing that stood out when I saw it was the fact that the max
> check attempts didn't seem to make a difference last night... the
> service went critical hard on the first failure...
>
> FYI: the particular thing that caused this failure last night was that
> the VPN between here and there was down.
>
> the service definition is listed below... with it's template
>
>
>
> define service{
> name generic-service ; The 'name' of this service
> template, referenced in other service definitions
> active_checks_enabled 1 ; Active service checks are enabled
> passive_checks_enabled 1 ; Passive service checks are
> enabled/accepted
> parallelize_check 1 ; Active service checks should be
> parallelized (disabling this can lead to major performance problems)
> obsess_over_service 1 ; We should obsess over this service
> (if necessary)
> check_freshness 0 ; Default is to NOT check service
> 'freshness'
> notifications_enabled 1 ; Service notifications are enabled
> event_handler_enabled 1 ; Service event handler is enabled
> flap_detection_enabled 1 ; Flap detection is enabled
> process_perf_data 1 ; Process performance data
> retain_status_information 1 ; Retain status information across
> program restarts
> retain_nonstatus_information 1 ; Retain non-status information
> across program restarts
> is_volatile 0
>
> check_period 24x7
> notification_period 24x7
> notification_interval 120
> notification_options w,u,c,r
> contact_groups admins
> max_check_attempts 5
> normal_check_interval 60
> retry_check_interval 30
>
>
> register 0 ; DONT REGISTER THIS DEFINITION - ITS NOT A
> REAL SERVICE, JUST A TEMPLATE!
> }
>
> # Service definition
> define service{
> use generic-service ; Name of service template
> to use
> contact_groups csoadmins
> host_name tessweb
> service_description Tessitura SeatServer
> check_command
> check_http_site2_ssl!tessweb.cso.org!true!/Tessitura.asmx/WebSeatServerListening
> }
>
>
>
>
> --__--__--
>
> Message: 6
> Subject: RE: [Nagios-users] False Negatives
> Date: Thu, 11 Aug 2005 10:52:27 -0500
> From: "Marc Powell" <marc at ena.com>
> To: <nagios-users at lists.sourceforge.net>
>
>
>
> > -----Original Message-----
> > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-
> > admin at lists.sourceforge.net] On Behalf Of David Johnson
> > Sent: Thursday, August 11, 2005 10:37 AM
> > To: Hosey, Chester; nagios-users at lists.sourceforge.net
> > Subject: RE: [Nagios-users] False Negatives
> >=20
> > The manual running of the plugins succeeds every time. The nagios
> running
> > fails every time. I must say that I get two different failures.
> >=20
> > 1. One falsely failing server returns: "Server answer:"
>
> Looking at the src for check_ssh (which I assume you are using), it
> makes a connection on the specified port and looks for "SSH" as the
> first 3 characters in the first line of output. If it doesn't find it
> then it prints "Server answer:" then the text that it did see. In your
> case it looks like it's either getting a blank line or nothing at all.
> I'd try that test manually using telnet several times in a row just to
> make sure that you're seeing the expected output. Perhaps also turn up
> logging for your sshd. Maybe there's some indication of the problem
> there.
>
> > 2. The other falsely failing server returns a CHECK_NRPE timeout of 10
> > seconds (which is odd because I set the timeout to 30 and did a hard
> > restart).
>
> There are two places where the timeout could be coming from. Each plugin
> may have their own timeout value (usually -t) and nagios has a master
> timeout value in nagios.cfg (service_check_timeout). The master timeout
> will always override the plugin timeout value if it's shorter (since
> nagios has no idea what the plugin timeout might be). If -t=3D45 for a
> plugin but service_check_timeout is 10 then you'll still only get 10
> seconds.
>
> --
> Marc
>
>
> --__--__--
>
> Message: 7
> Date: Thu, 11 Aug 2005 17:58:14 +0200
> From: Andreas Ericsson <ae at op5.se>
> To: Nagios Users List <nagios-users at lists.sourceforge.net>
> Subject: Re: [Nagios-users] Service Went Down, No notification sent...
>
> John McGowan wrote:
> > I'm not sure exactly where to start on this.... last night i checked on
> > my services and noticed a service that was down and had been down for
> > about 20 minutes. No notification was ever sent out... this is what I
> > saw in the log...
> >
> > [08-10-2005 22:07:36] SERVICE ALERT: tessweb;Tessitura
> > SeatServer;CRITICAL;HARD;1;CRITICAL - Socket timeout after 10 seconds
> >
> > When I ran a test this morning this is what I saw in the event log.
> >
> > The first thing that stood out when I saw it was the fact that the max
> > check attempts didn't seem to make a difference last night... the
> > service went critical hard on the first failure...
> >
>
> This is because the host was down as well. Service checks aren't
> reported if the host is down, and they're set to HARD immediately to
> save the added strain of having to re-check them frequently.
>
> > FYI: the particular thing that caused this failure last night was that
> > the VPN between here and there was down.
> >
>
> So the hostcheck couldn't possibly have succeeded then, and the host was
> most likely in UNREACHABLE state (if you monitor the VPN tunnel in both
> ends and have parents set up properly).
>
> What notifications *did* you get? An unreachable for the VPN tunnel?
>
> --
> Andreas Ericsson andreas.ericsson at op5.se
> OP5 AB www.op5.se
> Lead Developer
>
>
> --__--__--
>
> Message: 8
> Subject: Re: [Nagios-users] why can't a servicegroup have no members?
> From: Kevin Hanser <kevin at mica.net>
> To: nagios-users at lists.sourceforge.net,
> "Chester R. Hosey" <Chester.Hosey at gianteagle.com>
> Date: Thu, 11 Aug 2005 13:52:30 -0400
>
> Hmmm... went back and tried it, and sure enough it's working for me. I
> think it was complaining before because I had a servicegroup with no
> members, and also didn't have any services defined with the
> "servicegroups" directive. Looks like one or the other is required..
>
> I was confused because I created my servicegroup first (w/out any
> members) and then did a config check. At that point it complained that
> the servicegroup had no members and wouldn't go. Apparently though
> after I've added the "servicegroup" directive to some services, it's no
> longer complaining about the "members" being missing.
>
> So I guess I need one or the other, I was only getting an error because
> I had just created the servicegroup and not added anything to it yet.
>
> thx!
>
> k
>
> On Wed, 2005-08-10 at 17:28 -0400, Chester R. Hosey wrote:
> > Using 2.0b4 also:
> >
> > define service{
> > use nrpe-load-check
> > host_name chet-linux-desktop
> > servicegroups test
> > }
> >
> > define servicegroup{
> > servicegroup_name test
> > alias Sample Group
> > }
> >
> >
> > /usr/bin/nagios -v /etc/nagios/nagios.cfg liked it.
> >
> > Chet
> >
> >
> > On Wed, 2005-08-10 at 17:14 -0400, Kevin Hanser wrote:
> > > I am running Nagios 2.0b4, and I'm trying to create some servicegroups
> > > for nice display and whatnot.
> > >
> > > I can get them to work fine, as long as I specify at least one member
> in
> > > the servicegroups definition... However, I'd like to create a
> > > servicegroup for "router interfaces", and I don't really want to type
> in
> > > every description for every router interface I have on every router,
> > > when I've already put it into the service definition. I see that I can
> > > specify servicegroups in the service definition and that would be
> > > perfect for me, except for one thing: nagios won't let me create a
> > > servicegroup w/out any members.
> > >
> > > I would like to create the servicegroup w/no members in its definition,
> > > and then use the "servicegroups" directive w/in the service definition
> > > to make each interface a member of the servicegroup. So I guess my
> > > question is... Why is the "members" directive required when defining a
> > > servicegroup?
> > >
> > > Thx!
> > >
> > > k
> > >
> > >
> > > -------------------------------------------------------
> > > SF.Net email is Sponsored by the Better Software Conference & EXPO
> > > September 19-22, 2005 * San Francisco, CA * Development Lifecycle
> Practices
> > > Agile & Plan-Driven Development * Managing Projects & Teams * Testing &
> QA
> > > Security * Process Improvement & Measurement *
> http://www.sqe.com/bsce5sf
> > > _______________________________________________
> > > Nagios-users mailing list
> > > Nagios-users at lists.sourceforge.net
> > > https://lists.sourceforge.net/lists/listinfo/nagios-users
> > > ::: Please include Nagios version, plugin version (-v) and OS when
> reporting any issue.
> > > ::: Messages without supporting info will risk being sent to /dev/null
>
>
> --__--__--
>
> Message: 9
> Subject: RE: [Nagios-users] why can't a servicegroup have no members?
> Date: Thu, 11 Aug 2005 13:08:39 -0500
> From: "Marc Powell" <marc at ena.com>
> To: <nagios-users at lists.sourceforge.net>
>
>
>
> > -----Original Message-----
> > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-
> > admin at lists.sourceforge.net] On Behalf Of Kevin Hanser
> > Sent: Thursday, August 11, 2005 12:53 PM
> > To: nagios-users at lists.sourceforge.net; Chester R. Hosey
> > Subject: Re: [Nagios-users] why can't a servicegroup have no members?
> >=20
> > Hmmm... went back and tried it, and sure enough it's working for me.
> I
> > think it was complaining before because I had a servicegroup with no
> > members, and also didn't have any services defined with the
> > "servicegroups" directive. Looks like one or the other is required..
> >=20
>
> Which makes sense.
>
> > I was confused because I created my servicegroup first (w/out any
> > members) and then did a config check. At that point it complained
> that
> > the servicegroup had no members and wouldn't go. Apparently though
> > after I've added the "servicegroup" directive to some services, it's
> no
> > longer complaining about the "members" being missing.
>
> Because it has members now, a 'member' indicating a service that is part
> of the group, not specifically the 'members' directive for the
> definition. The terminology may be slightly confusing but the concept
> shouldn't be. A servicegroup (or any group) has to have members (e.g.
> services in this case) associated with it, either through the
> servicegroup members definition or through the service servicegroups
> definition. A group with no members is meaningless.
>
> --
> Marc=20
>
>
> --__--__--
>
> Message: 10
> Date: Thu, 11 Aug 2005 14:19:43 -0500
> From: John McGowan <mcgowan at lynch2.com>
> To: Nagios Users List <nagios-users at lists.sourceforge.net>
> Subject: Re: [Nagios-users] Service Went Down, No notification sent...
>
>
>
> Andreas Ericsson wrote:
>
> >> This is because the host was down as well. Service checks aren't
> >> reported if the host is down, and they're set to HARD immediately to
> >> save the added strain of having to re-check them frequently.
> >
> Ah that makes perfect sense... My problem is that I've been using hosts
> as more of an organizational tool, just grouping services together.
>
> > What notifications *did* you get? An unreachable for the VPN tunnel?
>
> No notification at all, because the host wasn't setup properly
>
>
> --__--__--
>
> Message: 11
> Date: Thu, 11 Aug 2005 14:15:50 -0700 (PDT)
> From: naguser at bhodisoft.com
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] missing var/objects.cache
>
> I'm setting up the web interfaces on a pair of existing distributed nodes.
> On one of them, I get this whoops message "Error: Could not read object
> configuration data!".
>
> I compared the two nodes and on the one that is giving me the error, I'm
> missing a var/objects.cache file. The nodes have the same
> object_cache_file setting in etc/nagios.cfg and grep isn't turning up any
> other related settings that I can see.
>
> Am I right in suspecting that this missing file is the most likely source
> of the problem? If so, how do I create that file? The error message says
> that I should "make sure you've compiled the main program and the CGIs to
> use the same object data storage options", but to the best of my knowledge
> there was no difference between the two installs.
>
> I've tried just 'touch'ing it and reloading nagios, but no luck with that.
> It is still empty and I'm still getting the error.
>
> Thanks,
>
> -G_E
>
>
>
> --__--__--
>
> Message: 12
> Subject: RE: [Nagios-users] missing var/objects.cache
> Date: Thu, 11 Aug 2005 16:45:30 -0500
> From: "Marc Powell" <marc at ena.com>
> To: <nagios-users at lists.sourceforge.net>
>
>
>
> > -----Original Message-----
> > From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-
> > admin at lists.sourceforge.net] On Behalf Of naguser at bhodisoft.com
> > Sent: Thursday, August 11, 2005 4:16 PM
> > To: nagios-users at lists.sourceforge.net
> > Subject: [Nagios-users] missing var/objects.cache
> >=20
> > I'm setting up the web interfaces on a pair of existing distributed
> nodes.
> > On one of them, I get this whoops message "Error: Could not read
> object
> > configuration data!".
> >=20
> > I compared the two nodes and on the one that is giving me the error,
> I'm
> > missing a var/objects.cache file. The nodes have the same
> > object_cache_file setting in etc/nagios.cfg and grep isn't turning up
> any
> > other related settings that I can see.
> >=20
> > Am I right in suspecting that this missing file is the most likely
> source
>
> Yes.
>
> > of the problem? If so, how do I create that file? The error message
> says
>
> You can't. Nagios creates it when the daemon is started. It's an
> optimized version of all your config files. Is nagios running? Does it
> have permissions to write to that directory?
>
> --
> Marc
>
>
> --__--__--
>
> Message: 13
> Date: Thu, 11 Aug 2005 15:09:09 -0700 (PDT)
> Subject: RE: [Nagios-users] missing var/objects.cache
> From: naguser at bhodisoft.com
> To: nagios-users at lists.sourceforge.net
>
> >> of the problem? If so, how do I create that file? The error message
> > says
> >
> > You can't. Nagios creates it when the daemon is started. It's an
> > optimized version of all your config files. Is nagios running? Does it
> > have permissions to write to that directory?
>
> Thanks for the quick reply.
>
> Turns out there WAS a difference on the object_cache_file setting after
> all; I just wasn't seeing it. My bad. For some reason it was actually
> pointing to var/nagios/objects.cache and the nagios subdirectory didn't
> exist. I fixed the cfg file and its all good now.
>
> Don't mind me, its just one of those days.
>
> -G_E
>
>
>
> --__--__--
>
> Message: 14
> Date: Thu, 11 Aug 2005 16:01:24 -0700
> From: Michael J McCafferty <mike at m5computersecurity.com>
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] Disable pager alerts for Acknowledgements ?
>
> All,
>
> We have some alerts go to a pager that is in the NOC, for audible alerts
> (can't seem to get audible alerts to work with passive service checks, but
> audible alerts work great with active service checks and active host
> checks).
> My NOC people asked if I can get the alerts to only get send to the pager
> when
> something goes down, not once each for down/critical, acknowledgement and
> up/recovery.
> I notice the scripts in misccommands.cfg use the variable
> $NOTIFICATIONTYPE$
> included in the alert. I suppose I could write some kind of wrapper that
> gets
> called instead of going straight to /bin/mail, but I was hoping there was a
> better (faster, easier, more direct) way.
>
> Thoughts ?
>
> Thanks,
> Mike
>
> --
> ************************************************************
> Michael J. McCafferty
> Principal, Security Engineer
> M5 Hosting
> 858-576-7325 Voice
> http://www.m5hosting.com
> ************************************************************
>
>
>
> --__--__--
>
> Message: 15
> Date: Thu, 11 Aug 2005 16:02:02 -0700
> From: Marc Martinez <lastxit at gmail.com>
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] external commands audit trail
>
> I'm hoping to get some suggestions about how to get an audit trail for
> external command submissions from the cgi interface..
>
> all the google and list archive digging I've done has only turned up
> permissions problems and the like, so if there is an existing resource
> to address this please send me on my way..
>
> the crux of the problem though is that while cmd.cgi does verify the
> auth credentials, the username is only recorded for comment
> submissions..
>
> what's the best way to approach this?
>
> thanks,
> Marc
>
>
> --__--__--
>
> Message: 16
> Date: Fri, 12 Aug 2005 03:57:06 +0200 (MEST)
> From: "Brian Murphy" <brian.murphy at gmx.net>
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] =?ISO-8859-1?Q?Multiple_orphaned_nagios_processes?=
>
> Hi There
>
> Running nagios is a distributed setup, 2 systems carrying out active checks
> and sending results to central display node using nsca.
>
> What we are seeing is that we end up with hundreds of nagios processes on
> the central node, enough to grind it to a stop. Over 2000 checks are being
> carried out at the checking nodes.
>
> We set the service_reaper_frequency to 3 on both the central and the
> chicking systems and still have the problem.
>
> We had this problem on 2.0b3 and still have it on 2.0b4
>
> I am suspecting that the processes are spawned off to process the passive
> checks, but collide writing into the pipe back to nagios (or it is full and
> EAGAIN) and end up eventually just orphaned out there.
>
> How many checks per second should nagios be able to process? We seem to be
> writing to the logfile at 100/sec sometimes.
>
> Do I just need to slow things down somehow? bigger poll cycles with the
> checks...
>
> We are running a perfdata command and an ocsp_command on the central host,
> processes forked by these do not seem to be the problem.
>
>
> Config file bits below
>
> status_file=/usr/local/nagios/var/status.dat
> nagios_user=nagios
> nagios_group=nagios
> check_external_commands=1
> command_check_interval=-1
> command_file=/usr/local/nagios/var/rw/nagios.cmd
> comment_file=/usr/local/nagios/var/comments.dat
> downtime_file=/usr/local/nagios/var/downtime.dat
> lock_file=/usr/local/nagios/var/nagios.lock
> temp_file=/usr/local/nagios/var/nagios.tmp
> event_broker_options=-1
> log_rotation_method=d
> log_archive_path=/usr/local/nagios/var/archives
> use_syslog=0
> log_notifications=1
> log_service_retries=1
> log_host_retries=1
> log_event_handlers=1
> log_initial_states=0
> log_external_commands=1
> log_passive_checks=1
> service_inter_check_delay_method=s
> max_service_check_spread=30
> service_interleave_factor=s
> host_inter_check_delay_method=s
> max_host_check_spread=30
> max_concurrent_checks=0
> service_reaper_frequency=3
> auto_reschedule_checks=0
> auto_rescheduling_interval=30
> auto_rescheduling_window=180
> sleep_time=0.25
> service_check_timeout=60
> host_check_timeout=30
> event_handler_timeout=30
> notification_timeout=30
> ocsp_timeout=5
> perfdata_timeout=5
> retain_state_information=1
> state_retention_file=/usr/local/nagios/var/retention.dat
> retention_update_interval=60
> use_retained_program_state=1
> use_retained_scheduling_info=0
> interval_length=60
> use_aggressive_host_checking=0
> execute_service_checks=1
> accept_passive_service_checks=1
> execute_host_checks=1
> accept_passive_host_checks=1
> enable_notifications=1
> enable_event_handlers=1
> process_performance_data=1
> service_perfdata_command=process-service-perfdata
> obsess_over_services=1
> ocsp_command=nagios-data-logger
> check_for_orphaned_services=0
> check_service_freshness=1
> service_freshness_check_interval=60
> check_host_freshness=0
> host_freshness_check_interval=60
> aggregate_status_updates=1
> status_update_interval=15
> enable_flap_detection=0
> low_service_flap_threshold=5.0
> high_service_flap_threshold=20.0
> low_host_flap_threshold=5.0
> high_host_flap_threshold=20.0
> date_format=us
> p1_file=/usr/local/nagios/bin/p1.pl
> illegal_object_name_chars=`~!$%^&*|'"<>?,()=
> illegal_macro_output_chars=`~$&|'"<>
> use_regexp_matching=0
> use_true_regexp_matching=0
> admin_email=nagios
> admin_pager=pagenagios
> daemon_dumps_core=1
>
> Any suggestions appreciated
>
> Thanx
>
> Brian
>
> --
> 5 GB Mailbox, 50 FreeSMS http://www.gmx.net/de/go/promail
> +++ GMX - die erste Adresse f�r Mail, Message, More +++
>
>
>
> --__--__--
>
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
>
>
> End of Nagios-users Digest
>
More information about the Users
mailing list