Nagios-users Digest, Vol 15, Issue 5
Noel Dave
profoundmove at gmail.com
Wed Aug 8 06:16:50 CEST 2007
On 8/4/07, nagios-users-request at lists.sourceforge.net <
nagios-users-request at lists.sourceforge.net> wrote:
>
> Send Nagios-users mailing list submissions to
> nagios-users at lists.sourceforge.net
>
> To subscribe or unsubscribe via the World Wide Web, visit
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> or, via email, send a message with subject or body 'help' to
> nagios-users-request at lists.sourceforge.net
>
> You can reach the person managing the list at
> nagios-users-owner at lists.sourceforge.net
>
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of Nagios-users digest..."
>
>
> Today's Topics:
>
> 1. Re: EMC Symetrix monitoring (orzeh)
> 2. Nagios and NSClient++ memory leak issue (Travis Hansen)
> 3. check_mysql -S isn't working (alexus)
> 4. Re: check_mysql -S isn't working (Peter Hinse)
> 5. Last Check time stamp incorrect or not updating properly?
> (Brady Maxwell)
> 6. Re: check_mysql -S isn't working {Disarmed} {Fraud?} (Marc Powell)
> 7. Re: check_mysql -S isn't working {Disarmed} {Fraud?} (alexus)
> 8. Re: check_mysql -S isn't working (alexus)
> 9. Re: check_mysql -S isn't working (Marc Powell)
> 10. SQL 2005 (Matthew Joyce)
> 11. Timeout Limits (Tom Ray [Lists])
> 12. Hierarchical representation in hostgroups. (Lalita Drolia)
> 13. Request for Feedback: Nagios Plugin (Kevin Menard)
> 14. Re: Request for Feedback: Nagios Plugin (Kevin Menard)
> 15. Re: Request for Feedback: Nagios Plugin (Kevin Menard)
> 16. Re: Hierarchical representation in hostgroups. (Ton Voon)
> 17. Re: Hierarchical representation in hostgroups. (Lalita Drolia)
> 18. Re: Timeout Limits (Marc Powell)
> 19. Re: Hierarchical representation in hostgroups. (Marc Powell)
> 20. Re: cgi's not working (Cassandra Pugh)
> 21. cgi issues (Steve Gregory)
> 22. Scaling a Nagios Server (Frost, Mark {PBG})
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Thu, 02 Aug 2007 21:15:57 +0200
> From: orzeh <orz3h at tlen.pl>
> Subject: Re: [Nagios-users] EMC Symetrix monitoring
> To: David Schlecht <dgsconsulting at gmail.com>
> Cc: nagios <nagios-users at lists.sourceforge.net>
> Message-ID: <46B22D6D.4010408 at tlen.pl>
> Content-Type: text/plain; charset=ISO-8859-1; format=flowed
>
> David Schlecht wrote:
> > Thanks for your reply, orzeh.
> no problem
>
> >
> > Can you give me a little more info?
>
> sure
>
> > It looks like you're getting SNMP traps from EMC that your Navisphere
> > is processing. Is that right?
>
> no exactly, snmp traps are only configured in navisphere, i'm not
> touching sps in lower level. Traps are transmitted from navisphere.
>
> >
> > If so, how do you get EMC to use the standard SNMP ports?
>
> It can be configured in navisphere monitors templates (port, host)
>
> > Our ECC console only uses non-standard ports and requires lots of
> > handshaking. Does Navisphere have an interface library so it knows how
> > to deal with EMC?
>
> i've thought the navisphere is part of ecc, for sure it can work with
> ecc. I'm not using ecc on my san
> environment so I'm not good help.
>
> > Thanks again.
> > -David
>
> please inform me about your progress!
> orzeh
>
>
>
>
> ------------------------------
>
> Message: 2
> Date: Thu, 2 Aug 2007 15:22:25 -0400
> From: "Travis Hansen" <thansen at plurisinc.com>
> Subject: [Nagios-users] Nagios and NSClient++ memory leak issue
> To: <nagios-users at lists.sourceforge.net>
> Message-ID:
> <DD9066A44BF3734EB356B65DA6E2DC069FA4C6 at pluris-fs2.plurisinc.com>
> Content-Type: text/plain; charset="us-ascii"
>
> Greetings,
>
> We have been using Nagios for a while now and have had
> NSClient installed on several boxes with Windows 2003 SP1. One of the
> boxes had an issue with NSClient that involved a memory leak that
> consistently brought the server down after approximately 2 days. We
> believe the issue stemmed from an error message that is generated every
> ~ 70 seconds.
>
>
>
> I updated the client to NSClient++ in hopes that this would
> fix the issue, but it appears that the issue remains. The monitoring
> still works (which proves we have two way communication), but the error
> is created nonetheless. The log file reports this error approximately
> every 70 seconds:
>
>
>
> 2007-08-02 15:05:12: debug:.\PDHCollector.cpp:101: Detected language:
> English US (0x0409)
>
> 2007-08-02 15:05:12: debug:.\NSClient++.cpp:305: Loading plugin: NRPE
> server...
>
> 2007-08-02 15:05:12: error:.\PDHCollector.cpp:119: Attempting to open
> counter...
>
> 2007-08-02 15:05:12: error:.\PDHCollector.cpp:122: Counters opend...
>
> 2007-08-02 15:05:12: debug:.\NSClient++.cpp:305: Loading plugin:
> NSClient server...
>
> 2007-08-02 15:05:12: debug:c:\source\nscp\trunk\include\Socket.h:515:
> Bound to: 0.0.0.0:1248
>
> 2007-08-02 15:05:12: debug:c:\source\nscp\trunk\include\Socket.h:515:
> Bound to: 0.0.0.0:5666
>
> 2007-08-02 15:05:12: debug:c:\source\nscp\trunk\include\Socket.h:521:
> Socket ready...
>
> 2007-08-02 15:05:12: debug:c:\source\nscp\trunk\include\Socket.h:521:
> Socket ready...
>
> 2007-08-02 15:05:36: debug:.\NSClientListener.cpp:141: Data: GIOP
>
> 2007-08-02 15:05:36: error:.\NSClientListener.cpp:155: Invalid password
> (GIOP
>
> 2007-08-02 15:06:45: debug:.\NSClientListener.cpp:141: Data: GIOP
>
> 2007-08-02 15:06:45: error:.\NSClientListener.cpp:155: Invalid password
> (GIOP
>
>
>
> The NSClientListener is reporting an invalid password, but we do not
> have a password set in our cfg file and we are not passing it from the
> Nagios server. Anyone seen this issue before?
>
>
>
> Thanks
>
> Hi,
Can you please give us which version of NSClient++ you're using.?
Thanks in Advance
-------------- next part --------------
> An HTML attachment was scrubbed...
>
> ------------------------------
>
> Message: 3
> Date: Thu, 2 Aug 2007 15:24:14 -0400
> From: alexus <alexus at gmail.com>
> Subject: [Nagios-users] check_mysql -S isn't working
> To: Nagios-users at lists.sourceforge.net
> Message-ID:
> <6ae50c2d0708021224h11ea6691lb034a6b96116b424 at mail.gmail.com>
> Content-Type: text/plain; charset="iso-8859-1"
>
> # ./check_mysql -H 10.52.208.99 -u xxxx -p xxxxxxxxx -S
> Slave IO: Yes Slave SQL: No Seconds Behind Master: (null)
> #
>
> how come I get "null" in "No Seconds Behind Master?
> anyone?
>
>
> --
> http://alexus.org/
> -------------- next part --------------
> An HTML attachment was scrubbed...
>
> ------------------------------
>
> Message: 4
> Date: Thu, 02 Aug 2007 21:35:44 +0200
> From: Peter Hinse <loco at d0pefish.de>
> Subject: Re: [Nagios-users] check_mysql -S isn't working
> To: alexus <alexus at gmail.com>
> Cc: Nagios-users at lists.sourceforge.net
> Message-ID: <46B23210.9040003 at d0pefish.de>
> Content-Type: text/plain; charset=ISO-8859-1
>
> alexus wrote:
> > # ./check_mysql -H 10.52.208.99 <http://10.52.208.99> -u xxxx
> > -p xxxxxxxxx -S
> > Slave IO: Yes Slave SQL: No Seconds Behind Master: (null)
> > #
> >
> > how come I get "null" in "No Seconds Behind Master?
> > anyone?
>
> Read it as
>
> Slave IO: Yes
> Slave SQL: No
> Seconds Behind Master: (null)
>
> Your replication does not seem to work. Try a
>
> slave stop ;
> slave start ;
> show slave status ;
>
> on your mysql console.
>
> Regards,
>
> Peter
>
>
>
> ------------------------------
>
> Message: 5
> Date: Thu, 2 Aug 2007 15:41:08 -0400
> From: "Brady Maxwell" <brady.maxwell at gmail.com>
> Subject: [Nagios-users] Last Check time stamp incorrect or not
> updating properly?
> To: Nagios-users at lists.sourceforge.net
> Message-ID:
> <e755ea1b0708021241tfa66ae3n632487912f2faa3f at mail.gmail.com>
> Content-Type: text/plain; charset="iso-8859-1"
>
> So when I look at the web interface for host status detail, I can see that
> the last check column has time values that are in correct. Many of the
> services will be accurate however some services will lag behind by several
> hours or even a day. However if I click on the service that has the
> incorrect time stamp and go to the details for that service I see that the
> last check time is wrong but the last update time is correct.
> Also I submit all the results on this server to another server with nsca,
> and the time stamp on the passive or central server might be more up to
> date
> than the time stamp on the sever doing the active check. However many of
> the
> time stamps still lag behind by an hour or more on the central server,
> even
> though they are sometimes ahead of the distributed servers checks. Again
> the
> last update time on the service status page seem to be accurate to with n
> 5
> minutes. I have all checks set to 5 minute time frames so it seems to me
> that the Last Check time stamp should always be with in the last 5
> minutes.
> Watching the logs has convinced me that the checks are actually happening
> and getting sent to the central server in the 5 minute time frame.
>
>
> Has anyone experienced this before? Anyone know why this happens?
>
>
> --
> -- Brady Maxwell --
> Biter of Dogs Ears
> Systems Administrator
> Garbage Man
> Bouncer
> Paratrooper
> -------------- next part --------------
> An HTML attachment was scrubbed...
>
> ------------------------------
>
> Message: 6
> Date: Thu, 2 Aug 2007 14:42:31 -0500
> From: "Marc Powell" <marc at ena.com>
> Subject: Re: [Nagios-users] check_mysql -S isn't working {Disarmed}
> {Fraud?}
> To: <Nagios-users at lists.sourceforge.net>
> Message-ID: <A7B0A9F02975A74A845FE85D0B95B8FA08276B02 at misex01.ena.com>
> Content-Type: text/plain; charset="US-ASCII"
>
>
>
> > -----Original Message-----
> > From: nagios-users-bounces at lists.sourceforge.net [mailto:nagios-users-
> > bounces at lists.sourceforge.net] On Behalf Of alexus
> > Sent: Thursday, August 02, 2007 2:24 PM
> > To: Nagios-users at lists.sourceforge.net
> > Subject: [Nagios-users] check_mysql -S isn't working {Disarmed}
> {Fraud?}
> >
> > # ./check_mysql -H 10.52.208.9910. <http://10.52.208.99> -u xxxx -p
> > xxxxxxxxx -S
> > Slave IO: Yes Slave SQL: No Seconds Behind Master: (null)
> > #
> >
> > how come I get "null" in "No Seconds Behind Master?
> > anyone?
>
> Ok. I'll bite on the second time around...
>
> My GUESS- The Seconds_Behind_Master column of the of the mysql command
> 'show slave status' has the value 'null' or is empty. Connect to mysql,
> issue the 'show slave status;' command and verify if that's the case.
>
> Alternately, the version of mysql that you are using, whatever it is,
> changed that column such that the plugin can no longer parse it.
>
> Alternately, the slave is broken in some way.
>
> --
> Marc
>
>
>
> ------------------------------
>
> Message: 7
> Date: Thu, 2 Aug 2007 16:01:13 -0400
> From: alexus <alexus at gmail.com>
> Subject: Re: [Nagios-users] check_mysql -S isn't working {Disarmed}
> {Fraud?}
> To: "Marc Powell" <marc at ena.com>
> Cc: Nagios-users at lists.sourceforge.net
> Message-ID:
> <6ae50c2d0708021301t2403232cn517457ac7abbdfe1 at mail.gmail.com>
> Content-Type: text/plain; charset="iso-8859-1"
>
> ok, 'Seconds Behind Master' does have value of null, but
> where and what do i need to set, in order for it to function properly?
> i'm using mysql-5.0.45
>
>
>
> On 8/2/07, Marc Powell <marc at ena.com> wrote:
> >
> >
> >
> > > -----Original Message-----
> > > From: nagios-users-bounces at lists.sourceforge.net [mailto:nagios-users-
> > > bounces at lists.sourceforge.net] On Behalf Of alexus
> > > Sent: Thursday, August 02, 2007 2:24 PM
> > > To: Nagios-users at lists.sourceforge.net
> > > Subject: [Nagios-users] check_mysql -S isn't working {Disarmed}
> > {Fraud?}
> > >
> > > # ./check_mysql -H 10.52.208.9910. <http://10.52.208.99> -u xxxx -p
> > > xxxxxxxxx -S
> > > Slave IO: Yes Slave SQL: No Seconds Behind Master: (null)
> > > #
> > >
> > > how come I get "null" in "No Seconds Behind Master?
> > > anyone?
> >
> > Ok. I'll bite on the second time around...
> >
> > My GUESS- The Seconds_Behind_Master column of the of the mysql command
> > 'show slave status' has the value 'null' or is empty. Connect to mysql,
> > issue the 'show slave status;' command and verify if that's the case.
> >
> > Alternately, the version of mysql that you are using, whatever it is,
> > changed that column such that the plugin can no longer parse it.
> >
> > Alternately, the slave is broken in some way.
> >
> > --
> > Marc
> >
> >
> -------------------------------------------------------------------------
> > This SF.net email is sponsored by: Splunk Inc.
> > Still grepping through log files to find problems? Stop.
> > Now Search log events and configuration files using AJAX and a browser.
> > Download your FREE copy of Splunk now >> http://get.splunk.com/
> > _______________________________________________
> > Nagios-users mailing list
> > Nagios-users at lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/nagios-users
> > ::: Please include Nagios version, plugin version (-v) and OS when
> > reporting any issue.
> > ::: Messages without supporting info will risk being sent to /dev/null
> >
>
>
>
> --
> http://alexus.org/
> -------------- next part --------------
> An HTML attachment was scrubbed...
>
> ------------------------------
>
> Message: 8
> Date: Thu, 2 Aug 2007 16:01:56 -0400
> From: alexus <alexus at gmail.com>
> Subject: Re: [Nagios-users] check_mysql -S isn't working
> To: "Peter Hinse" <loco at d0pefish.de>
> Cc: Nagios-users at lists.sourceforge.net
> Message-ID:
> <6ae50c2d0708021301g2f36594ewcac65ba7603602a5 at mail.gmail.com>
> Content-Type: text/plain; charset="iso-8859-1"
>
> my replication does work...
>
> On 8/2/07, Peter Hinse <loco at d0pefish.de> wrote:
> >
> > alexus wrote:
> > > # ./check_mysql -H 10.52.208.99 <http://10.52.208.99> -u xxxx
> > > -p xxxxxxxxx -S
> > > Slave IO: Yes Slave SQL: No Seconds Behind Master: (null)
> > > #
> > >
> > > how come I get "null" in "No Seconds Behind Master?
> > > anyone?
> >
> > Read it as
> >
> > Slave IO: Yes
> > Slave SQL: No
> > Seconds Behind Master: (null)
> >
> > Your replication does not seem to work. Try a
> >
> > slave stop ;
> > slave start ;
> > show slave status ;
> >
> > on your mysql console.
> >
> > Regards,
> >
> > Peter
> >
>
>
>
> --
> http://alexus.org/
> -------------- next part --------------
> An HTML attachment was scrubbed...
>
> ------------------------------
>
> Message: 9
> Date: Thu, 2 Aug 2007 15:08:49 -0500
> From: "Marc Powell" <marc at ena.com>
> Subject: Re: [Nagios-users] check_mysql -S isn't working
> To: <Nagios-users at lists.sourceforge.net>
> Message-ID: <A7B0A9F02975A74A845FE85D0B95B8FA08276B03 at misex01.ena.com>
> Content-Type: text/plain; charset="US-ASCII"
>
>
>
> > -----Original Message-----
> > From: alexus [mailto:alexus at gmail.com]
> > Sent: Thursday, August 02, 2007 3:01 PM
> > To: Marc Powell
> > Cc: Nagios-users at lists.sourceforge.net
> > Subject: Re: [Nagios-users] check_mysql -S isn't working {Disarmed}
> > {Fraud?} {Disarmed} {Fraud?}
> >
> > ok, 'Seconds Behind Master' does have value of null, but
> > where and what do i need to set, in order for it to function properly?
> > i'm using mysql-5.0.45
>
> No clue here. The mysql-users support group will probably get you a
> faster, more accurate answer since it's outside the realm of nagios.
>
> --
> Marc
>
>
>
> ------------------------------
>
> Message: 10
> Date: Fri, 3 Aug 2007 09:41:04 +1000
> From: "Matthew Joyce" <MJoyce at ccia.unsw.edu.au>
> Subject: [Nagios-users] SQL 2005
> To: <nagios-users at lists.sourceforge.net>
> Message-ID:
> <2A67EA781EC7F949A2AB0A0D07A86C6A0286B93F at mail01.ccia.local>
> Content-Type: text/plain; charset="us-ascii"
>
>
> Hi all,
>
> Does anyone have method of determining is a database is up and
> accessible ?
>
> At the moment we check the server is up and services are running, but my
> availability reports don't show if staff can't login to a db.
> I'm thinking nrpe+wmi might be useful, or old fashioned vbs.
>
> Or perhaps SQL2005 have some snmp OIDs I can query ?
>
> Any ideas ?
>
> Matthew Joyce
> 02 9382 0051 | IT Manager | Children's Cancer Institute Australia for
> Medical Research
>
> -------------- next part --------------
> An HTML attachment was scrubbed...
>
> ------------------------------
>
> Message: 11
> Date: Fri, 03 Aug 2007 01:53:18 -0400
> From: "Tom Ray [Lists]" <lists at blazestudios.com>
> Subject: [Nagios-users] Timeout Limits
> To: nagios-users at lists.sourceforge.net
> Message-ID: <46B2C2CE.8030600 at blazestudios.com>
> Content-Type: text/plain; charset=ISO-8859-1; format=flowed
>
> How do I raise the timeout limits from 10 seconds on things like the
> smtp_check, httpd_check, etc?
>
> Thanks!
>
>
>
> ------------------------------
>
> Message: 12
> Date: Fri, 3 Aug 2007 00:08:25 -0700
> From: "Lalita Drolia" <ldrolia at bea.com>
> Subject: [Nagios-users] Hierarchical representation in hostgroups.
> To: <nagios-users at lists.sourceforge.net>
> Message-ID:
> <2402BA0C1F52594E8E271233F62EB6A903FC7530 at repbex02.amer.bea.com>
> Content-Type: text/plain; charset="us-ascii"
>
> Hi,
>
> I have configured nagios to monitor about 800 servers. I have made
> various hostgroups on the basis of operating systems, databases
> installed, teams using the machines etc.
>
> Now I want to view them in hostgroups on web interface in the form of a
> tree. For example, I want one broad category of operating systems, under
> that windows, linux solaris etc and again under windows, 2000 and 2003.
>
>
>
> Is it possible to have any such kind of view for hostgroups?
>
> Because right now it just shows a number of hostgroups in a table which
> is not very useful to me.
>
>
>
> Kindly help.
>
> Lalita
>
>
>
>
>
>
> Notice: This email message, together with any attachments, may contain
> information of BEA Systems, Inc., its subsidiaries and affiliated
> entities, that may be confidential, proprietary, copyrighted and/or
> legally privileged, and is intended solely for the use of the individual or
> entity named in this message. If you are not the intended recipient, and
> have received this message in error, please immediately return this by email
> and then delete it.
> -------------- next part --------------
> An HTML attachment was scrubbed...
>
> ------------------------------
>
> Message: 13
> Date: Fri, 3 Aug 2007 07:45:27 -0400
> From: "Kevin Menard" <kmenard at servprise.com>
> Subject: [Nagios-users] Request for Feedback: Nagios Plugin
> To: <nagios-users at lists.sourceforge.net>
> Message-ID:
> <384329B8D7108B45A3064FB38FCF267B0E1B10 at aristotle.servprise.office>
> Content-Type: text/plain; charset="US-ASCII"
>
> Hi all,
>
> I'm looking for some feedback on a plugin developed for our WebReboot
> Enterprise products. It won't require you to download anything, to
> install anything, or to purchase anything. The details on the plugin
> thus far can be found at: http://plato/nagios/index.html
>
> Really what I'm looking for is feedback on the overall design. Certain
> decisions had to be made with what constitutes a host check versus a
> service check or a host event handler versus a service one. Likewise,
> when a corrective action is taken is fairly subjective too. E.g., HARD
> only? Soft but after 3 attempts?
>
> I'm looking for help with a few of the following:
>
> 1) The password to the WebReboot Enterprise has to be stored somewhere.
> How would you like to see this done? For simplicity, it's specified as
> a command argument right now. Obviously, anyone with permissions to
> open your host and service configs can see it. One suggestion had been
> to push this off into a plugin configuration file and restrict access to
> that.
>
> 2) How would you like to see when plugins perform actions? Should it be
> configurable, yielding flexibility at the expense of usability? Should
> it just be hard-coded with notes on how to change it (it's ASL-licensed
> Python code, so not too hard)?
>
> 3) Event handlers are currently non-blocking. That is, if you choose to
> power on a host, the event handler issues the command to the WebReboot
> Enterprise and then returns control to Nagios. The WebReboot Enterprise
> then takes care of powering on the host. This means the script may
> return in seconds while it may take orders of magnitude longer for the
> host to power up. The consequence of this is that if your max attempts
> is too low or timing is too tight, host checks may continue to execute
> while the host is powering up and could put it into an erroneous HARD
> DOWN state.
>
> How should this be handled? Documentation for the user on changing
> timing & attempt values? Or should the event handler insert a
> configurable artificial delay? The downside of the latter being that it
> requires fine tuning for each host, as some boot to the OS quickly while
> others do not.
>
> Anyway, any help this list could provide would be great. Ultimately,
> the plugin is for Nagios users and we want to make sure we're building
> something that meets your expectations.
>
> --
> Kevin Menard
> Servprise International, Inc.
> 800.832.3823 x308
>
>
>
> ------------------------------
>
> Message: 14
> Date: Fri, 3 Aug 2007 08:03:58 -0400
> From: "Kevin Menard" <kmenard at servprise.com>
> Subject: Re: [Nagios-users] Request for Feedback: Nagios Plugin
> To: <sander at pictura-dp.nl>
> Cc: nagios-users at lists.sourceforge.net
> Message-ID:
> <384329B8D7108B45A3064FB38FCF267B0E1B11 at aristotle.servprise.office>
> Content-Type: text/plain; charset="US-ASCII"
>
> Argh. Guess I need to double-check my URLs.
>
> The correct one is: http://dev.servprise.com/nagios/
>
> Thanks for the correction.
>
> --
> Kevin
>
> > -----Original Message-----
> > From: Sander Klein [mailto:sander at pictura-dp.nl]
> > Sent: Friday, August 03, 2007 7:51 AM
> > To: Kevin Menard
> > Cc: nagios-users at lists.sourceforge.net
> > Subject: Re: [Nagios-users] Request for Feedback: Nagios Plugin
> >
> > Hi,
> >
> > looking at http://plato/nagios/index.html is kind of hard.
> >
> > Greets,
> >
> > Sander
> >
> >
> >
> > Kevin Menard wrote:
> > > Hi all,
> > >
> > > I'm looking for some feedback on a plugin developed for our
> WebReboot
> > > Enterprise products. It won't require you to download anything, to
> > > install anything, or to purchase anything. The details on the
> plugin
> > > thus far can be found at: http://plato/nagios/index.html
> > >
> > > Really what I'm looking for is feedback on the overall design.
> > Certain
> > > decisions had to be made with what constitutes a host check versus a
> > > service check or a host event handler versus a service one.
> > Likewise,
> > > when a corrective action is taken is fairly subjective too. E.g.,
> > HARD
> > > only? Soft but after 3 attempts?
> > >
> > > I'm looking for help with a few of the following:
> > >
> > > 1) The password to the WebReboot Enterprise has to be stored
> > somewhere.
> > > How would you like to see this done? For simplicity, it's specified
> > as
> > > a command argument right now. Obviously, anyone with permissions to
> > > open your host and service configs can see it. One suggestion had
> > been
> > > to push this off into a plugin configuration file and restrict
> access
> > to
> > > that.
> > >
> > > 2) How would you like to see when plugins perform actions? Should
> it
> > be
> > > configurable, yielding flexibility at the expense of usability?
> > Should
> > > it just be hard-coded with notes on how to change it (it's ASL-
> > licensed
> > > Python code, so not too hard)?
> > >
> > > 3) Event handlers are currently non-blocking. That is, if you
> choose
> > to
> > > power on a host, the event handler issues the command to the
> > WebReboot
> > > Enterprise and then returns control to Nagios. The WebReboot
> > Enterprise
> > > then takes care of powering on the host. This means the script may
> > > return in seconds while it may take orders of magnitude longer for
> > the
> > > host to power up. The consequence of this is that if your max
> > attempts
> > > is too low or timing is too tight, host checks may continue to
> > execute
> > > while the host is powering up and could put it into an erroneous
> HARD
> > > DOWN state.
> > >
> > > How should this be handled? Documentation for the user on changing
> > > timing & attempt values? Or should the event handler insert a
> > > configurable artificial delay? The downside of the latter being
> that
> > it
> > > requires fine tuning for each host, as some boot to the OS quickly
> > while
> > > others do not.
> > >
> > > Anyway, any help this list could provide would be great.
> Ultimately,
> > > the plugin is for Nagios users and we want to make sure we're
> > building
> > > something that meets your expectations.
> > >
> > >
>
>
>
>
> ------------------------------
>
> Message: 15
> Date: Fri, 3 Aug 2007 08:05:28 -0400
> From: "Kevin Menard" <kmenard at servprise.com>
> Subject: Re: [Nagios-users] Request for Feedback: Nagios Plugin
> To: "Kevin Menard" <kmenard at servprise.com>,
> <nagios-users at lists.sourceforge.net>
> Message-ID:
> <384329B8D7108B45A3064FB38FCF267B0E1B12 at aristotle.servprise.office>
> Content-Type: text/plain; charset="US-ASCII"
>
> Sorry for the self-reply, but the URL posted is incorrect (thanks Sander
> for pointing this out).
>
> It should be:
>
> http://dev.servprise.com/nagios/
>
> Thanks,
> Kevin
>
> > -----Original Message-----
> > From: nagios-users-bounces at lists.sourceforge.net [mailto:nagios-users-
> > bounces at lists.sourceforge.net] On Behalf Of Kevin Menard
> > Sent: Friday, August 03, 2007 7:45 AM
> > To: nagios-users at lists.sourceforge.net
> > Subject: [Nagios-users] Request for Feedback: Nagios Plugin
> >
> > Hi all,
> >
> > I'm looking for some feedback on a plugin developed for our WebReboot
> > Enterprise products. It won't require you to download anything, to
> > install anything, or to purchase anything. The details on the plugin
> > thus far can be found at: http://plato/nagios/index.html
> >
> > Really what I'm looking for is feedback on the overall design.
> Certain
> > decisions had to be made with what constitutes a host check versus a
> > service check or a host event handler versus a service one. Likewise,
> > when a corrective action is taken is fairly subjective too. E.g.,
> HARD
> > only? Soft but after 3 attempts?
> >
> > I'm looking for help with a few of the following:
> >
> > 1) The password to the WebReboot Enterprise has to be stored
> somewhere.
> > How would you like to see this done? For simplicity, it's specified
> as
> > a command argument right now. Obviously, anyone with permissions to
> > open your host and service configs can see it. One suggestion had
> been
> > to push this off into a plugin configuration file and restrict access
> > to
> > that.
> >
> > 2) How would you like to see when plugins perform actions? Should it
> > be
> > configurable, yielding flexibility at the expense of usability?
> Should
> > it just be hard-coded with notes on how to change it (it's
> ASL-licensed
> > Python code, so not too hard)?
> >
> > 3) Event handlers are currently non-blocking. That is, if you choose
> > to
> > power on a host, the event handler issues the command to the WebReboot
> > Enterprise and then returns control to Nagios. The WebReboot
> > Enterprise
> > then takes care of powering on the host. This means the script may
> > return in seconds while it may take orders of magnitude longer for the
> > host to power up. The consequence of this is that if your max
> attempts
> > is too low or timing is too tight, host checks may continue to execute
> > while the host is powering up and could put it into an erroneous HARD
> > DOWN state.
> >
> > How should this be handled? Documentation for the user on changing
> > timing & attempt values? Or should the event handler insert a
> > configurable artificial delay? The downside of the latter being that
> > it
> > requires fine tuning for each host, as some boot to the OS quickly
> > while
> > others do not.
> >
> > Anyway, any help this list could provide would be great. Ultimately,
> > the plugin is for Nagios users and we want to make sure we're building
> > something that meets your expectations.
> >
> > --
> > Kevin Menard
> > Servprise International, Inc.
> > 800.832.3823 x308
> >
> >
> -----------------------------------------------------------------------
> > --
> > This SF.net email is sponsored by: Splunk Inc.
> > Still grepping through log files to find problems? Stop.
> > Now Search log events and configuration files using AJAX and a
> browser.
> > Download your FREE copy of Splunk now >> http://get.splunk.com/
> > _______________________________________________
> > Nagios-users mailing list
> > Nagios-users at lists.sourceforge.net
> > https://lists.sourceforge.net/lists/listinfo/nagios-users
> > ::: Please include Nagios version, plugin version (-v) and OS when
> > reporting any issue.
> > ::: Messages without supporting info will risk being sent to /dev/null
>
>
>
> ------------------------------
>
> Message: 16
> Date: Fri, 3 Aug 2007 13:09:59 +0100
> From: Ton Voon <ton.voon at altinity.com>
> Subject: Re: [Nagios-users] Hierarchical representation in hostgroups.
> To: Lalita Drolia <ldrolia at bea.com>
> Cc: nagios-users at lists.sourceforge.net
> Message-ID: <647680E6-38DE-4BA0-9E9F-AA436BC0410A at altinity.com>
> Content-Type: text/plain; charset=US-ASCII; delsp=yes; format=flowed
>
>
> On 3 Aug 2007, at 08:08, Lalita Drolia wrote:
>
> > I have configured nagios to monitor about 800 servers. I have made
> > various hostgroups on the basis of operating systems, databases
> > installed, teams using the machines etc.
> >
> > Now I want to view them in hostgroups on web interface in the form
> > of a tree. For example, I want one broad category of operating
> > systems, under that windows, linux solaris etc and again under
> > windows, 2000 and 2003.
> >
> >
> > Is it possible to have any such kind of view for hostgroups?
> >
> > Because right now it just shows a number of hostgroups in a table
> > which is not very useful to me.
> This sounds like Hostgroup Hierarchy.
>
> Opsview has a feature where we group hostgroups in a hierarchical
> fashion and then the /status/hostgroup link shows you the complete
> summary of all those hostgroups and all the host/service states
> underneath.
>
> We do this by utilising the NDO information and running some complex
> SQL.
>
> More information here: http://opsview.org/hostgrouphierarchy
>
> Ton
>
> http://www.altinity.com
> T: +44 (0)870 787 9243
> F: +44 (0)845 280 1725
> Skype: tonvoon
>
>
>
>
>
> ------------------------------
>
> Message: 17
> Date: Fri, 3 Aug 2007 05:26:09 -0700
> From: "Lalita Drolia" <ldrolia at bea.com>
> Subject: Re: [Nagios-users] Hierarchical representation in hostgroups.
> To: "Ton Voon" <ton.voon at altinity.com>
> Cc: nagios-users at lists.sourceforge.net
> Message-ID:
> <2402BA0C1F52594E8E271233F62EB6A903FC779E at repbex02.amer.bea.com>
> Content-Type: text/plain; charset=us-ascii
>
>
>
> But Opsview is a separate software in itself. Is there any way to aceive
> this in Nagios? Basically a tree like structure to see all the hosts in?
>
>
> -----Original Message-----
> From: Ton Voon [mailto:ton.voon at altinity.com]
> Sent: Friday, August 03, 2007 5:40 PM
> To: Lalita Drolia
> Cc: nagios-users at lists.sourceforge.net
> Subject: Re: [Nagios-users] Hierarchical representation in hostgroups.
>
>
> On 3 Aug 2007, at 08:08, Lalita Drolia wrote:
>
> > I have configured nagios to monitor about 800 servers. I have made
> > various hostgroups on the basis of operating systems, databases
> > installed, teams using the machines etc.
> >
> > Now I want to view them in hostgroups on web interface in the form
> > of a tree. For example, I want one broad category of operating
> > systems, under that windows, linux solaris etc and again under
> > windows, 2000 and 2003.
> >
> >
> > Is it possible to have any such kind of view for hostgroups?
> >
> > Because right now it just shows a number of hostgroups in a table
> > which is not very useful to me.
> This sounds like Hostgroup Hierarchy.
>
> Opsview has a feature where we group hostgroups in a hierarchical
> fashion and then the /status/hostgroup link shows you the complete
> summary of all those hostgroups and all the host/service states
> underneath.
>
> We do this by utilising the NDO information and running some complex
> SQL.
>
> More information here: http://opsview.org/hostgrouphierarchy
>
> Ton
>
> http://www.altinity.com
> T: +44 (0)870 787 9243
> F: +44 (0)845 280 1725
> Skype: tonvoon
>
>
>
> Notice: This email message, together with any attachments, may contain
> information of BEA Systems, Inc., its subsidiaries and affiliated
> entities, that may be confidential, proprietary, copyrighted and/or
> legally privileged, and is intended solely for the use of the individual or
> entity named in this message. If you are not the intended recipient, and
> have received this message in error, please immediately return this by email
> and then delete it.
>
>
>
> ------------------------------
>
> Message: 18
> Date: Fri, 3 Aug 2007 07:36:32 -0500
> From: "Marc Powell" <marc at ena.com>
> Subject: Re: [Nagios-users] Timeout Limits
> To: <nagios-users at lists.sourceforge.net>
> Message-ID: <A7B0A9F02975A74A845FE85D0B95B8FA039296B5 at misex01.ena.com>
> Content-Type: text/plain; charset="US-ASCII"
>
>
>
> > -----Original Message-----
> > From: nagios-users-bounces at lists.sourceforge.net [mailto:nagios-users-
> > bounces at lists.sourceforge.net] On Behalf Of Tom Ray [Lists]
> > Sent: Friday, August 03, 2007 12:53 AM
> > To: nagios-users at lists.sourceforge.net
> > Subject: [Nagios-users] Timeout Limits
> >
> > How do I raise the timeout limits from 10 seconds on things like the
> > smtp_check, httpd_check, etc?
>
> 1) if the plugin has a timeout option, typically -t but you can use
> --help to verify, modify your command{} definitions to pass it.
> 2) modify the host and service check timeout options in nagios.cfg to be
> greater than your highest plugin specific timeout.
> 3) reload nagios.
>
> --
> Marc
>
>
>
> ------------------------------
>
> Message: 19
> Date: Fri, 3 Aug 2007 07:42:00 -0500
> From: "Marc Powell" <marc at ena.com>
> Subject: Re: [Nagios-users] Hierarchical representation in hostgroups.
> To: <nagios-users at lists.sourceforge.net>
> Message-ID: <A7B0A9F02975A74A845FE85D0B95B8FA039296B6 at misex01.ena.com>
> Content-Type: text/plain; charset="US-ASCII"
>
>
>
> > -----Original Message-----
> > From: nagios-users-bounces at lists.sourceforge.net [mailto:nagios-users-
> > bounces at lists.sourceforge.net] On Behalf Of Lalita Drolia
> > Sent: Friday, August 03, 2007 7:26 AM
> > To: Ton Voon
> > Cc: nagios-users at lists.sourceforge.net
> > Subject: Re: [Nagios-users] Hierarchical representation in hostgroups.
> >
> >
> >
> > But Opsview is a separate software in itself. Is there any way to
> aceive
> > this in Nagios? Basically a tree like structure to see all the hosts
> in?
>
> No. Nagios has no notion of nested hostgroups outside of creative use of
> template inheritance.
>
> --
> Marc
>
>
>
> ------------------------------
>
> Message: 20
> Date: Fri, 3 Aug 2007 10:47:28 -0400
> From: "Cassandra Pugh" <cpugh at pppl.gov>
> Subject: Re: [Nagios-users] cgi's not working
> To: <nagios-users at lists.sourceforge.net>
> Message-ID:
> <4E20E7F01BC85746A7A29FDFBFEC988D05885C94 at MAIL-STORE-1.pppl.gov>
> Content-Type: text/plain; charset="us-ascii"
>
> Ok, so I disabled SELinux, and all of a sudden, everything works fine.
>
> SELinux = the bane of my existence. So thanks Marc for mentioning
> SELinux.
>
> --
> Cassandra
> (609) 243-2413
>
>
> "From a little spark may burst a mighty flame."
>
>
> Message: 7
> Date: Thu, 2 Aug 2007 14:42:10 -0400
> From: "Cassandra Pugh" <cpugh at pppl.gov>
> Subject: [Nagios-users] cgi's not working
> To: <nagios-users at lists.sourceforge.net>
> Message-ID:
> <4E20E7F01BC85746A7A29FDFBFEC988D05885C2B at MAIL-STORE-1.pppl.gov>
> Content-Type: text/plain; charset="us-ascii"
>
> I just installed Nagios 2.9 and when I get up to the web portion, I get my
> page up, the documentation shows up, but when I click on various cgis on
> the
> sidebar, I get the whoops message :
>
> now,
> 1 - I have verified the config options by using the command line
> procedure,
> and it comes back 0 warnings, 0 errors.
>
> 2- Nagios log shows nagios starting just fine. I have verified it is
> running,
> I see logs that tell me things are being monitored, I get emails if stuff
> goes down, etc. I just cannot see the web interface.
>
> #3 I am not so sure about? I just ran the make files as the documentation
> states, so I don't see why this would not be correct?
>
>
> " Whoops!
>
> Error: Could not read object configuration data!
>
> Here are some things you should check in order to resolve this error:
>
> 1. Verify configuration options using the -v command-line option to check
> for
> errors.
> 2. Check the Nagios log file for messages relating to startup or status
> data
> errors.
> 3. Make sure you've compiled the main program and the CGIs to use the same
> object data storage options (i.e. default text file or template-based
> file).
> :"
>
> --
> Cassandra
> (609) 243-2413
>
>
> "From a little spark may burst a mighty flame."
>
>
>
>
>
> ------------------------------
>
> Message: 8
> Date: Thu, 2 Aug 2007 13:47:33 -0500
> From: "Marc Powell" <marc at ena.com>
> Subject: Re: [Nagios-users] cgi's not working
> To: <nagios-users at lists.sourceforge.net>
> Message-ID: <A7B0A9F02975A74A845FE85D0B95B8FA08276AF1 at misex01.ena.com>
> Content-Type: text/plain; charset="US-ASCII"
>
>
>
> > -----Original Message-----
> > From: nagios-users-bounces at lists.sourceforge.net [mailto:nagios-users-
> > bounces at lists.sourceforge.net] On Behalf Of Cassandra Pugh
> > Sent: Thursday, August 02, 2007 1:42 PM
> > To: nagios-users at lists.sourceforge.net
> > Subject: [Nagios-users] cgi's not working
> >
> > I just installed Nagios 2.9 and when I get up to the web portion, I
> get my
> > page up, the documentation shows up, but when I click on various cgis
> on
> > the
> > sidebar, I get the whoops message :
> >
> > now,
> > 1 - I have verified the config options by using the command line
> > procedure,
> > and it comes back 0 warnings, 0 errors.
> >
> > 2- Nagios log shows nagios starting just fine. I have verified it is
> > running,
> > I see logs that tell me things are being monitored, I get emails if
> stuff
> > goes down, etc. I just cannot see the web interface.
> >
> > #3 I am not so sure about? I just ran the make files as the
> documentation
> > states, so I don't see why this would not be correct?
>
> Presuming you followed the manual on configuring the web interface and
> that all is ok there. Are there any errors in your web server logs? Does
> your web server have permission to read /path/to/nagios/var and
> /path/to/nagios/etc?
>
> Do you have SELinux enabled and if so, verified that there are no errors
> there?
>
> --
> Marc
>
>
>
> ------------------------------
>
> Message: 9
> Date: Thu, 2 Aug 2007 11:49:20 -0700
> From: Patrick Morris <patrick.morris at hp.com>
> Subject: Re: [Nagios-users] cgi's not working
> To: Cassandra Pugh <cpugh at pppl.gov>
> Cc: nagios-users at lists.sourceforge.net
> Message-ID: <20070802184920.GK9182 at pmorris.usa.hp.com>
> Content-Type: text/plain; charset=us-ascii
>
> On Thu, 02 Aug 2007, Cassandra Pugh wrote:
>
> > I just installed Nagios 2.9 and when I get up to the web portion, I get
> my
> > page up, the documentation shows up, but when I click on various cgis on
> the
> > sidebar, I get the whoops message :
> >
> > now,
> > 1 - I have verified the config options by using the command line
> procedure,
> > and it comes back 0 warnings, 0 errors.
> >
> > 2- Nagios log shows nagios starting just fine. I have verified it is
> running,
> > I see logs that tell me things are being monitored, I get emails if
> stuff
> > goes down, etc. I just cannot see the web interface.
> >
> > #3 I am not so sure about? I just ran the make files as the
> documentation
> > states, so I don't see why this would not be correct?
>
> 1. That won't check for errors in your cgi.cfg, which sounds like it's
> not configured correctly.
>
> 2. Yep, that matches up with a borked cgi.cfg, too.
>
> 3. Check your cgi.cfg file. :) Also, make sure that those config files
> have proper permissions for your webserver user to read them.
>
>
>
> ------------------------------
>
> -------------------------------------------------------------------------
> This SF.net email is sponsored by: Splunk Inc.
> Still grepping through log files to find problems? Stop.
> Now Search log events and configuration files using AJAX and a browser.
> Download your FREE copy of Splunk now >> http://get.splunk.com/
>
> ------------------------------
>
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
>
>
> End of Nagios-users Digest, Vol 15, Issue 4
> *******************************************
>
>
>
> ------------------------------
>
> Message: 21
> Date: Fri, 03 Aug 2007 10:53:55 -0400
> From: Steve Gregory <steveg at qis.net>
> Subject: [Nagios-users] cgi issues
> To: nagios-users at lists.sourceforge.net
> Message-ID: <46B34183.7090604 at qis.net>
> Content-Type: text/plain; charset=ISO-8859-1; format=flowed
>
> I have upgraded nagios from version 1.4 to 2.9 using the Debian
> packages.. I've been trying to configure the new installation and keep
> getting "Error: Could not read object configuration data!" when I try to
> run any of the cgi's. I have double checked all my config files and made
> sure the paths were correct. Has anyone else come across this problem?
>
> Steve
>
>
>
>
> ------------------------------
>
> Message: 22
> Date: Fri, 3 Aug 2007 14:25:18 -0400
> From: "Frost, Mark {PBG}" <mark.frost1 at pepsi.com>
> Subject: [Nagios-users] Scaling a Nagios Server
> To: "Nagios Users mailinglist" <nagios-users at lists.sourceforge.net>
> Message-ID:
> <7F477BD26F545A4C8E4779754A38EFB30219DF58 at PEPWMV00043.corp.pep.pvt>
> Content-Type: text/plain; charset="US-ASCII"
>
>
> Hello. We've got a single Nagios server with the following stats
>
> 2 x Intel Xeon 2.4Ghz CPUs (hyperthreaded)
> 3.6Gb of memory
> Red Hat AS 3 Linux
> Nagios 2.9
> Host Checks: 451
> Service Checks: 2358
> Average OS load level: 3
>
> Service Check Execution Time (min/max/avg): 0.03 sec 30.04
> sec 0.500 sec
> Service Check Latency (min/max/avg): 0.01 sec 33.67 sec
> 5.516 sec
>
> Host Check Execution Time: 0.00 sec 26.78 sec 0.722
> sec
> Host Check Latency: 0.00 sec 3.68 sec 0.008 sec
>
> This box is almost exclusively doing active service checks. It is
> almost dedicated to Nagios (it does a couple of other things, all very
> low-volume).
>
> This server has grown considerably over time to encompass quite a lot
> more Nagios activity than it had when I took over Nagios admin. Now it
> seems active enough to start planning on scaling it somehow. While
> there will definitely be growth, I'm not really sure how I should break
> this architecture out. Our Nagios growth is a bit unpredictable as
> people ask us to monitor their stuff, but I might guess it at 20% per
> year.
>
> I assume I'd want to start looking at the distributed models for Nagios.
> My problem is that I feel like any architecture I try to plan out is
> really just a guess for this load (and predicted future load).
>
> Can anyone suggest where I should go from here in terms of planning out
> a Nagios-for-the-future architecture? How many hosts, for example? We
> looked at possibly going VM with Nagios, but it appears that at least
> with this configuration, Nagios is taking up too many resources. We
> could probably scale out with VM's, but I imagine we'd need more VM's
> (hosts) than we'd need with physical boxes.
>
> Any help is greatly appreciated.
>
> Thanks
>
> Mark
>
>
>
> ------------------------------
>
> -------------------------------------------------------------------------
> This SF.net email is sponsored by: Splunk Inc.
> Still grepping through log files to find problems? Stop.
> Now Search log events and configuration files using AJAX and a browser.
> Download your FREE copy of Splunk now >> http://get.splunk.com/
>
> ------------------------------
>
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
>
>
> End of Nagios-users Digest, Vol 15, Issue 5
> *******************************************
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.monitoring-lists.org/archive/users/attachments/20070808/a3842d53/attachment.html>
-------------- next part --------------
-------------------------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems? Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/
-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
More information about the Users
mailing list