From jonathan.wheeler at stfc.ac.uk Mon Feb 1 10:31:15 2010 From: jonathan.wheeler at stfc.ac.uk (jonathan.wheeler at stfc.ac.uk) Date: Mon, 1 Feb 2010 09:31:15 -0000 Subject: FW: Nagios 3.0.5 problem Message-ID: From: Rick Mangus [mailto:rick.mangus+nagios at gmail.com] Sent: 29 January 2010 17:02 > Hello, all. > > Forgive me, I am new to the list, and have only begun working with nagios recently.? I have > searched this list and googled furiously with little result, so must cease my lurking and > present my problem to you. > > I will begin with the problem: Sometime after midnight every night, my nagios server starts > to have trouble processing service checks.? I don't know the cause, and cannot find a > solution.? I can describe the symptoms in detail and hope we can diagnose it. > > The web interface shows the last service check came in at 02:28:34 (EST).? I know that > around 4:15 every morning, xinetd starts refusing connections to nsca due to high load > (max_load is 18), and that eventually I will have 32000+ nsca connections using up all > available PIDs leading to an inability to fork new processes, effectively killing the > machine.? While all this happens, the nagios.log appears to periodically stall, making no > new entries for 15 minutes at a time, and then flush 15000 in the space of a single > second.? Also, it seems the checkresults directory is empty most of the time, but sometimes > pops up to 2045 files (it's on a ramdisk with 2048 inodes) and not a single one gets > deleted in a time period I have been patient enough to observe. > > The periods in which the nagios log is going nowhere are accompanied by nagios taking 100% > of 2 CPUs.? One thread appears to poll() approximately every 25 usecs, and another is > inscrutable, with mprotect() the only strace-visible syscall.? All the nsca processes have > a blocking write() they are waiting on.? When the log is showing new entries, there are > still no updates made to the services, and it seems that that is what is filling up > checkresults.? I admit I have not checked to find the order of the log and checkresults > processes, though I assumed they would operate in the opposite order of what this appears > to show. > > I know this behavior has been ongoing for at least 1 month.? I have disabled all cron jobs > that I feared might be interfering.? I will answer any and all questions to the best of my > ability, and hope someone here can shed some light on the situation. 1. Do you run ndoutils (to write results to a MySQL database) ? If so, which version ? I ask because I used to have a similar problem which I eventually tracked down to an interfering backup on the MySQL server that hosted the database. 2. Do you run other services on the Nagios server which might interfere with Nagios (e.g backups which start sometime after midnight) ? 3. Have you thought of upgrading to nagios 3.2.0 which is the latest stable version ? Jonathan Wheeler e-Science Centre Rutherford Appleton Laboratory -- Scanned by iCritical. -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: ATT32268.txt URL: -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: ATT32269.txt URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jonathan.wheeler at stfc.ac.uk Mon Feb 1 15:21:05 2010 From: jonathan.wheeler at stfc.ac.uk (jonathan.wheeler at stfc.ac.uk) Date: Mon, 1 Feb 2010 14:21:05 -0000 Subject: FW: Nagios 3.0.5 problem In-Reply-To: <77c454bb1002010529i55140760k2fd1ca69781f3664@mail.gmail.com> References: <77c454bb1002010529i55140760k2fd1ca69781f3664@mail.gmail.com> Message-ID: From: rickmangus at gmail.com On Behalf Of Rick Mangus Sent: 01 February 2010 13:30 > Thank you for the response.? In quick succession: I am forwarding my replies to the Nagios list as well. > 1. I do use perfparse, and one of our suspicions involved a mysql delete to prune old data > that took multiple hours every night.? I removed all jobs that I could find that could > possibly interfere. You can determine that you are running ndoutils by issuing a command like "ps -fu naguos" (to list all processes owned by username nagios; if you are running ndoutils, there will be a process named ndo2db > 2. No, or if we do, it's well-hidden.? ;) In our case, the process causing the main problem was running on another server, but was holding up nagios because it blocked the nagios process in part of the code that was single-threaded. > 3. I am walking into a pre-existing install, and trying to slowly take over the management > duties.? To hasten the learning process, the only person in the office who knows anything > about it went to Hawaii once I'd been working here a few weeks.? I don't know that I should > attempt any major changes without his blessing, and he will not return until > March.? Though, if we determine that is the only/best fix, I'll do it. Can he be contacted for his advice ? If not, what has been changed since you took over (probably asking the obvious questions !) > One additional data point:? I found on Saturday night, as I logged in to restart Nagios and > prevent the machine dying, that the one file in /ramdisk_nagios/checkresults/ was over > 1MB.? Every other time I have checked, the files in there are sub-4kB.? If that tells > anyone here anything, please share with the peanut gallery (me!).? :) 1MB sounds very large; if you see it again, try finding out which check generated this file (core or debug dump from check code ?). Jonathan Wheeler e-Science Centre Rutherford Appleton Laboratory -- Scanned by iCritical. ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rick.mangus+nagios at gmail.com Mon Feb 1 16:00:13 2010 From: rick.mangus+nagios at gmail.com (Rick Mangus) Date: Mon, 1 Feb 2010 09:00:13 -0600 Subject: FW: Nagios 3.0.5 problem In-Reply-To: References: <77c454bb1002010529i55140760k2fd1ca69781f3664@mail.gmail.com> Message-ID: <77c454bb1002010700g5ac3adc7nae3197beba9d9ed5@mail.gmail.com> Oops, sent that email to you directly, and not the list. The answers to your questions should be plainly visible below; my apologies to anyone reading on nagios-users. As regards question 1, I am certain we are not running ndo2db. I'm sorry if my first answer seemed ambiguous, perhaps I should have stated outright that I am not using ndoutils. I instead attempted to let you know that we were running some software that similarly stored data in a database and had already ruled out large/slow database queries as the source of my problem. Your response to number 2 is quite intriguing! Further detail on that might be helpful, though. I'll check to see if anyone is using this server for anything else that could interfere. I don't think a nagios check can be blocking, as 99% of our checks are passive checks passed through nsca and I don't think any of our active checks besides host checks (check_icmp) actually contact another computer. As to the pre-existing administrator, he was just rebooting the server every day as he did not know what was causing the problem. If I am convinced that an upgrade will fix the problem, I will email him. Otherwise, until there is some change, there's not much point in bothering him during his sabbatical. Finally, the checkresults thing. It seemed odd to me, but it looked like several normal files concatenated. I can't actually look at the file again until the server is booted and on a Monday morning that could be a while. ;) Thanks for your help! --Rick On Mon, Feb 1, 2010 at 8:21 AM, wrote: > From: rickmangus at gmail.com On Behalf Of Rick Mangus > Sent: 01 February 2010 13:30 > > > Thank you for the response. In quick succession: > > I am forwarding my replies to the Nagios list as well. > > > 1. I do use perfparse, and one of our suspicions involved a mysql delete > to prune old data > > that took multiple hours every night. I removed all jobs that I could > find that could > > possibly interfere. > > You can determine that you are running ndoutils by issuing a command like > "ps -fu naguos" (to list all processes owned by username nagios; if you are > running ndoutils, there will be a process named ndo2db > > > 2. No, or if we do, it's well-hidden. ;) > > In our case, the process causing the main problem was running on another > server, but was holding up nagios because it blocked the nagios process in > part of the code that was single-threaded. > > > 3. I am walking into a pre-existing install, and trying to slowly take > over the management > > duties. To hasten the learning process, the only person in the office > who knows anything > > about it went to Hawaii once I'd been working here a few weeks. I don't > know that I should > > attempt any major changes without his blessing, and he will not return > until > > March. Though, if we determine that is the only/best fix, I'll do it. > > Can he be contacted for his advice ? If not, what has been changed since > you took over (probably asking the obvious questions !) > > > One additional data point: I found on Saturday night, as I logged in to > restart Nagios and > > prevent the machine dying, that the one file in > /ramdisk_nagios/checkresults/ was over > > 1MB. Every other time I have checked, the files in there are sub-4kB. > If that tells > > anyone here anything, please share with the peanut gallery (me!). :) > > 1MB sounds very large; if you see it again, try finding out which check > generated this file (core or debug dump from check code ?). > > Jonathan Wheeler > e-Science Centre > Rutherford Appleton Laboratory > > > > -- > Scanned by iCritical. > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mark.frost1 at pepsi.com Mon Feb 1 16:19:42 2010 From: mark.frost1 at pepsi.com (Frost, Mark {PBG}) Date: Mon, 1 Feb 2010 10:19:42 -0500 Subject: Nagios 3.2.0 DST bug addressed? Message-ID: Has the bug with Nagios 3.2.0 talked about in http://www.mail-archive.com/nagios-users at lists.sourceforge.net/msg29812.html been addressed in any way yet? Or is it still about trying the workarounds? I'm assuming that since there's no 3.2.1, that it's probably still about using the workarounds. Also, from what I read I had the impression that this only affected "falling back" -- that is, the one day of the year when you have a 25 hour day. This does not have an effect when we "spring forward" (i.e. a 23-hour day?) I have a need to upgrade from 3.0.6 to 3.2.0 to take advantage of the improved dependency checking, but I got bit by the DST issue in November and am frankly rather nervous about this again without knowing this has been addressed somehow. Thanks Mark ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From justinp at norchemlab.com Mon Feb 1 16:43:53 2010 From: justinp at norchemlab.com (Justin Pryzby) Date: Mon, 1 Feb 2010 08:43:53 -0700 Subject: FW: Nagios 3.0.5 problem In-Reply-To: References: <77c454bb1002010529i55140760k2fd1ca69781f3664@mail.gmail.com> Message-ID: <20100201154353.GB9293@norchemlab.com> > > One additional data point:? I found on Saturday night, as I logged in to restart Nagios and > > prevent the machine dying, that the one file in /ramdisk_nagios/checkresults/ was over > > 1MB.? Every other time I have checked, the files in there are sub-4kB.? If that tells > > anyone here anything, please share with the peanut gallery (me!).? :) > > 1MB sounds very large; if you see it again, try finding out which > check generated this file (core or debug dump from check code ?). I can't resist the temptation to suggest writing a test for that, something like: nrpe.cfg: command[bigresult]=find /dir -mindepth 1 -maxdepth 1 -size +1k |xargs |grep . && exit 2; echo OK; exit 0 Justin ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From benny at bennyvision.com Mon Feb 1 17:08:49 2010 From: benny at bennyvision.com (C. Bensend) Date: Mon, 1 Feb 2010 10:08:49 -0600 Subject: check_openmanage having issues with OMSA 6.2.0 Message-ID: <1e2de691ae5c5c868e2fa34f5146f99d.squirrel@webmail.stinkweasel.net> Hey folks, During this past weekend's maintenance window, we upgraded several hosts to OMSA v6.2.0. They were previously at v5.0.0, and so check_openmanage wasn't able to poll them. Now, they are still showing as UNKNOWN, giving the following error: Problem running 'omreport storage controller': Error! Invalid name=value pair: controller This is running on a Dell PowerEdge 1950, with the following command via NSClient++: check_openmanage.exe -e -b bat_charge=ALL/ctrl_fw=ALL/ctrl_driver=ALL --omreport E:\OpenManage\oma\bin\omreport.exe I would have thought 6.2.0 would be OK - is anyone else seeing issues, or does anyone know of incompatibilities? I checked the check_openmanage FAQ, but didn't see anything... Thanks! Benny -- "Blow with all your might into the bean stuffed straw." -- Spam message, 2006-11 ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Mon Feb 1 17:25:30 2010 From: marc at ena.com (Marc Powell) Date: Mon, 1 Feb 2010 10:25:30 -0600 Subject: Nagios 3.2.0 DST bug addressed? In-Reply-To: References: Message-ID: On Feb 1, 2010, at 9:19 AM, Frost, Mark {PBG} wrote: > > Has the bug with Nagios 3.2.0 talked about in > > http://www.mail-archive.com/nagios-users at lists.sourceforge.net/msg29812.html > > been addressed in any way yet? Or is it still about trying the workarounds? I'm assuming that since there's no 3.2.1, that it's probably still about using the workarounds. http://tracker.nagios.org/view.php?id=46 -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Mon Feb 1 17:40:39 2010 From: marc at ena.com (Marc Powell) Date: Mon, 1 Feb 2010 10:40:39 -0600 Subject: Nagios 3.2.0 DST bug addressed? In-Reply-To: References: Message-ID: On Feb 1, 2010, at 10:25 AM, Marc Powell wrote: > > On Feb 1, 2010, at 9:19 AM, Frost, Mark {PBG} wrote: > >> >> Has the bug with Nagios 3.2.0 talked about in >> >> http://www.mail-archive.com/nagios-users at lists.sourceforge.net/msg29812.html >> >> been addressed in any way yet? Or is it still about trying the workarounds? I'm assuming that since there's no 3.2.1, that it's probably still about using the workarounds. > > > http://tracker.nagios.org/view.php?id=46 Sorry, yes I realize now this is a different issue. Trying to track down if the issue you reference has been fixed in CVS. I believe it has but not 100% sure yet. -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Mon Feb 1 17:48:54 2010 From: marc at ena.com (Marc Powell) Date: Mon, 1 Feb 2010 10:48:54 -0600 Subject: Nagios 3.2.0 DST bug addressed? In-Reply-To: References: Message-ID: <2FC06CF8-17A4-426E-B4B4-EE5423FC8764@ena.com> On Feb 1, 2010, at 10:40 AM, Marc Powell wrote: > > On Feb 1, 2010, at 10:25 AM, Marc Powell wrote: > >> >> On Feb 1, 2010, at 9:19 AM, Frost, Mark {PBG} wrote: >> >>> >>> Has the bug with Nagios 3.2.0 talked about in >>> >>> http://www.mail-archive.com/nagios-users at lists.sourceforge.net/msg29812.html >>> >>> been addressed in any way yet? Or is it still about trying the workarounds? I'm assuming that since there's no 3.2.1, that it's probably still about using the workarounds. >> >> >> http://tracker.nagios.org/view.php?id=46 > > Sorry, yes I realize now this is a different issue. Trying to track down if the issue you reference has been fixed in CVS. I believe it has but not 100% sure yet. Here it is. Fixed in October by Ton Voon - http://nagios.cvs.sourceforge.net/viewvc/nagios/nagios/base/utils.c?view=log Only in CVS snapshots at this point. -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From israel at frontierflying.com Mon Feb 1 18:38:45 2010 From: israel at frontierflying.com (Israel Brewster) Date: Mon, 1 Feb 2010 08:38:45 -0900 Subject: Mutually exclusive contacts? Message-ID: <69C3739D-1073-4562-8146-E0FFE37870B2@frontierflying.com> I suspect the answer is no, but in nagios 3, is there any way to set up mutually exclusive contact? As in, I have a host template set up with a number of contacts. On specific hosts that use this template, I want to replace one contact (normal me) with a different contact (urgent me). The other contacts for the host remain the same, so I don't want to just override the contacts directive for the host. I think I can do something like contacts +UrgentMe, !NormalMe, but I'm not even sure about that - I want to say there was an issue with doing negative contacts (although that might have been something else- I'll try it and see). Be that as it may, it would be really convenient if I could set something in the contacts to say "Urgent me overrides Normal me" Is this possible? Thanks. ----------------------------------------------- Israel Brewster Computer Support Technician II Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 ----------------------------------------------- -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Israel Brewster.vcf Type: text/directory Size: 417 bytes Desc: not available URL: -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Mon Feb 1 20:03:35 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Mon, 01 Feb 2010 20:03:35 +0100 Subject: check_openmanage having issues with OMSA 6.2.0 In-Reply-To: <1e2de691ae5c5c868e2fa34f5146f99d.squirrel@webmail.stinkweasel.net> (C. Bensend's message of "Mon, 1 Feb 2010 10:08:49 -0600") References: <1e2de691ae5c5c868e2fa34f5146f99d.squirrel@webmail.stinkweasel.net> Message-ID: <15t7hqw6ans.fsf@tux.uio.no> "C. Bensend" writes: > Hey folks, > > During this past weekend's maintenance window, we upgraded several > hosts to OMSA v6.2.0. They were previously at v5.0.0, and so > check_openmanage wasn't able to poll them. > > Now, they are still showing as UNKNOWN, giving the following > error: > > > Problem running 'omreport storage controller': Error! Invalid > name=value pair: controller > > > This is running on a Dell PowerEdge 1950, with the following > command via NSClient++: > > > check_openmanage.exe -e -b bat_charge=ALL/ctrl_fw=ALL/ctrl_driver=ALL > --omreport E:\OpenManage\oma\bin\omreport.exe > > > I would have thought 6.2.0 would be OK - is anyone else seeing > issues, or does anyone know of incompatibilities? I checked the > check_openmanage FAQ, but didn't see anything... Hi Benny, The command 'omreport storage controller' is pretty basic and should never fail like that. You should check if OMSA is correctly installed, specifically the storage stuff. OMSA consists of many different components, and I'm guessing that the storage component(s) are missing on your server. If you run the command manually, you get the same error message, right? Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From benny at bennyvision.com Mon Feb 1 20:42:40 2010 From: benny at bennyvision.com (C. Bensend) Date: Mon, 1 Feb 2010 13:42:40 -0600 Subject: check_openmanage having issues with OMSA 6.2.0 In-Reply-To: <15t7hqw6ans.fsf@tux.uio.no> References: <1e2de691ae5c5c868e2fa34f5146f99d.squirrel@webmail.stinkweasel.net> <15t7hqw6ans.fsf@tux.uio.no> Message-ID: <079d9657b1549255cf630281ad4a0eac.squirrel@webmail.stinkweasel.net> >> I would have thought 6.2.0 would be OK - is anyone else seeing >> issues, or does anyone know of incompatibilities? I checked the >> check_openmanage FAQ, but didn't see anything... > > Hi Benny, > > The command 'omreport storage controller' is pretty basic and should > never fail like that. You should check if OMSA is correctly installed, > specifically the storage stuff. OMSA consists of many different > components, and I'm guessing that the storage component(s) are missing > on your server. > > If you run the command manually, you get the same error message, right? Hi Trond, I'm not sure - I let the Windows guys do the upgrades, so I wasn't witness. I asked them, though, and they said they did what they always do, so I wonder if Dell re-grouped some of the components and the storage bits weren't installed. They have followed the same process in the past for 5.3.0 - 6.1.0, so methinks OMSA may have changed a bit. We have another upgrade this afternoon, we're going to test it during the window to see if additional options need to be installed with 6.2.0. Thanks! Benny -- "Blow with all your might into the bean stuffed straw." -- Spam message, 2006-11 ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From benny at bennyvision.com Mon Feb 1 22:11:09 2010 From: benny at bennyvision.com (C. Bensend) Date: Mon, 1 Feb 2010 15:11:09 -0600 Subject: check_openmanage having issues with OMSA 6.2.0 In-Reply-To: <079d9657b1549255cf630281ad4a0eac.squirrel@webmail.stinkweasel.net> References: <1e2de691ae5c5c868e2fa34f5146f99d.squirrel@webmail.stinkweasel.net> <15t7hqw6ans.fsf@tux.uio.no> <079d9657b1549255cf630281ad4a0eac.squirrel@webmail.stinkweasel.net> Message-ID: >>> I would have thought 6.2.0 would be OK - is anyone else seeing >>> issues, or does anyone know of incompatibilities? I checked the >>> check_openmanage FAQ, but didn't see anything... >> >> Hi Benny, >> >> The command 'omreport storage controller' is pretty basic and should >> never fail like that. You should check if OMSA is correctly installed, >> specifically the storage stuff. OMSA consists of many different >> components, and I'm guessing that the storage component(s) are missing >> on your server. >> >> If you run the command manually, you get the same error message, right? > > Hi Trond, > > I'm not sure - I let the Windows guys do the upgrades, so I wasn't > witness. I asked them, though, and they said they did what they > always do, so I wonder if Dell re-grouped some of the components and > the storage bits weren't installed. They have followed the same > process in the past for 5.3.0 - 6.1.0, so methinks OMSA may have > changed a bit. > > We have another upgrade this afternoon, we're going to test it > during the window to see if additional options need to be installed > with 6.2.0. Confirmed - Dell changed something with the install process in v6.2.0, so the same procedure they were using in the past did *not* install the storage components. Thanks, Trond. FYI to everyone else that uses OMSA and check_openmanage - you may have to adjust the options you choose when installing. Benny -- "Blow with all your might into the bean stuffed straw." -- Spam message, 2006-11 ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mwiese at beit.de Mon Feb 1 22:01:48 2010 From: mwiese at beit.de (mwiese at beit.de) Date: Mon, 1 Feb 2010 22:01:48 +0100 Subject: =?iso-8859-1?q?Marco_Wiese/BEIT_GmbH_ist_au=DFer_H?= =?iso-8859-1?q?aus=2E?= Message-ID: Ich werde vom 27.01.2010 bis einschlie?lich 02.02.2010 nicht im B?ro sein. Ich werde Ihre Nachricht nach meiner R?ckkehr schnellstm?glich beantworten. ------------------------------------------------------ I will be out of the office starting 01/27/2010 and will not return until 02/02/2009. I will answer your message after my return. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ravishankar.gundlapali at wipro.com Tue Feb 2 08:50:13 2010 From: ravishankar.gundlapali at wipro.com (ravishankar.gundlapali at wipro.com) Date: Tue, 2 Feb 2010 13:20:13 +0530 Subject: Socket timeout after 10 seconds Message-ID: <39C5D75C6A5D3440A833E1C014384BAA07DE7AC2@PNE-HJN-MBX02.wipro.com> Hi, I am using Nagios Version 3.2.4,with NSC Client++ 0.3.6 and Nagios -Plugins:1.4.14 with Red Hat Fedora Linux Version 10.1 on Windows Servers. Every few minutes Nagios detects that a servers/service is down because it times out and in the status information shows: Critical - SocketTimeoutAfter10Seconds Please someone let me know what does this mean. What information can I get from this alert. Thanks & Regards, Ravi G -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Tue Feb 2 09:24:02 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Tue, 02 Feb 2010 00:24:02 -0800 Subject: Socket timeout after 10 seconds In-Reply-To: <39C5D75C6A5D3440A833E1C014384BAA07DE7AC2@PNE-HJN-MBX02.wipro.com> References: <39C5D75C6A5D3440A833E1C014384BAA07DE7AC2@PNE-HJN-MBX02.wipro.com> Message-ID: <4B67E122.6020005@hp.com> ravishankar.gundlapali at wipro.com wrote: > > Every few minutes Nagios detects that a servers/service is down > because it times out and in the status information shows: Critical - > *SocketTimeoutAfter10Seconds* > > > > > > Please someone let me know what does this mean. What information can I > get from this alert. > It means simply that the connection to the socket you're monitoring timed out after 10 seconds. Pretty self-explanatory, really. ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios.raja at gmail.com Tue Feb 2 14:04:18 2010 From: nagios.raja at gmail.com (raja nagios) Date: Tue, 2 Feb 2010 18:34:18 +0530 Subject: Nagios Help Message-ID: <48dff22c1002020504s171461a3v3ad8c063950edbe@mail.gmail.com> Hi, I have configured Nagios on RHEL5.3. now monitoring Linux and windows server. when i look at MAP tab. it is giving MAP from Nagios server to clients.But It is giving "?" mark. Why? How can i remove ? mark? Instead of ? mark i want to display name of the machine. How can i do so? Thanks in Advance. Raja. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From yu.watanabe at jp.fujitsu.com Tue Feb 2 14:17:26 2010 From: yu.watanabe at jp.fujitsu.com (Yu Watanabe) Date: Tue, 02 Feb 2010 22:17:26 +0900 Subject: Is Notification history refreshed after the commit? Message-ID: <201002021317.AA01513@S2007337.jp.fujitsu.com> Hello all. I have a question does nagios refreshes the notification history event though the state retention option is specified. I am using the Nagios v3.0.6 and following is the situation: 1. Service Notification interval is set as 24h 2. Host status is UP at that time 3. Normal check interval is set 24h Following is the history from the nagios.log: * HARD status change had occured at Jan 11 08:35 . Notification has been sent at 08:35 * Notification was sent at Jan 12 08:35 * Notification was sent at Jan 13 08:35 * Commit was executed at Jan 13 22:24 * Notification was not sent at Jan 14 08:35 * Recovery status change had occured from CRITICAL -> OK at Jan 15 08:35. Notification has been sent at 08:35. I thought that Nagios inherits the last notification time when it restarts using the nagiostatus.sav. But it seems to be not the case for this time. Could someone give us an advice? Thank you. Yu Watanabe ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Tue Feb 2 15:05:44 2010 From: nagios at flatto.net (Assaf Flatto) Date: Tue, 02 Feb 2010 14:05:44 +0000 Subject: NSCA checks not appear on nagios Message-ID: <4B683138.7030701@flatto.net> Hello I'm trying to setup a distributed nagios using complied nagios 3.2.0 . I set up the NSCA and i can see the result being delivered to the nagios (example below) : remote nagios: **** BEGIN MACRO PROCESSING *********** [1265118755.249654] [2048.1] [pid=12343] Processing: '/usr/local/nagios/libexec/eventhandlers/submit_check_result $HOSTNAME$ '$SERVICEDESC$' $SERVICESTATE$ '$SERVICEOUTPUT$'' [1265118755.249670] [2048.1] [pid=12343] Done. Final output: '/usr/local/nagios/libexec/eventhandlers/submit_check_result foo.bar 'HTTP' CRITICAL 'HTTP CRITICAL - No data received from host'' [1265118755.249691] [2048.1] [pid=12343] **** END MACRO PROCESSING ************* [1265118755.249697] [001.0] [pid=12343] my_system() [1265118755.249703] [256.1] [pid=12343] Running command '/usr/local/nagios/libexec/eventhandlers/submit_check_result foo.bar 'HTTP' CRITICAL 'HTTP CRITICAL - No data received from host''... Central Nagios : Connection from xx.xx.xx.xx port 63416 nsca[6405]: Handling the connection... nsca[6405]: SERVICE CHECK -> Host Name: 'foo.bar', Service Description: 'HTTP', Return Code: '0', Output: 'HTTP OK: HTTP/1.1 301 Moved Permanently - 461 bytes in 0.081 second response time' nsca[6405]: End of connection... The problem is that the host/service on the central server stay in the pending status and the web page does not change. I have 4 hosts with only check_http defined as the check running on them (both on the remote and the central nagios ). when i force a check on a service - it comes up at ok ( as it should be ) , how can i make sure they start with out manual intervention ? Thanks Assaf ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios.raja at gmail.com Tue Feb 2 15:53:32 2010 From: nagios.raja at gmail.com (raja nagios) Date: Tue, 2 Feb 2010 20:23:32 +0530 Subject: Nagios Message-ID: <48dff22c1002020653i5165f96bt13eca2424414d987@mail.gmail.com> Hi all, Look at the attached snapshot of nagios. why i am getting ? mark... can anyone help me. please have a look at picture. Thanks -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: nagios.jpg Type: image/jpeg Size: 220029 bytes Desc: not available URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jpratt at norwich.edu Tue Feb 2 16:05:25 2010 From: jpratt at norwich.edu (James Pratt) Date: Tue, 2 Feb 2010 10:05:25 -0500 Subject: Nagios In-Reply-To: <48dff22c1002020653i5165f96bt13eca2424414d987@mail.gmail.com> References: <48dff22c1002020653i5165f96bt13eca2424414d987@mail.gmail.com> Message-ID: <24B6509E4191AF44B60A24EAA3B4AD49489451@nuexchg.norwich.edu> >> -----Original Message----- >> From: raja nagios [mailto:nagios.raja at gmail.com] >> Sent: Tuesday, February 02, 2010 9:54 AM >> To: Nagios-users at lists.sourceforge.net >> Subject: [Nagios-users] Nagios >> >> Hi all, >> >> Look at the attached snapshot of nagios. >> >> why i am getting ? mark... >> >> can anyone help me. please have a look at picture. >> >> >> Thanks Hello, I think if you just change / fix the /etc/hosts file to reflect the proper IP and hostname of your nagios server, as opposed to localhost.localdomain, which should have an entry for 127.0.0.1 only. Also, check the output of "hostname" command - if it's not correct, you can set it via hostname -v "nagios.mydomain.com" I think if you restart nagios after this change/fix, you will see what you expect... if not, check your httpd setup/ conf files for correct ServerName directive , etc etc... Example /etc/hosts file : # Do not remove the following line, or various programs # that require network functionality will fail. 127.0.0.1 localhost.localdomain localhost ::1 localhost6.localdomain6 localhost6 10.12.200.90 nagios.norwich.edu nagios Cheers, james ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Chris.Menard at Aspect.com Tue Feb 2 16:09:58 2010 From: Chris.Menard at Aspect.com (Menard, Chris) Date: Tue, 2 Feb 2010 10:09:58 -0500 Subject: Nagios In-Reply-To: <48dff22c1002020653i5165f96bt13eca2424414d987@mail.gmail.com> References: <48dff22c1002020653i5165f96bt13eca2424414d987@mail.gmail.com> Message-ID: Check icon_image of the host configuration. From: raja nagios [mailto:nagios.raja at gmail.com] Sent: Tuesday, February 02, 2010 9:54 AM To: Nagios-users at lists.sourceforge.net Subject: [Nagios-users] Nagios Hi all, Look at the attached snapshot of nagios. why i am getting ? mark... can anyone help me. please have a look at picture. Thanks -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Feb 2 16:09:12 2010 From: marc at ena.com (Marc Powell) Date: Tue, 2 Feb 2010 09:09:12 -0600 Subject: NSCA checks not appear on nagios In-Reply-To: <4B683138.7030701@flatto.net> References: <4B683138.7030701@flatto.net> Message-ID: <4EE08382-0643-41C7-B158-4A089D23B634@ena.com> On Feb 2, 2010, at 8:05 AM, Assaf Flatto wrote: > The problem is that the host/service on the central server stay in the > pending status and the web page does not change. > I have 4 hosts with only check_http defined as the check running on them > (both on the remote and the central nagios ). > > when i force a check on a service - it comes up at ok ( as it should be > ) , how can i make sure they start with out manual intervention ? Make sure you've enabled external commands: http://nagios.sourceforge.net/docs/3_0/extcommands.html Make sure you've enabled passive checks in nagios.cfg and in the service definitions and that the host/service your are submitting for exists in your config: http://nagios.sourceforge.net/docs/3_0/passivechecks.html Also see this recent thread - http://www.mail-archive.com/nagios-users at lists.sourceforge.net/msg30439.html If you're still having problems, post consistent service definitions and nsca debug output (your examples don't match), nagios.cfg entries related to passive checks and external commands and an 'ls -l /path/to/nagios.cmd'. -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at smetj.net Tue Feb 2 16:44:29 2010 From: nagios at smetj.net (Jelle Smet) Date: Tue, 02 Feb 2010 16:44:29 +0100 Subject: Scheduled downtime for 1 host and its services Message-ID: <5b3c9e44de4aae2026afc688a3383778@smetj.net> Hi List, I'm using Nagios 3.2.0 and have a question about scheduled downtimes which I can't find in the docs. If I schedule downtime for a host, does this automatically schedule downtime for all the host services too? If so, why isn't there the ZZzzz icon next to these services? Will scheduled downtime for a host also make sure this doesn't impact the availability report of the services? Thanks in advance! Jelle ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Tue Feb 2 18:41:58 2010 From: nagios at flatto.net (Assaf Flatto) Date: Tue, 02 Feb 2010 17:41:58 +0000 Subject: Nagiosgraph failing to generate graphs In-Reply-To: <4B585657.4000901@flatto.net> References: <4B583604.7030308@flatto.net> <4B585657.4000901@flatto.net> Message-ID: <4B6863E6.7080304@flatto.net> Assaf Flatto wrote: > Eric Emerson wrote: > >> Is that the only entry in nagiosgraph.log? >> >> > It was when i first posted , hoiwever now there is a new line in the log : > > Thu Jan 21 11:08:39 2010 show.cgi critical define '/boot' in ngshared.pm > (in %Ctrans) and report it, please > Can anyone help me resolve this issue ? I know the data is entered to the rrd files , but u do not get any graphs . The errors i get are as below , but I did not find any explanations on how to resolve this , and trying to add things to the ngshared.pm file isn't working . show.cgi critical define 'zombie' in ngshared.pm (in %Ctrans) and report it, please show.cgi critical define '%2F' in ngshared.pm (in %Ctrans) and report it, please Thanks Assaf ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Feb 2 16:16:18 2010 From: marc at ena.com (Marc Powell) Date: Tue, 2 Feb 2010 09:16:18 -0600 Subject: Nagios In-Reply-To: <48dff22c1002020653i5165f96bt13eca2424414d987@mail.gmail.com> References: <48dff22c1002020653i5165f96bt13eca2424414d987@mail.gmail.com> Message-ID: On Feb 2, 2010, at 8:53 AM, raja nagios wrote: > Hi all, > > Look at the attached snapshot of nagios. > > why i am getting ? mark... I don't use the statusmap but did you specify a valid statusmap_image in the host{} definition? http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#host Please read that as there are subtleties related to 'valid'... -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From chris.weiss at gmail.com Wed Feb 3 02:14:26 2010 From: chris.weiss at gmail.com (Chris Weiss) Date: Tue, 2 Feb 2010 17:14:26 -0800 Subject: Could not expand hostgroups error Message-ID: I've got a Nagios 3.2 setup on FC10 monitoring ~32 or so hosts for a few dozen services and am having troubles trying to add a couple of hostgroups. The way I have things organized is /etc/nagios/conf.d/ contains individual files for each host and /etc/nagios/objects contains hostgroup definitions used to define common services to be monitored. I've got all the objects/*.cfg files included in nagios.cfg: cfg_file=/etc/nagios/objects/commands.cfg cfg_file=/etc/nagios/objects/contacts.cfg cfg_file=/etc/nagios/objects/timeperiods.cfg cfg_file=/etc/nagios/objects/templates.cfg cfg_file=/etc/nagios/objects/localhost.cfg cfg_file=/etc/nagios/objects/windows.cfg cfg_file=/etc/nagios/objects/linux.cfg cfg_file=/etc/nagios/objects/switch.cfg cfg_file=/etc/nagios/objects/printer.cfg cfg_file=/etc/nagios/objects/router.cfg #cfg_file=/etc/nagios/objects/winhttp.cfg #cfg_file=/etc/nagios/objects/twiki.cfg #cfg_file=/etc/nagios/objects/bugzilla.cfg cfg_file=/etc/nagios/objects/mysql.cfg cfg_file=/etc/nagios/objects/exchange.cfg Sample host definition: #/etc/nagios/conf.d/mailsrv.cfg define host{ use windows-server ; Inherit default values from a template host_name mailsrv ; The name we're giving to this host alias mailsrv.domain.com ; A longer name associated with the host hostgroups windows-servers,exchange-servers address 10.239.1.16 ; IP address of the host parents cisco-3560g } Sample hostgroup definition: #/etc/nagios/objects/exchange.cfg define hostgroup{ hostgroup_name exchange-servers alias Exchange Servers } define service{ use generic-service hostgroup_name exchange-servers service_description Exchange Information Store check_command check_nt!SERVICESTATE!-d SHOWALL -l MSExchangeIS } define service{ use generic-service hostgroup_name exchange-servers service_description Exchange Management check_command check_nt!SERVICESTATE!-d SHOWALL -l MSExchangeMGMT } define service{ use generic-service hostgroup_name exchange-servers service_description Exchange MTA Stacks check_command check_nt!SERVICESTATE!-d SHOWALL -l MSExchangeMTA } define service{ use generic-service hostgroup_name exchange-servers service_description Exchange Routing Engine check_command check_nt!SERVICESTATE!-d SHOWALL -l RESvc } define service{ use generic-service hostgroup_name exchange-servers service_description Exchange System Attendant check_command check_nt!SERVICESTATE!-d SHOWALL -l MSExchangeSA } define service{ use generic-service hostgroup_name exchange-servers service_description IIS Admin Service check_command check_nt!SERVICESTATE!-d SHOWALL -l IISADMIN } This seems to work fine. However, I have three hostgroup files that do not work and I can't suss out why. They're all fairly identical, single-service hostgroups, two are checking a URL (bugzilla & twiki) and one is a Win32 service check. Sample 'broken' hostgroup config: #/etc/nagios/objects/twiki.cfg define hostgroup{ hostgroup_name twiki-servers alias TWiki Servers } define service{ use generic-service hostgroup_name twiki-servers service_description TWiki check_command check_http!$HOSTADDRESS$!twiki/bin/view/Software } When I run nagios -v with cfg_file=/etc/nagios/objects/twiki.cfg present, it complains: Error: Could not expand hostgroups and/or hosts specified in service (config file '/etc/nagios/objects/twiki.cfg', starting on line 24) Error processing object config files! I'm stumped as to why it cannot expand the hostname. It's defined in the same file and nowhere else. The only other place "twiki-servers" is referenced is in one of the other host files in conf.d: Sample host config (actually works regardless of whether broken hostgroups are included) #/etc/nagios/conf.d/hudson.cfg define host{ use linux-server ; Name of host template to use host_name hudson alias hudson hostgroups linux-servers,mysql-servers,twiki-servers,bugzilla-servers address hudson.sw.domain.com parents t1-router, cisco-3560g } If I comment out that include in nagios.cfg, things run just fine. The hostgroup_name definition does not appear anywhere else except that file. I would think that the exchange.cfg file would fail as well, since they're implemented identically. The same thing happens with the other 2 definitions, bugzilla.cfg and winhttp.cfg. They're very similar and use the exchange.cfg template. Am I missing something? This all *looks* like it should work. -- -Chris ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Wed Feb 3 11:54:00 2010 From: nagios at flatto.net (Assaf Flatto) Date: Wed, 03 Feb 2010 10:54:00 +0000 Subject: more nagiosgraph issues Message-ID: <4B6955C8.3020306@flatto.net> Hello List I've been struggling with nagiosgraph (1.4) and nagios 3.2 for the last 3 weeks and can't get it to work, my setting are as follows : Nagios.cfg # PROCESS PERFORMANCE DATA OPTION process_performance_data=1 service_perfdata_file=/var/spool/nagios/perfdata.log service_perfdata_file_template=$LASTSERVICECHECK$||$HOSTNAME$||$SERVICEDESC$||$SERVICEOUTPUT$||$SERVICEPERFDATA$ service_perfdata_file_mode=a service_perfdata_file_processing_interval=30 service_perfdata_file_processing_command=process-service-perfdata commands.cfg: # Nagios Performance Commands define command { command_name process-service-perfdata command_line /usr/local/nagios/lib/insert.pl } nagiosgraph.conf : ( cleared from all comments ) debug = 5 debug_showgraph = 5 logfile = /usr/local/nagios/var/nagiosgraph.log rrddir = /usr/local/nagios/nagiosgraph/rrd mapfile = /usr/local/nagios/nagiosgraph/map colorscheme = 1 heartbeat = 600 perflog = /var/spool/nagios/perfdata.log dbseparator = subdir plotas = LINE2 maximums = Current Load,PLW,Procs: total,Procs: zombie,User Count minimums = APCUPSD,Mem: free,Mem: swap withmaximums = PING withminimums = PING hostdb = /usr/local/nagios/nagiosgraph/hostdb.conf servdb = /usr/local/nagios/nagiosgraph/servdb.conf nagioscgiurl = https://nagiosserver/nagios/cgi-bin javascript = /nagios/nagiosgraph.js stylesheet = /nagios/stylesheets/nagiosgraph.css graphlabels = true small = 650x150 clear = clear the list selecthost = Select server selectitems = Optionally, select the data set(s) to graph: selectserv = Select service fixedscale = Fixed Scale submit = Update Graphs zoom = Resize the graphs: perfforhost = Performance data for host perfforserv = Performance data for service service = service asof = as of dai = Today daily = Daily day = Today week = This Week weekly = Weekly month = This Month monthly = Monthly year = This Year yearly = Yearly configerror = Configuration Error (email Alan). noservicegiven = Bad URL for showservice.cgi; no service given apcupsd = Uninterruptible Power Supply Status (Battery Charge, Tempurature, Load Percentage, Time Left) bps = Bits Per Second clamdb = Clam Database diskgb = Disk Usage in Gigabytes diskpct = Disk Usage in Percent http = Bits Per Second load = Load Average losspct = Loss Percentage mailq = Pending Output E-mail Messages memory = RAM Usage Mem%3A%20swap = Swap Utilization swap = Swap Utilization ping = Ping Loss Percentage and Round Trip Average pingloss = Ping Loss Percentage pingrta = Ping Round Trip Average PLW = Perl Log Watcher Events procs = Processes qsize = Messages in Outbound Queue rta = Round Trip Average smtp = E-mail Status testcolor = Show Colors typesome = Type some space seperated nagiosgraph line names here graph = Graph previous = previous next = next createdby = Created by nagiosgraph directory and all sub dirs and files have nagios:nagios ownership with full rwx for group and user. By using rrdtool dump I can see data in the rrd files , but no graphs are being displayed . from nagiosgraph.log Wed Feb 3 10:03:02 2010 insert.pl debug getrules(/usr/local/nagios/nagiosgraph/map) Wed Feb 3 10:03:02 2010 insert.pl debug inputdata() Wed Feb 3 10:03:02 2010 insert.pl debug inputdata empty /var/spool/nagios/perfdata.log Wed Feb 3 10:03:02 2010 insert.pl debug insert.pl exited but doing tail -f /var/spool/nagios/perfdata.log 1265194199||dec1-be-107||Check NTP Time||NTP OK: Offset -0.0004923343658 secs||offset=-0.000492s;10.000000;500.000000; 1265194199||dec1-be-107||Total Processes||PROCS OK: 80 processes|| 1265194199||dec1-be-71||Check NTP Time||NTP OK: Offset -0.0008155107498 secs||offset=-0.000816s;10.000000;500.000000; 1265194199||dec1-be-71||Total Processes||PROCS OK: 130 processes|| Any one got any idea why this is happening and hoiw can i get it to work ? Thanks Assaf ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From electro93 at gmail.com Wed Feb 3 15:16:58 2010 From: electro93 at gmail.com (Jeff) Date: Wed, 3 Feb 2010 08:16:58 -0600 Subject: Event Handlers Message-ID: I have a service that needs to be monitored every minute. I need some help understanding how services go from soft to a hard state and if an event handler can be run after a service has gone into a hard state. I'm sure everyone has a very dynamic and custom environment to some extent. I have event handlers that will not run if a lock file is present (cause i am deploying code or so other scripts do not step on each other). So I for this service that I monitor every minute, I have Max Retries set to 3, Check Interval is 1, and retry interval is 1. Can someone help shed some light on how I can get an event handler to run again after a service has gone into a hard state? Thanks, JB -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From robert.wolfe at robertwolfe.org Wed Feb 3 15:51:54 2010 From: robert.wolfe at robertwolfe.org (Wolfe, Robert) Date: Wed, 3 Feb 2010 09:51:54 -0500 Subject: Asterisk Questions Message-ID: <863B3EE13473404BB5FA7516A1E31EF71B5C73EB31@exchange-01.wolfe.local> Hi all! I am wanting to write a plugin that will tell Asterisk to make a test call every so often and then report back to Nagios if a call was successful or not. The caveat here is that Asterisk is on one server and Nagios is on its own dedicated server. If anyone knows of any script or plugin that can do this, I would greately appreciate any pointers in the right direction. Thanks! ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Feb 3 16:57:09 2010 From: marc at ena.com (Marc Powell) Date: Wed, 3 Feb 2010 09:57:09 -0600 Subject: Event Handlers In-Reply-To: References: Message-ID: <24FE7944-8D14-4009-A207-E810CEEB6EC9@ena.com> On Feb 3, 2010, at 8:16 AM, Jeff wrote: > I have a service that needs to be monitored every minute. I need some help understanding how services go from soft to a hard state When a service check results in a non-OK state, services go from a Soft to a Hard state when they reach max_check_attempts. http://nagios.sourceforge.net/docs/3_0/statetypes.html > and if an event handler can be run after a service has gone into a hard state. Only for it's initial Hard problem state or initial Hard recovery state. http://nagios.sourceforge.net/docs/3_0/eventhandlers.html > I'm sure everyone has a very dynamic and custom environment to some extent. I have event handlers that will not run if a lock file is present (cause i am deploying code or so other scripts do not step on each other). So I for this service that I monitor every minute, I have Max Retries set to 3, Check Interval is 1, and retry interval is 1. Can someone help shed some light on how I can get an event handler to run again after a service has gone into a hard state? You can't really... The only real facility nagios has to do this (that I can think of right now) is is_volatile (http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#service) but that's probably overkill for your needs; particularly the notification implications. -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dave.list at pixelhammer.com Wed Feb 3 16:59:29 2010 From: dave.list at pixelhammer.com (DAve) Date: Wed, 03 Feb 2010 10:59:29 -0500 Subject: check load oddity Message-ID: <4B699D61.20703@pixelhammer.com> This is making no sense to me at all. It is obvious to me that my load is less than the critical threshold, why is the plugin reporting a critical state? [root at monitor1 plugins]# ./check_load 2.0 1.8 1.5 3.0 2.8 2.5 CRITICAL - load average: 1.96, 1.01, 0.75|load1=1.960;0.000;0.000;0; load5=1.010;0.000;0.000;0; load15=0.750;0.000;0.000;0; Usage:check_load [-r] -w WLOAD1,WLOAD5,WLOAD15 -c CLOAD1,CLOAD5,CLOAD15 Options: -h, --help Print detailed help screen -V, --version Print version information -w, --warning=WLOAD1,WLOAD5,WLOAD15 Exit with WARNING status if load average exceeds WLOADn -c, --critical=CLOAD1,CLOAD5,CLOAD15 Exit with CRITICAL status if load average exceed CLOADn the load average format is the same used by "uptime" and "w" -r, --percpu Divide the load averages by the number of CPUs (when possible) Completely confused with this one. DAve -- "Posterity, you will know how much it cost the present generation to preserve your freedom. I hope you will make good use of it. If you do not, I shall repent in heaven that ever I took half the pains to preserve it." John Adams http://appleseedinfo.org ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From justinp at norchemlab.com Wed Feb 3 17:33:44 2010 From: justinp at norchemlab.com (Justin T Pryzby) Date: Wed, 3 Feb 2010 09:33:44 -0700 Subject: check load oddity In-Reply-To: <4B699D61.20703@pixelhammer.com> References: <4B699D61.20703@pixelhammer.com> Message-ID: <20100203163344.GA4554@norchemlab.com> I've seen that with check_procs. Justin ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Wed Feb 3 17:46:29 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Wed, 03 Feb 2010 08:46:29 -0800 Subject: Could not expand hostgroups error In-Reply-To: References: Message-ID: <4B69A865.8080308@hp.com> Chris Weiss wrote: > When I run nagios -v with cfg_file=/etc/nagios/objects/twiki.cfg > present, it complains: > Error: Could not expand hostgroups and/or hosts specified in service > (config file '/etc/nagios/objects/twiki.cfg', starting on line 24) > Error processing object config files! > > I'm stumped as to why it cannot expand the hostname. It's defined in > the same file and nowhere else. The only other place "twiki-servers" > is referenced is in one of the other host files in conf.d > That's precisely the reason for the error. You're assigning a service to a hostgroup you have not defined. ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Wed Feb 3 17:48:01 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Wed, 03 Feb 2010 08:48:01 -0800 Subject: check load oddity In-Reply-To: <4B699D61.20703@pixelhammer.com> References: <4B699D61.20703@pixelhammer.com> Message-ID: <4B69A8C1.5030208@hp.com> DAve wrote: > This is making no sense to me at all. It is obvious to me that my load > is less than the critical threshold, why is the plugin reporting a > critical state? > > [root at monitor1 plugins]# ./check_load 2.0 1.8 1.5 3.0 2.8 2.5 > CRITICAL - load average: 1.96, 1.01, 0.75|load1=1.960;0.000;0.000;0; > load5=1.010;0.000;0.000;0; load15=0.750;0.000;0.000;0; > Re-read the output of the syntax help the plugin is giving you. You are not passing the plugin valid paramaters. ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Feb 3 17:49:41 2010 From: marc at ena.com (Marc Powell) Date: Wed, 3 Feb 2010 10:49:41 -0600 Subject: check load oddity In-Reply-To: <4B699D61.20703@pixelhammer.com> References: <4B699D61.20703@pixelhammer.com> Message-ID: <1ECD7741-300E-4139-AD64-7A480040D82F@ena.com> On Feb 3, 2010, at 9:59 AM, DAve wrote: > This is making no sense to me at all. It is obvious to me that my load > is less than the critical threshold, why is the plugin reporting a > critical state? Almost certainly because these don't match (e.g. you're using it wrong) - > [root at monitor1 plugins]# ./check_load 2.0 1.8 1.5 3.0 2.8 2.5 > Usage:check_load [-r] -w WLOAD1,WLOAD5,WLOAD15 -c CLOAD1,CLOAD5,CLOAD15 -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cwfox at us.fujitsu.com Wed Feb 3 18:28:55 2010 From: cwfox at us.fujitsu.com (Camron W. Fox) Date: Wed, 03 Feb 2010 07:28:55 -1000 Subject: Exclusions to check_procs plugin? Message-ID: <4B69B257.1060307@us.fujitsu.com> Alle, I've searched and found this question has been asked before, but there don't seem to have been any responses. Is it possible to exclude a process with the check_procs plugin. I have John the Ripper running on one of my machines which consistently uses 100% of one of the four CPUs: Cpu0 : 0.3%us, 0.3%sy, 0.0%ni, 93.0%id, 6.3%wa, 0.0%hi, 0.0%si, 0.0%st Cpu1 : 0.7%us, 0.7%sy, 0.0%ni, 97.0%id, 1.3%wa, 0.0%hi, 0.3%si, 0.0%st Cpu2 : 0.3%us, 0.3%sy, 0.0%ni, 99.3%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st Cpu3 :*100.0%us*, 0.0%sy, 0.0%ni, 0.0%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st Mem: 3912852k total, 3761780k used, 151072k free, 180016k buffers Swap: 8388600k total, 112k used, 8388488k free, 2994944k cached PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND 11066 root 25 0 12376 7012 644 R 100.1 0.2 70505:06 john I'd like to ignore this if possible. Best Regards, Camron -- Camron W. Fox Hilo Office High Performance Computing Group Fujitsu Management Services of America, Inc. E-mail: cwfox at us.fujitsu.com ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dave.list at pixelhammer.com Wed Feb 3 19:09:29 2010 From: dave.list at pixelhammer.com (DAve) Date: Wed, 03 Feb 2010 13:09:29 -0500 Subject: check load oddity In-Reply-To: <4B69A8C1.5030208@hp.com> References: <4B699D61.20703@pixelhammer.com> <4B69A8C1.5030208@hp.com> Message-ID: <4B69BBD9.8040503@pixelhammer.com> Morris, Patrick wrote: > DAve wrote: >> This is making no sense to me at all. It is obvious to me that my load >> is less than the critical threshold, why is the plugin reporting a >> critical state? >> >> [root at monitor1 plugins]# ./check_load 2.0 1.8 1.5 3.0 2.8 2.5 >> CRITICAL - load average: 1.96, 1.01, 0.75|load1=1.960;0.000;0.000;0; >> load5=1.010;0.000;0.000;0; load15=0.750;0.000;0.000;0; >> > > Re-read the output of the syntax help the plugin is giving you. You are > not passing the plugin valid paramaters. The preloaded command in NagiosQL is wrong and I never even checked it. Coffee needed... sigh... [root at monitor1 plugins]# ./check_load -w 2.0,1.8,1.5 -c 3.0,2.8,2.5 OK - load average: 1.76, 0.86, 0.72|load1=1.760;2.000;3.000;0; load5=0.860;1.800;2.800;0; load15=0.720;1.500;2.500;0; DAve -- "Posterity, you will know how much it cost the present generation to preserve your freedom. I hope you will make good use of it. If you do not, I shall repent in heaven that ever I took half the pains to preserve it." John Adams http://appleseedinfo.org ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cburke at innova-partners.com Wed Feb 3 19:31:32 2010 From: cburke at innova-partners.com (Chip Burke) Date: Wed, 3 Feb 2010 13:31:32 -0500 Subject: Change From Address in notifications Message-ID: <4B0D54ED353C46969513A4D928F9DFD6@innova.local> Is there a way to statically set the from address in notifications? Our notifications are all being generated as nagios at localhost which causes them to be blocked by various spam filters. Thanks! ________________________________________ Chip Burke -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Feb 3 19:55:44 2010 From: marc at ena.com (Marc Powell) Date: Wed, 3 Feb 2010 12:55:44 -0600 Subject: Exclusions to check_procs plugin? In-Reply-To: <4B69B257.1060307@us.fujitsu.com> References: <4B69B257.1060307@us.fujitsu.com> Message-ID: <9177189F-A227-4682-A9B5-19B87838BC97@ena.com> On Feb 3, 2010, at 11:28 AM, Camron W. Fox wrote: > Alle, > > I've searched and found this question has been asked before, but there > don't seem to have been any responses. > Is it possible to exclude a process with the check_procs plugin. The --help details no such option so being trusting and not going through the code, I'd say no. A quick search of http://exchange.nagios.org shows a number of check_proc* scripts. Perhaps one of them has that functionality or could easily be modified to ignore processes you specify. -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From chris.weiss at gmail.com Wed Feb 3 20:07:41 2010 From: chris.weiss at gmail.com (Chris Weiss) Date: Wed, 3 Feb 2010 11:07:41 -0800 Subject: Could not expand hostgroups error In-Reply-To: <4B69A865.8080308@hp.com> References: <4B69A865.8080308@hp.com> Message-ID: On Wed, Feb 3, 2010 at 8:46 AM, Morris, Patrick wrote: > Chris Weiss wrote: >> >> When I run nagios -v with cfg_file=/etc/nagios/objects/twiki.cfg >> present, it complains: >> Error: Could not expand hostgroups and/or hosts specified in service >> (config file '/etc/nagios/objects/twiki.cfg', starting on line 24) >> ? Error processing object config files! >> >> I'm stumped as to why it cannot expand the hostname. It's defined in >> the same file and nowhere else. The only other place "twiki-servers" >> is referenced is in one of the other host files in conf.d >> > > That's precisely the reason for the error. You're assigning a service to a > hostgroup you have not defined. > Patrick - I'm defining the hostgroup immediately above the service definition in that file. I could see where that might not be supported, but I do the same thing with my exchange.cfg file (and all the others) and that seems to work fine. Matt - I do have one host that references those hostgroups. Thanks for the link! I'd run across (and bookmarked) it before when I was hunting for ways to optimize Nagios configs. -- -Chris ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From schneemann at b1-systems.de Wed Feb 3 20:09:53 2010 From: schneemann at b1-systems.de (Christian Schneemann) Date: Wed, 3 Feb 2010 20:09:53 +0100 Subject: Change From Address in notifications In-Reply-To: <4B0D54ED353C46969513A4D928F9DFD6@innova.local> References: <4B0D54ED353C46969513A4D928F9DFD6@innova.local> Message-ID: <201002032009.53915.schneemann@b1-systems.de> Hi, On Wednesday 03 February 2010 07:31:32 pm Chip Burke wrote: > Is there a way to statically set the from address in notifications? Our > notifications are all being generated as nagios at localhost which causes them > to be blocked by various spam filters. just edit the command in your nagios configuration and add the option -r to the mail commando define command{ command_name notify-service-by-email command_line /usr/bin/printf "%b" "..." | /usr/bin/mail -s "..." $CONTACTEMAIL$ -r nagios at example.com } see man mailx Greetings, Christian > > > > Thanks! > > > > ________________________________________ > Chip Burke > -- Christian Schneemann Geschaeftsfuehrer: Ralph Dehner IT Consultant & Trainer Unternehmenssitz: Vohburg B1 Systems GmbH Amtsgericht: Ingolstadt Mobil: +49-(0)-1757250665 Handelsregister: HRB 3537 EMail: schneemann at b1-systems.de http://www.b1-systems.de Adresse: B1 Systems GmbH, Osterfeldstra?e 7, 85088 Vohburg GPG: http://pgpkeys.pca.dfn.de/pks/lookup?op=get&search=0x2FA8643A41BDAB81 ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Feb 3 20:07:53 2010 From: marc at ena.com (Marc Powell) Date: Wed, 3 Feb 2010 13:07:53 -0600 Subject: Change From Address in notifications In-Reply-To: <4B0D54ED353C46969513A4D928F9DFD6@innova.local> References: <4B0D54ED353C46969513A4D928F9DFD6@innova.local> Message-ID: <84285DF3-6564-45C2-9ED7-CC955EB86B81@ena.com> On Feb 3, 2010, at 12:31 PM, Chip Burke wrote: > Is there a way to statically set the from address in notifications? Our notifications are all being generated as nagios at localhost which causes them to be blocked by various spam filters. This is a configuration problem with your mailer daemon; it sounds like you haven't properly configured it to be Internet friendly. How you fix it depends on what MTA (postfix, sendmail, exim, etc) you're using on the Nagios machine and is pretty much outside the scope of this list. All accounts on the machine will have this problem, not just the nagios account. http://support.nagios.com/knowledgebase/faqs/index.php?option=com_content&view=article&id=52&catid=35&faq_id=338 If you just want to fix it for your nagios user, there are several options that may work for you. Google result you may find useful (several ways discussed) -- http://www.groundworkopensource.com/community/forums/viewtopic.php?t=1286 -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cburke at innova-partners.com Wed Feb 3 20:16:04 2010 From: cburke at innova-partners.com (Chip Burke) Date: Wed, 3 Feb 2010 14:16:04 -0500 Subject: Change From Address in notifications In-Reply-To: <201002032009.53915.schneemann@b1-systems.de> References: <4B0D54ED353C46969513A4D928F9DFD6@innova.local> <201002032009.53915.schneemann@b1-systems.de> Message-ID: <29C8FA7DBBE641EE9CB73C7E7DF792ED@innova.local> That's got it, thanks! ________________________________________ Chip Burke -----Original Message----- From: Christian Schneemann [mailto:schneemann at b1-systems.de] Sent: Wednesday, February 03, 2010 2:10 PM To: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] Change From Address in notifications Hi, On Wednesday 03 February 2010 07:31:32 pm Chip Burke wrote: > Is there a way to statically set the from address in notifications? Our > notifications are all being generated as nagios at localhost which causes them > to be blocked by various spam filters. just edit the command in your nagios configuration and add the option -r to the mail commando define command{ command_name notify-service-by-email command_line /usr/bin/printf "%b" "..." | /usr/bin/mail -s "..." $CONTACTEMAIL$ -r nagios at example.com } see man mailx Greetings, Christian > > > > Thanks! > > > > ________________________________________ > Chip Burke > -- Christian Schneemann Geschaeftsfuehrer: Ralph Dehner IT Consultant & Trainer Unternehmenssitz: Vohburg B1 Systems GmbH Amtsgericht: Ingolstadt Mobil: +49-(0)-1757250665 Handelsregister: HRB 3537 EMail: schneemann at b1-systems.de http://www.b1-systems.de Adresse: B1 Systems GmbH, Osterfeldstra?e 7, 85088 Vohburg GPG: http://pgpkeys.pca.dfn.de/pks/lookup?op=get&search=0x2FA8643A41BDAB81 ---------------------------------------------------------------------------- -- The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rick.mangus+nagios at gmail.com Wed Feb 3 21:49:37 2010 From: rick.mangus+nagios at gmail.com (Rick Mangus) Date: Wed, 3 Feb 2010 14:49:37 -0600 Subject: Nagios 3.0.5 problem In-Reply-To: <77c454bb1001290901s5248b437yba4c50994e8bf649@mail.gmail.com> References: <77c454bb1001290901s5248b437yba4c50994e8bf649@mail.gmail.com> Message-ID: <77c454bb1002031249v7536f3f0l925d71e1077ff0b6@mail.gmail.com> Well, I have more information to add. I found a script that was being launched at midnight to purge old data from the database. The tables being pruned are used by perfparse to store perfdata and the like. They have > 180M rows, are 30-60GB, and are actively being inserted into all the while. As I understand it, they are InnoDB and should be using row (not table) locks, and really should not have much trouble with concurrent inserts. While this goes on, one CPU/core is largely in iowait, but the other 7 are largely idle, and we generally don't have any trouble with RAM or other resource exhaustion. Now that I know what caused my problem, I can reproduce it, which is ... interesting. After only a few minutes, nagios starts falling behind on service checks. It appears to be getting new checks with current timestamps in the nagios.log, but a service detail sorted by "Last Check" descending slowly shows the timestamps getting further and further behind current. A bit later, nagios starts taking 100% of 2 CPU cores, and nsca processes start to stack up... leading to the problem as I was observing it in the morning. In an attempt to diagnose I tried a few things. I have found that by the time nagios starts to bug out it can't be saved. If you cancel the delete query after seeing a lag on the check results, it does not slowly improve, and 'catch up' as I had hoped. This happens even if there are no rows to be deleted, though not if you use LIMIT to keep the query to a reasonable timeframe. I'm still looking for fresh ideas, but in the meantime I am writing a script to loop over the delete and do it in 10,000 row increments which are ~10 seconds instead of ~3M rows which takes over an hour per table. If you do the math, though, you'll see it'll be nearly as time-consuming, and I'm just hoping that we'll lock whatever is going on for a shorter period with room for inserts to happen in-between. Even if that 'fixes' it, I won't be satisfied. Any and all suggestions are welcomed. --Rick On Fri, Jan 29, 2010 at 11:01 AM, Rick Mangus > wrote: > Hello, all. > > Forgive me, I am new to the list, and have only begun working with nagios > recently. I have searched this list and googled furiously with little > result, so must cease my lurking and present my problem to you. > > I will begin with the problem: Sometime after midnight every night, my > nagios server starts to have trouble processing service checks. I don't > know the cause, and cannot find a solution. I can describe the symptoms in > detail and hope we can diagnose it. > > The web interface shows the last service check came in at 02:28:34 (EST). > I know that around 4:15 every morning, xinetd starts refusing connections to > nsca due to high load (max_load is 18), and that eventually I will have > 32000+ nsca connections using up all available PIDs leading to an inability > to fork new processes, effectively killing the machine. While all this > happens, the nagios.log appears to periodically stall, making no new entries > for 15 minutes at a time, and then flush 15000 in the space of a single > second. Also, it seems the checkresults directory is empty most of the > time, but sometimes pops up to 2045 files (it's on a ramdisk with 2048 > inodes) and not a single one gets deleted in a time period I have been > patient enough to observe. > > The periods in which the nagios log is going nowhere are accompanied by > nagios taking 100% of 2 CPUs. One thread appears to poll() approximately > every 25 usecs, and another is inscrutable, with mprotect() the only > strace-visible syscall. All the nsca processes have a blocking write() they > are waiting on. When the log is showing new entries, there are still no > updates made to the services, and it seems that that is what is filling up > checkresults. I admit I have not checked to find the order of the log and > checkresults processes, though I assumed they would operate in the opposite > order of what this appears to show. > > I know this behavior has been ongoing for at least 1 month. I have > disabled all cron jobs that I feared might be interfering. I will answer > any and all questions to the best of my ability, and hope someone here can > shed some light on the situation. > > --Rick > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mike-nagios at 5dninja.net Wed Feb 3 23:01:12 2010 From: mike-nagios at 5dninja.net (Mike Lindsey) Date: Wed, 03 Feb 2010 14:01:12 -0800 Subject: Event Handlers In-Reply-To: <24FE7944-8D14-4009-A207-E810CEEB6EC9@ena.com> References: <24FE7944-8D14-4009-A207-E810CEEB6EC9@ena.com> Message-ID: <4B69F228.4020500@5dninja.net> Marc Powell wrote: > On Feb 3, 2010, at 8:16 AM, Jeff wrote: > >> I have a service that needs to be monitored every minute. I need some help understanding how services go from soft to a hard state > > When a service check results in a non-OK state, services go from a Soft to a Hard state when they reach max_check_attempts. http://nagios.sourceforge.net/docs/3_0/statetypes.html > >> and if an event handler can be run after a service has gone into a hard state. > > Only for it's initial Hard problem state or initial Hard recovery state. http://nagios.sourceforge.net/docs/3_0/eventhandlers.html > >> I'm sure everyone has a very dynamic and custom environment to some extent. I have event handlers that will not run if a lock file is present (cause i am deploying code or so other scripts do not step on each other). So I for this service that I monitor every minute, I have Max Retries set to 3, Check Interval is 1, and retry interval is 1. Can someone help shed some light on how I can get an event handler to run again after a service has gone into a hard state? > > You can't really... The only real facility nagios has to do this (that I can think of right now) is is_volatile (http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html#service) but that's probably overkill for your needs; particularly the notification implications. The other possibility for having something run every time the service is checked, is to configure your ocsp_command. Not exactly what it's generally used for, but it'll do in a pinch. -- Mike Lindsey ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From stuart.browne at ausregistry.com.au Wed Feb 3 23:11:53 2010 From: stuart.browne at ausregistry.com.au (Stuart Browne) Date: Thu, 4 Feb 2010 09:11:53 +1100 Subject: Asterisk Questions In-Reply-To: <863B3EE13473404BB5FA7516A1E31EF71B5C73EB31@exchange-01.wolfe.local> References: <863B3EE13473404BB5FA7516A1E31EF71B5C73EB31@exchange-01.wolfe.local> Message-ID: <8CEF048B9EC83748B1517DC64EA130FB3E2C7F5313@off-win2003-01.ausregistrygroup.local> > -----Original Message----- > From: Wolfe, Robert [mailto:robert.wolfe at robertwolfe.org] > Sent: Thursday, 4 February 2010 01:52 > > Hi all! > > I am wanting to write a plugin that will tell Asterisk to make a test > call every so often and then report back to Nagios if a call was > successful or not. The caveat here is that Asterisk is on one server > and Nagios is on its own dedicated server. > > If anyone knows of any script or plugin that can do this, I would > greately appreciate any pointers in the right direction. Use NRPE (active) or NSCA (passive) for the check result submission. Write your test routine, house it on your remote server. For NRPE, use 'check_nrpe -H -c '. Have the NRPE daemon listen on your asterisk box with the '' defined in the 'nrpe.cfg' file. For NSCA, the NSCA daemon listens on your Nagios server, and a cron-job runs your check and submits the results. See http://nagios.sourceforge.net/docs/3_0/addons.html for a better description and more documentation the check submission methods. Stuart ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cburke at innova-partners.com Wed Feb 3 23:51:41 2010 From: cburke at innova-partners.com (Chip Burke) Date: Wed, 3 Feb 2010 17:51:41 -0500 Subject: Customizing notifications Message-ID: <79781DFD43A74C42AD17425EAB2C6445@innova.local> I have a request to "plain English"-ify my notifications. One item I have been asked for is when the service state changes, to report the duration of the previous service state. Example: HTTP is now OK after 00:02:35 of down time. Is there an easy way to do this? It seems Nagios doesn't offer a Last State Duration macro, so I am assuming this is going to be a matter of some sort of custom scripting. Has anyone had experience with this sort of thing? Thanks! ________________________________________ Chip Burke -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mike-nagios at 5dninja.net Thu Feb 4 02:50:36 2010 From: mike-nagios at 5dninja.net (Mike Lindsey) Date: Wed, 03 Feb 2010 17:50:36 -0800 Subject: Customizing notifications In-Reply-To: <79781DFD43A74C42AD17425EAB2C6445@innova.local> References: <79781DFD43A74C42AD17425EAB2C6445@innova.local> Message-ID: <4B6A27EC.10304@5dninja.net> Chip Burke wrote: > I have a request to ?plain English?-ify my notifications. One item I > have been asked for is when the service state changes, to report the > duration of the previous service state. > > Example: HTTP is now OK after 00:02:35 of down time. > > Is there an easy way to do this? It seems Nagios doesn?t offer a Last > State Duration macro, so I am assuming this is going to be a matter of > some sort of custom scripting. Has anyone had experience with this sort > of thing? Likely, your best option will be to set up an event handler script for that service. If you already have event handlers configured, and you want this logic to run everywhere, consider setting up a script like this for your global event handler. In the event handler, you will want to touch a file in /tmp based on the host, service, and state, whenever there's a hard state change. Like, /tmp/localhost-load-ok... You could even simplify if all you care is ok/not ok. Then in your notification script, just check for the presence of those files, and do your date calculation by pulling the modification date out with stat (or script code, if your notification command isn't a chunk of bash). Something like: now=`date +%s` if [ "${NAGIOS_LASTSSERVICESTATE}" == "OK"] then time=`echo ${now} - ${filetime} | bc` filetime=`stat -f "%m" /tmp/localhost-load-notok` else time=`echo ${now} - ${filetime} | bc` filetime=`stat -f "%m" /tmp/localhost-load-ok` fi echo "${NAGIOS_SERVICE} is now ${NAGIOS_SERVICESTATE} after ${time} seconds." You might want to flesh it out with some file-exists tests as well. Good luck! -- Mike Lindsey ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Feb 4 05:53:21 2010 From: marc at ena.com (Marc Powell) Date: Wed, 3 Feb 2010 22:53:21 -0600 Subject: Customizing notifications In-Reply-To: <79781DFD43A74C42AD17425EAB2C6445@innova.local> References: <79781DFD43A74C42AD17425EAB2C6445@innova.local> Message-ID: <7A85BF8A-2AD2-4773-8B60-10330788CB16@ena.com> On Feb 3, 2010, at 4:51 PM, Chip Burke wrote: > I have a request to ?plain English?-ify my notifications. One item I have been asked for is when the service state changes, to report the duration of the previous service state. > > Example: HTTP is now OK after 00:02:35 of down time. > > Is there an easy way to do this? It seems Nagios doesn?t offer a Last State Duration macro, so I am assuming this is going to be a matter of some sort of custom scripting. Has anyone had experience with this sort of thing? $LASTSERVICEOK$ has potential, depending on when it's updated. "This is a timestamp in time_t format (seconds since the UNIX epoch) indicating the time at which the service was last detected as being in an OK state." so time_t(now) - $LASTSERVICEOK$ = number of seconds in non-OK state I am _assuming_ that the macro is not updated until after the recovery notification is sent. -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Thu Feb 4 08:29:16 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Wed, 03 Feb 2010 23:29:16 -0800 Subject: Nagios 3.0.5 problem In-Reply-To: <77c454bb1002031249v7536f3f0l925d71e1077ff0b6@mail.gmail.com> References: <77c454bb1001290901s5248b437yba4c50994e8bf649@mail.gmail.com> <77c454bb1002031249v7536f3f0l925d71e1077ff0b6@mail.gmail.com> Message-ID: <4B6A774C.1020207@hp.com> Rick Mangus wrote: > Well, I have more information to add. > > I found a script that was being launched at midnight to purge old data > from the database. The tables being pruned are used by perfparse to > store perfdata and the like. They have > 180M rows, are 30-60GB, and > are actively being inserted into all the while. As I understand it, > they are InnoDB and should be using row (not table) locks, and really > should not have much trouble with concurrent inserts. While this goes > on, one CPU/core is largely in iowait, but the other 7 are largely > idle, and we generally don't have any trouble with RAM or other > resource exhaustion. Are your check results going to the same disk partition where all the I/O is happening? If Nagios is stuck waiting for disk, moving them somewhere else may just fix your problem. ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From shacky83 at gmail.com Thu Feb 4 13:58:01 2010 From: shacky83 at gmail.com (shacky) Date: Thu, 4 Feb 2010 13:58:01 +0100 Subject: Connecting more Nagios installation Message-ID: <7fedbc911002040458v3e5d6fe3vb5cd17f245f1ee5e@mail.gmail.com> Hi. I have two Nagios installations, can I "connect" them together to have the status of both installations in the same "tactical overview" window of one of them? Thank you very much! Bye. ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Thu Feb 4 15:51:42 2010 From: nagios at flatto.net (Assaf Flatto) Date: Thu, 04 Feb 2010 14:51:42 +0000 Subject: Connecting more Nagios installation In-Reply-To: <7fedbc911002040458v3e5d6fe3vb5cd17f245f1ee5e@mail.gmail.com> References: <7fedbc911002040458v3e5d6fe3vb5cd17f245f1ee5e@mail.gmail.com> Message-ID: <4B6ADEFE.3030207@flatto.net> shacky wrote: > Hi. > > I have two Nagios installations, can I "connect" them together to have > the status of both installations in the same "tactical overview" > window of one of them? > > Thank you very much! > Bye. > See distributed nagios http://nagios.sourceforge.net/docs/3_0/distributed.html ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michael.moritz at gmail.com Thu Feb 4 19:44:24 2010 From: michael.moritz at gmail.com (Mike Moritz) Date: Thu, 4 Feb 2010 12:44:24 -0600 Subject: Nagios 3.2 verify-config Message-ID: I am in the process of upgrading from Nagios 2.9 to 3.2. In 2.9 we would get a waring when running "nagios -v nagios.cfg" : Warning: Service 'Check_http' on host 'blah.com' has no default contact group(s) defined! 3.2 does not give this warning. Is this a change in the nagios verification or a problem with my configuration? I found it very helpful to highlight checks that would not be sent to anyone. Thanks Mike ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Feb 4 20:55:09 2010 From: marc at ena.com (Marc Powell) Date: Thu, 4 Feb 2010 13:55:09 -0600 Subject: Nagios 3.2 verify-config In-Reply-To: References: Message-ID: <8AEE7C04-E89F-4DC0-A126-194F465AC719@ena.com> On Feb 4, 2010, at 12:44 PM, Mike Moritz wrote: > I am in the process of upgrading from Nagios 2.9 to 3.2. > > In 2.9 we would get a waring when running "nagios -v nagios.cfg" : > > Warning: Service 'Check_http' on host 'blah.com' has no default > contact group(s) defined! > > 3.2 does not give this warning. Is this a change in the nagios > verification or a problem with my configuration? http://nagios.sourceforge.net/docs/3_0/whatsnew.html "13. Object Inheritance: ... ? Services now inherit contact groups, notification interval, and notification period from their associated host if not otherwise specified. (Read more)" -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mgagne at iweb.com Thu Feb 4 20:10:57 2010 From: mgagne at iweb.com (=?ISO-8859-1?Q?Mathieu_Gagn=E9?=) Date: Thu, 04 Feb 2010 14:10:57 -0500 Subject: Nagios 3.2 verify-config In-Reply-To: References: Message-ID: <4B6B1BC1.60105@iweb.com> On 2/4/10 1:44 PM, Mike Moritz wrote: > I am in the process of upgrading from Nagios 2.9 to 3.2. > > In 2.9 we would get a waring when running "nagios -v nagios.cfg" : > > Warning: Service 'Check_http' on host 'blah.com' has no default > contact group(s) defined! > > 3.2 does not give this warning. Is this a change in the nagios > verification or a problem with my configuration? I found it very > helpful to highlight checks that would not be sent to anyone. contagroups is inherited from the host if none is defined in the service: http://nagios.sourceforge.net/docs/3_0/objectinheritance.html#implied_inheritance -- Mathieu ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rick.mangus+nagios at gmail.com Thu Feb 4 21:59:56 2010 From: rick.mangus+nagios at gmail.com (Rick Mangus) Date: Thu, 4 Feb 2010 14:59:56 -0600 Subject: Nagios 3.0.5 problem In-Reply-To: <4B6A774C.1020207@hp.com> References: <77c454bb1001290901s5248b437yba4c50994e8bf649@mail.gmail.com> <77c454bb1002031249v7536f3f0l925d71e1077ff0b6@mail.gmail.com> <4B6A774C.1020207@hp.com> Message-ID: <77c454bb1002041259j2e6dd0f4keef8917b3dfc3178@mail.gmail.com> Patrick: The check results are being written to a ramdisk, so I think we're safe on that.? :( I am now starting to examine what happens when nagios sends perfdata to service_perfdata_command. In the case of perfparse, it is a perl script that opens a named pipe for appending, and tacks the perfdata on the end. That leaves an option for blocking i/o that should exit in 5 seconds due to the perfdata_timeout, but should happen ~2000x/minute. While I think I can tweak the database performance or queue my perfdata logging, I still think there's something inelegant about that. On the other hand, once it works, I'll stop spending my time writing test scripts and sql queries. --Rick On Thu, Feb 4, 2010 at 1:29 AM, Morris, Patrick wrote: > > Rick Mangus wrote: > Are your check results going to the same disk partition where all the I/O is happening? ?If Nagios is stuck waiting for disk, moving them somewhere else may just fix your problem. ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From shoaibahmedbaig at gmail.com Thu Feb 4 23:20:50 2010 From: shoaibahmedbaig at gmail.com (Shoaib Ahmed Baig) Date: Fri, 5 Feb 2010 03:20:50 +0500 Subject: Help Needed Message-ID: <963f18931002041420p2cd48fb2x3fb15b6086f83eae@mail.gmail.com> HI, Can any one explain/guide me what is the difference between the central and distributive server. And what will be there role if i setup a distributive server with the central Server -- Regards, Shoaib Ahmed Baig Network Engineer Systems Limited Lahore Chamber of Commerce Building 11, Sharae Aiwane Tijarat Lahore - Pakistan Tel: + 92 42 6304825, 6304835 and 6369390 Ext: 228 Fax: + 92 42 6368857 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Thu Feb 4 23:45:24 2010 From: nagios at flatto.net (Assaf Flatto) Date: Thu, 04 Feb 2010 22:45:24 +0000 Subject: Help Needed In-Reply-To: <963f18931002041420p2cd48fb2x3fb15b6086f83eae@mail.gmail.com> References: <963f18931002041420p2cd48fb2x3fb15b6086f83eae@mail.gmail.com> Message-ID: <4B6B4E04.2040805@flatto.net> > HI, > > Can any one explain/guide me what is the difference between the > central and distributive server. And what will be there role if i > setup a distributive server with the central Server > I think that what you need is explained here : http://nagios.sourceforge.net/docs/3_0/distributed.html ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michael.moritz at gmail.com Fri Feb 5 01:34:13 2010 From: michael.moritz at gmail.com (Mike Moritz) Date: Thu, 4 Feb 2010 18:34:13 -0600 Subject: Nagios 3.2 verify-config In-Reply-To: References: Message-ID: Thanks to all that pointed me in the right direction. Inheriting from the host configuration is a nice (new) feature. Mike On Thu, Feb 4, 2010 at 12:44 PM, Mike Moritz wrote: > I am in the process of upgrading from Nagios 2.9 to 3.2. > > In 2.9 we would get a waring when running "nagios -v nagios.cfg" : > > ? Warning: Service 'Check_http' on host 'blah.com' ?has no default > contact group(s) defined! > > 3.2 does not give this warning. ?Is this a change in the nagios > verification or a problem with my configuration? ?I found it very > helpful to highlight checks that would not be sent to anyone. > > > Thanks > Mike > ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From texner at eoipso.com Fri Feb 5 11:47:56 2010 From: texner at eoipso.com (Tobias Exner) Date: Fri, 05 Feb 2010 11:47:56 +0100 Subject: check_siemens_highpath error in packet Message-ID: <4B6BF75C.9090107@eoipso.com> Hi list, today I'm trying to use the "check_siemens_hipath" script to monoring our HIGHPATH 3500 When I start that script manually I've got the following errors: //check_siemens_highpath -H 192.168.202.19 -C public -w 10 -c 20 Error in packet Reason: (noSuchName) There is no such variable name in this MIB. Failed object: H150E-MIB::cardState.1 ./check_siemens_highpath: line 117: test: too many arguments ./check_siemens_highpath: line 118: test: -ge: unary operator expected ./check_siemens_highpath: line 119: [: -gt: unary operator expected OK - Overall system state(normal); All 10 expansion cards(normal); CPU Load 0% (w/c 10%/20%); Uptime: 85:21:23:51.77 / Any ideas? -- Tobias -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Feb 5 13:17:15 2010 From: marc at ena.com (Marc Powell) Date: Fri, 5 Feb 2010 06:17:15 -0600 Subject: check_siemens_highpath error in packet In-Reply-To: <4B6BF75C.9090107@eoipso.com> References: <4B6BF75C.9090107@eoipso.com> Message-ID: <4720353C-4EE6-4AD8-8FE4-446584A6921D@ena.com> On Feb 5, 2010, at 4:47 AM, Tobias Exner wrote: > Hi list, > > > today I'm trying to use the "check_siemens_hipath" script to monoring our HIGHPATH 3500 > > When I start that script manually I've got the following errors: > > > /check_siemens_highpath -H 192.168.202.19 -C public -w 10 -c 20 > Error in packet > Reason: (noSuchName) There is no such variable name in this MIB. > Failed object: H150E-MIB::cardState.1 I'm not familiar with this plugin and assume you got it from Nagios Exchange... Did you install the accompanying H150E-MIB.txt file into /usr/share/snmp/mibs? -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kyleodonnell at gmail.com Fri Feb 5 15:48:04 2010 From: kyleodonnell at gmail.com (Kyle O'Donnell) Date: Fri, 5 Feb 2010 09:48:04 -0500 Subject: nagios 3.0.x 3.2.x compat Message-ID: <2274b9c31002050648p4dc2caa1yfe1824c466745e38@mail.gmail.com> Are the config file syntaxs 100% compatible? ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mirko.stefanelli at gmail.com Fri Feb 5 16:12:29 2010 From: mirko.stefanelli at gmail.com (mirko stefanelli) Date: Fri, 5 Feb 2010 16:12:29 +0100 Subject: Problem with check_by_ssh Message-ID: <9eaab8b81002050712m75df21cbh61eaf9175c08eee0@mail.gmail.com> Hi to all, We run nagios on Ubuntu server with 2Gb ramm kernel version 2.6.24-23-server. Nagios version is 3.2.0. We have configured all our server (Linux REL) to execute our custom plugin by check_by_ssh. The problem is that for some server this check fails and the output is: CRITICAL - Plugin timed out while executing system call If we try to do same command, executed with check_by_shh, from user nagios on cli the command work fine! Any idea? Regards, Mirko. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Feb 5 16:38:31 2010 From: marc at ena.com (Marc Powell) Date: Fri, 5 Feb 2010 09:38:31 -0600 Subject: nagios 3.0.x 3.2.x compat In-Reply-To: <2274b9c31002050648p4dc2caa1yfe1824c466745e38@mail.gmail.com> References: <2274b9c31002050648p4dc2caa1yfe1824c466745e38@mail.gmail.com> Message-ID: On Feb 5, 2010, at 8:48 AM, Kyle O'Donnell wrote: > Are the config file syntaxs 100% compatible? Yes, always (at least 3.2.x works with 3.0.x files). There's never been breakage within the same major version although new tags may be added that aren't understood by older versions. You can always find what has changed between specific versions here - http://www.nagios.org/development/history/core-3x -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From smcafee at collaborativefusion.com Fri Feb 5 16:33:08 2010 From: smcafee at collaborativefusion.com (Sean McAfee) Date: Fri, 05 Feb 2010 10:33:08 -0500 Subject: Problem with check_by_ssh In-Reply-To: <9eaab8b81002050712m75df21cbh61eaf9175c08eee0@mail.gmail.com> References: <9eaab8b81002050712m75df21cbh61eaf9175c08eee0@mail.gmail.com> Message-ID: <4B6C3A34.40105@collaborativefusion.com> mirko stefanelli wrote: > Hi to all, > > We run nagios on Ubuntu server with 2Gb ramm kernel version > 2.6.24-23-server. Nagios version is 3.2.0. > > We have configured all our server (Linux REL) to execute our custom > plugin by check_by_ssh. The problem is that for some server this check > fails and the output is: > > CRITICAL - Plugin timed out while executing system call > > If we try to do same command, executed with check_by_shh, from user > nagios on cli the command work fine! > > Any idea? > > Regards, > Mirko. Doing it on the CLI as the Nagios user was the right first step. Could you send your command definition, host definition, service definition, and any templates used? It's very hard to troubleshoot without those. -- Sean McAfee ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mirko.stefanelli at gmail.com Fri Feb 5 16:42:17 2010 From: mirko.stefanelli at gmail.com (mirko stefanelli) Date: Fri, 5 Feb 2010 16:42:17 +0100 Subject: Problem with check_by_ssh In-Reply-To: <4B6C3A34.40105@collaborativefusion.com> References: <9eaab8b81002050712m75df21cbh61eaf9175c08eee0@mail.gmail.com> <4B6C3A34.40105@collaborativefusion.com> Message-ID: <9eaab8b81002050742j6f916d75q8ca43b05a12febd3@mail.gmail.com> Hi, yes if I doing it from the CLI as Nagios User is work fine. Here: Command definition: # 'check_tablespace' command definition_ define command{ command_name check_tablespace command_line $USER1$/check_by_ssh -H $HOSTADDRESS$ -C "/usr/local/nagios/libexec/check_tablespace $ARG1$ $ARG2$ $ARG3$ $ARG4$" Host definition: define host{ use linux-server ; Name of host template to use ; This host definition will inherit all variables that are defined ; in (or inherited by) the linux-server host template definition. host_name siper_afr alias siper_afr address 10.5.3.xxx parents routerIperAfr } service definition: define service{ use local-service ; Name of service template to use host_name siper_afr service_description tablespace check_command check_tablespace!siper!!!89 notifications_enabled 0 } Service template definition: # Local service definition template - This is NOT a real service, just a template! define service{ name local-service ; The name of this service template use generic-service ; Inherit default values from the generic-service definition max_check_attempts 4 ; Re-check the service up to 4 times in order to determine its final (hard) state normal_check_interval 5 ; Check the service every 5 minutes under normal conditions retry_check_interval 1 ; Re-check the service every minute until a hard state can be determined register 0 ; DONT REGISTER THIS DEFINITION - ITS NOT A REAL SERVICE, JUST A TEMPLATE! } thank in advance, Mirko. 2010/2/5 Sean McAfee > mirko stefanelli wrote: > >> Hi to all, >> >> We run nagios on Ubuntu server with 2Gb ramm kernel version >> 2.6.24-23-server. Nagios version is 3.2.0. >> >> We have configured all our server (Linux REL) to execute our custom plugin >> by check_by_ssh. The problem is that for some server this check fails and >> the output is: >> >> CRITICAL - Plugin timed out while executing system call >> >> If we try to do same command, executed with check_by_shh, from user nagios >> on cli the command work fine! >> >> Any idea? >> >> Regards, >> Mirko. >> > > Doing it on the CLI as the Nagios user was the right first step. > > Could you send your command definition, host definition, service > definition, and any templates used? It's very hard to troubleshoot without > those. > > -- > Sean McAfee > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Feb 5 16:57:53 2010 From: marc at ena.com (Marc Powell) Date: Fri, 5 Feb 2010 09:57:53 -0600 Subject: Problem with check_by_ssh In-Reply-To: <9eaab8b81002050742j6f916d75q8ca43b05a12febd3@mail.gmail.com> References: <9eaab8b81002050712m75df21cbh61eaf9175c08eee0@mail.gmail.com> <4B6C3A34.40105@collaborativefusion.com> <9eaab8b81002050742j6f916d75q8ca43b05a12febd3@mail.gmail.com> Message-ID: On Feb 5, 2010, at 9:42 AM, mirko stefanelli wrote: > Hi, > > yes if I doing it from the CLI as Nagios User is work fine. > > Here: > > Command definition: > > # 'check_tablespace' command definition_ > define command{ > command_name check_tablespace > command_line $USER1$/check_by_ssh -H $HOSTADDRESS$ -C "/usr/local/nagios/libexec/check_tablespace $ARG1$ $ARG2$ $ARG3$ $ARG4$" Does it/can it take longer than 10 seconds to run the check_tablespace command? If so, you need to adjust the service_check_timeout value in nagios.cfg if you haven't already done so. You'll probably also want to verify that the check_tablespace command itself doesn't have a built-in timeout. -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From EWScott at scotborders.gov.uk Fri Feb 5 17:25:53 2010 From: EWScott at scotborders.gov.uk (Scott, Ewan) Date: Fri, 5 Feb 2010 16:25:53 +0000 Subject: How can I get more information - eventlog descriptions - in my notification emails Message-ID: <771645F4ADF2C9449B7E6AE630C720018BC4BD233A@HQ-MAIL-01.scotborders.gov.uk> Hi I'm looking to get more information in my notificiation emails. Specifically, I'd like to get the text of Windows server Eventlogs into the body of the email. This information is coming across to the nagios server and can be found in the log file. See example below: Feb 5 15:08:36 sbc-omega-test nagios: SERVICE NOTIFICATION: nagiosadmin;hq-test-01;EventLog TempDB start chk;WARNING;notify-service-by-email;informational: MSSQLSERVER: Starting up database ReportServerTempDB. (384), eventlog: 384 warning # I'd like to get the information MSSQLSERVER: Starting up database ReportServerTempDB. (384), eventlog: 384 warning into the body of the email too. However, all I get is: ***** Nagios ***** Notification Type: PROBLEM Service: EventLog TempDB start chk Host: Windows test server Address: 150.122.120.29 State: WARNING Date/Time: Fri Feb 5 15:08:36 GMT 2010 Additional Info: I had hoped I could resolve this by changing the $SERVICEOUTPUT$ to $LONGSERVICEOUTPUT$ in the notify-service-by-email command definition (see below). # 'notify-service-by-email' command definition define command{ command_name notify-service-by-email command_line /usr/bin/printf "%b" "***** Nagios *****\n\nNotification Type: $NOTIFICATIONTYPE$\n\nService: $SERVICEDESC$\nHost: $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState: $SERVICESTATE$\n\nDate/Time: $LONGDATETIME$\n\nAdditional Info:\n\n$LONGSERVICEOUTPUT$" | /usr/bin/mail -s "** $NOTIFICATIONTYPE$ Service Alert: $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **" $CONTACTEMAIL$ } However this did not give any change in the output. That's all I've changed on the server. Can anyone give me pointers as to what else I need to change? Regards Ewan Sytem info: Nagios core - version - 3.2.0 uname -a Linux sbc-omega-test 2.6.31-14-server #48-Ubuntu SMP Fri Oct 16 15:07:34 UTC 2009 x86_64 GNU/Linux # nagios.cfg: =========== log_file=/usr/local/nagios/var/nagios.log cfg_file=/usr/local/nagios/etc/objects/commands.cfg cfg_file=/usr/local/nagios/etc/objects/contacts.cfg cfg_file=/usr/local/nagios/etc/objects/timeperiods.cfg cfg_file=/usr/local/nagios/etc/objects/templates.cfg cfg_file=/usr/local/nagios/etc/objects/localhost.cfg cfg_file=/usr/local/nagios/etc/objects/switch.cfg cfg_file=/usr/local/nagios/etc/objects/hosts.cfg cfg_file=/usr/local/nagios/etc/objects/hostgroups.cfg cfg_file=/usr/local/nagios/etc/objects/services.cfg object_cache_file=/usr/local/nagios/var/objects.cache precached_object_file=/usr/local/nagios/var/objects.precache resource_file=/usr/local/nagios/etc/resource.cfg status_file=/usr/local/nagios/var/status.dat status_update_interval=10 nagios_user=nagios nagios_group=nagios check_external_commands=1 command_check_interval=-1 command_file=/usr/local/nagios/var/rw/nagios.cmd external_command_buffer_slots=4096 lock_file=/usr/local/nagios/var/nagios.lock temp_file=/usr/local/nagios/var/nagios.tmp temp_path=/tmp event_broker_options=-1 log_rotation_method=d log_archive_path=/usr/local/nagios/var/archives use_syslog=1 log_notifications=1 log_service_retries=1 log_host_retries=1 log_event_handlers=1 log_initial_states=0 log_external_commands=1 log_passive_checks=1 service_inter_check_delay_method=s max_service_check_spread=30 service_interleave_factor=s host_inter_check_delay_method=s max_host_check_spread=30 max_concurrent_checks=0 check_result_reaper_frequency=10 max_check_result_reaper_time=30 check_result_path=/usr/local/nagios/var/spool/checkresults max_check_result_file_age=3600 cached_host_check_horizon=15 cached_service_check_horizon=15 enable_predictive_host_dependency_checks=1 enable_predictive_service_dependency_checks=1 soft_state_dependencies=0 auto_reschedule_checks=0 auto_rescheduling_interval=30 auto_rescheduling_window=180 sleep_time=0.25 service_check_timeout=60 host_check_timeout=30 event_handler_timeout=30 notification_timeout=30 ocsp_timeout=5 perfdata_timeout=5 retain_state_information=1 state_retention_file=/usr/local/nagios/var/retention.dat retention_update_interval=60 use_retained_program_state=1 log_file=/usr/local/nagios/var/nagios.log cfg_file=/usr/local/nagios/etc/objects/commands.cfg cfg_file=/usr/local/nagios/etc/objects/contacts.cfg cfg_file=/usr/local/nagios/etc/objects/timeperiods.cfg cfg_file=/usr/local/nagios/etc/objects/templates.cfg cfg_file=/usr/local/nagios/etc/objects/localhost.cfg cfg_file=/usr/local/nagios/etc/objects/switch.cfg cfg_file=/usr/local/nagios/etc/objects/hosts.cfg cfg_file=/usr/local/nagios/etc/objects/hostgroups.cfg cfg_file=/usr/local/nagios/etc/objects/services.cfg object_cache_file=/usr/local/nagios/var/objects.cache precached_object_file=/usr/local/nagios/var/objects.precache resource_file=/usr/local/nagios/etc/resource.cfg status_file=/usr/local/nagios/var/status.dat status_update_interval=10 nagios_user=nagios nagios_group=nagios check_external_commands=1 command_check_interval=-1 command_file=/usr/local/nagios/var/rw/nagios.cmd external_command_buffer_slots=4096 lock_file=/usr/local/nagios/var/nagios.lock temp_file=/usr/local/nagios/var/nagios.tmp temp_path=/tmp event_broker_options=-1 log_rotation_method=d log_archive_path=/usr/local/nagios/var/archives use_syslog=1 log_notifications=1 log_service_retries=1 log_host_retries=1 log_event_handlers=1 log_initial_states=0 log_external_commands=1 log_passive_checks=1 service_inter_check_delay_method=s max_service_check_spread=30 service_interleave_factor=s host_inter_check_delay_method=s max_host_check_spread=30 max_concurrent_checks=0 check_result_reaper_frequency=10 max_check_result_reaper_time=30 check_result_path=/usr/local/nagios/var/spool/checkresults max_check_result_file_age=3600 cached_host_check_horizon=15 cached_service_check_horizon=15 enable_predictive_host_dependency_checks=1 enable_predictive_service_dependency_checks=1 soft_state_dependencies=0 auto_reschedule_checks=0 auto_rescheduling_interval=30 auto_rescheduling_window=180 sleep_time=0.25 service_check_timeout=60 host_check_timeout=30 event_handler_timeout=30 notification_timeout=30 ocsp_timeout=5 perfdata_timeout=5 retain_state_information=1 state_retention_file=/usr/local/nagios/var/retention.dat retention_update_interval=60 use_retained_program_state=1 use_retained_scheduling_info=1 retained_host_attribute_mask=0 retained_service_attribute_mask=0 retained_process_host_attribute_mask=0 retained_process_service_attribute_mask=0 retained_contact_host_attribute_mask=0 retained_contact_service_attribute_mask=0 interval_length=60 check_for_updates=1 bare_update_check=0 use_aggressive_host_checking=0 execute_service_checks=1 accept_passive_service_checks=1 execute_host_checks=1 accept_passive_host_checks=1 enable_notifications=1 enable_event_handlers=1 process_performance_data=0 obsess_over_services=0 obsess_over_hosts=0 translate_passive_host_checks=0 passive_host_checks_are_soft=0 check_for_orphaned_services=1 check_for_orphaned_hosts=1 check_service_freshness=1 service_freshness_check_interval=60 check_host_freshness=0 host_freshness_check_interval=60 additional_freshness_latency=15 enable_flap_detection=1 low_service_flap_threshold=5.0 high_service_flap_threshold=20.0 low_host_flap_threshold=5.0 high_host_flap_threshold=20.0 date_format=us p1_file=/usr/local/nagios/bin/p1.pl enable_embedded_perl=1 use_embedded_perl_implicitly=1 illegal_object_name_chars=`~!$%^&*|'"<>?,()= illegal_macro_output_chars=`~$&|'"<> use_regexp_matching=0 use_true_regexp_matching=0 admin_email=ewscott at scotborders.gov.uk admin_pager=ewscott at scotborders.gov.uk daemon_dumps_core=0 use_large_installation_tweaks=0 enable_environment_macros=1 debug_level=0 debug_verbosity=1 debug_file=/usr/local/nagios/var/nagios.debug max_debug_file_size=1000000 ********************************************************************** This email and any files transmitted with it are privileged, confidential and subject to copyright. Any unauthorised use or disclosure of any part of this email is prohibited. If you are not the intended recipient please inform the sender immediately; you should then delete the email and remove any copies from your system. The views or opinions expressed in this communication may not necessarily be those of Scottish Borders Council. Please be advised that Scottish Borders Council's incoming and outgoing email is subject to regular monitoring and any email may require to be disclosed by the Council under the provisions of the Freedom of Information (Scotland) Act 2002. ********************************************************************** -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From clair at practeo.ch Fri Feb 5 17:49:30 2010 From: clair at practeo.ch (Clair Johnson) Date: Fri, 5 Feb 2010 16:49:30 +0000 Subject: service notification Message-ID: <0CC98CB467A5E044A9C403E9688943C704E55F@exchange2010-vm.office.practeo.ch> Hi all, I have simples questions. Is it possible not to execute host checks but execute services checks ? Is it possible to send services notifications when the host is not checked ? Because I don't understand, I'm not notified of services changes. here are parts of my conf files (They come from my old nagios 2.9 configuration, I'm actualy moving from 2.9 to 3.2) : ##### static definitions ## notifications commands define command{ command_name notify-by-mail command_line /opt/nagios/libexec/notify_manager mail $CONTACTNAME$ "$CONTACTALIAS$" $CONTACTEMAIL$ $HOSTADDRESS$ "$HOSTALIAS$" "$SERVICEDESC$" $SERVICESTATE$ "$SERVICEOUTPUT$" $DATE$ $TIME$ $LASTSERVICESTATECHANGE$ } define command{ command_name notify-by-sms command_line /opt/nagios/libexec/notify_manager sms $CONTACTNAME$ "$CONTACTALIAS$" $CONTACTEMAIL$ $HOSTADDRESS$ "$HOSTALIAS$" "$SERVICEDESC$" $SERVICESTATE$ "$SERVICEOUTPUT$" $DATE$ $TIME$ $LASTSERVICESTATECHANGE$ } define command{ command_name host-notify-by-mail command_line /opt/nagios/libexec/dummy } define command{ command_name host-notify-by-sms command_line /opt/nagios/libexec/dummy } ## contacts templates define contact{ name generic-contact-mail host_notification_period 24x7 host_notification_options d,r host_notification_commands host-notify-by-mail service_notification_options w,u,c,r service_notification_commands notify-by-mail register 0 } define contact{ name generic-contact-sms host_notification_period 24x7 host_notification_options d,r host_notification_commands host-notify-by-sms service_notification_options c service_notification_commands notify-by-sms register 0 } ## host template define host{ name generic-host ; The name of this host template notifications_enabled 1 ; Host notifications are enabled event_handler_enabled 1 ; Host event handler is enabled flap_detection_enabled 1 ; Flap detection is enabled failure_prediction_enabled 1 ; Failure prediction is enabled process_perf_data 1 ; Process performance data retain_status_information 1 ; Retain status information across program restarts retain_nonstatus_information 1 ; Retain non-status information across program restarts check_interval 1 max_check_attempts 2 notification_interval 120 notification_period 24x7 notification_options d,r register 0 ; DONT REGISTER THIS DEFINITION - ITS NOT A REAL HOST, JUST A TEMPLATE! } ## service template define service{ name generic-service ; The 'name' of this service template active_checks_enabled 1 ; Active service checks are enabled passive_checks_enabled 1 ; Passive service checks are enabled/accepted parallelize_check 1 ; Active service checks should be parallelized (disabling this can lead to major performance problems) obsess_over_service 1 ; We should obsess over this service (if necessary) check_freshness 0 ; Default is to NOT check service 'freshness' notifications_enabled 1 ; Service notifications are enabled event_handler_enabled 1 ; Service event handler is enabled flap_detection_enabled 1 ; Flap detection is enabled failure_prediction_enabled 1 ; Failure prediction is enabled process_perf_data 1 ; Process performance data retain_status_information 1 ; Retain status information across program restarts retain_nonstatus_information 1 ; Retain non-status information across program restarts is_volatile 0 check_period 24x7 notification_interval 2880 notification_options w,u,c,r,f notification_period 24x7 register 0 ; DONT REGISTER THIS DEFINITION - ITS NOT A REAL SERVICE, JUST A TEMPLATE! } ##### dynamic definitions ## contacts definition define contact{ use generic-contact-mail contact_name contact_4_2_mail alias Clair Johnson email clair at practeo.ch service_notification_period timeperiod_4_2_mail } define timeperiod{ timeperiod_name timeperiod_4_2_mail alias timeperiod_4_2_mail monday 07:00-22:00 tuesday 07:00-22:00 wednesday 07:00-22:00 thursday 07:00-22:00 friday 07:00-22:00 sunday 07:00-22:00 saturday 07:00-22:00 } define contact{ use generic-contact-sms contact_name contact_4_2_sms alias Clair Johnson email null service_notification_period timeperiod_4_2_sms } define timeperiod{ timeperiod_name timeperiod_4_2_sms alias timeperiod_4_2_sms } ## host definition define host{ use generic-host host_name localhost alias supervision address localhost contact_groups contactgroup_2 } ## service definition define service{ use generic-service host_name localhost service_description http contact_groups contactgroup_2 check_command check_http!"80" normal_check_interval 20 retry_check_interval 20 max_check_attempts 3 } A run a complete custom interface but everything else seems working normally, except that the transition between state changes in only 1 check attempt and not what is specified in the service description. Dose some one has any idea why I don't get notifications from services checks and why the max_check_attempts is not working normaly ? Thx a lot for the help Clair Johnson -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Feb 5 17:50:41 2010 From: marc at ena.com (Marc Powell) Date: Fri, 5 Feb 2010 10:50:41 -0600 Subject: How can I get more information - eventlog descriptions - in my notification emails In-Reply-To: <771645F4ADF2C9449B7E6AE630C720018BC4BD233A@HQ-MAIL-01.scotborders.gov.uk> References: <771645F4ADF2C9449B7E6AE630C720018BC4BD233A@HQ-MAIL-01.scotborders.gov.uk> Message-ID: <1EBC9A40-E73F-4218-BA85-3114D846C3DE@ena.com> On Feb 5, 2010, at 10:25 AM, Scott, Ewan wrote: > Hi > > I'm looking to get more information in my notificiation emails. Specifically, I'd like to get the text of Windows server Eventlogs into the body of the email. This information is coming across to the nagios server and can be found in the log file. > See example below: > Feb 5 15:08:36 sbc-omega-test nagios: SERVICE NOTIFICATION: nagiosadmin;hq-test-01;EventLog TempDB start chk;WARNING;notify-service-by-email;informational: MSSQLSERVER: Starting up database ReportServerTempDB. (384), eventlog: 384 warning > I had hoped I could resolve this by changing the $SERVICEOUTPUT$ to $LONGSERVICEOUTPUT$ in the notify-service-by-email command definition (see below). > # 'notify-service-by-email' command definition > define command{ > command_name notify-service-by-email > command_line /usr/bin/printf "%b" "***** Nagios *****\n\nNotification > Type: $NOTIFICATIONTYPE$\n\nService: $SERVICEDESC$\nHost: $HOSTALIAS$\nAddress: > $HOSTADDRESS$\nState: $SERVICESTATE$\n\nDate/Time: $LONGDATETIME$\n\nAdditional > Info:\n\n$LONGSERVICEOUTPUT$" | /usr/bin/mail -s "** $NOTIFICATIONTYPE$ Service > Alert: $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **" $CONTACTEMAIL$ > } > However this did not give any change in the output. That's all I've changed on the server. Can anyone give me pointers as to what else I need to change? $SERVICEOUTPUT$ is the macro that you want. It looks like the plugin is only outputting one line of text. Make sure it is/was typed correctly in the notification command above. -- http://nagios.sourceforge.net/docs/3_0/macrolist.html#serviceoutput $SERVICEOUTPUT$ The first line of text output from the last service check (i.e. "Ping OK"). $LONGSERVICEOUTPUT$ The full text output (aside from the first line) from the last service check. (e.g. everything beyond the first line of text). -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tony.johansson at svenskakyrkan.se Fri Feb 5 17:41:48 2010 From: tony.johansson at svenskakyrkan.se (Tony Johansson) Date: Fri, 5 Feb 2010 17:41:48 +0100 Subject: Nagios 3.2.0 process dies silently - help! Message-ID: <241F4872A72F0244B0A81A8C852682C3933B6C@khusex01.khus.ad.svenskakyrkan.se> Hello, Our nagios 3.2.0 installation is having major problems. The nagios process dies silently about 10-60 seconds after beeing started. No record as to why in any logfiles. Have tried setting max debug (debug_level=-1 and debug_verbosity=2) in nagios.cfg - nothing. System is a CentOS release 5.4 which has been running fine for months. Any ideas on how to troubleshoot this or what is going on? Regards, Tony -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Fri Feb 5 18:05:25 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Fri, 05 Feb 2010 09:05:25 -0800 Subject: service notification In-Reply-To: <0CC98CB467A5E044A9C403E9688943C704E55F@exchange2010-vm.office.practeo.ch> References: <0CC98CB467A5E044A9C403E9688943C704E55F@exchange2010-vm.office.practeo.ch> Message-ID: <4B6C4FD5.3010402@hp.com> Clair Johnson wrote: > > Hi all, > > > > I have simples questions. > > > > Is it possible *not* to execute host checks but execute services checks ? > > Is it possible to send services notifications when the host is not > checked ? > > > > Because I don't understand, I'm not notified of services changes. > > > > here are parts of my conf files (They come from my old nagios 2.9 > configuration, I'm actualy moving from 2.9 to 3.2) : > > > > [config snipped] > > > > A run a complete custom interface but everything else seems working > normally, except that the transition between state changes in only 1 > check attempt and not what is specified in the service description. > > > > Dose some one has any idea why I don't get notifications from services > checks and why the max_check_attempts is not working normaly ? > That all looks right at frist glance. It's normal for the state to transition after one check... is it going into a hard state at that point? That's when the notification is generated, so if you're not waiting for that third attempt, that would explain why you don't see a notification. It's also not clear to me if you're receiving notifications for the host but not for the service, which again, would be normal. If you don't care about the host, you can just igve it a check command that never fails (check_dummy works well for that), but I wouldn't normally recommend doing that. What's in the Nagios logs regarding notifications on that host and service? Those would give a far clearer idea of what's going on than just guessing. ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ton.voon at opsera.com Fri Feb 5 18:27:38 2010 From: ton.voon at opsera.com (Ton Voon) Date: Fri, 5 Feb 2010 17:27:38 +0000 Subject: Nagios 3.2.0 process dies silently - help! In-Reply-To: <241F4872A72F0244B0A81A8C852682C3933B6C@khusex01.khus.ad.svenskakyrkan.se> References: <241F4872A72F0244B0A81A8C852682C3933B6C@khusex01.khus.ad.svenskakyrkan.se> Message-ID: <904E8633-5EAA-4E3D-820D-285BE86BFA35@opsera.com> On 5 Feb 2010, at 16:41, Tony Johansson wrote: > Hello, > > Our nagios 3.2.0 installation is having major problems. > The nagios process dies silently about 10-60 seconds after beeing > started. No record as to why in any logfiles. > > Have tried setting max debug (debug_level=-1 and debug_verbosity=2) > in nagios.cfg - nothing. > > System is a CentOS release 5.4 which has been running fine for months. > > Any ideas on how to troubleshoot this or what is going on? What entries are in nagios.log? Is there a core dump file? Take a copy of retention.dat, then remove it. Does that start it up? If that fixes it, there maybe a corruption in retention.dat. Can you then reduce the problem down by removing entries in retention.dat to find what the problem is. If we can get it recreated, I can add a test to pick on this specific error. Ton -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From benny at bennyvision.com Fri Feb 5 18:36:24 2010 From: benny at bennyvision.com (C. Bensend) Date: Fri, 5 Feb 2010 11:36:24 -0600 Subject: Nagios 3.2.0 process dies silently - help! In-Reply-To: <241F4872A72F0244B0A81A8C852682C3933B6C@khusex01.khus.ad.svenskakyrkan.se> References: <241F4872A72F0244B0A81A8C852682C3933B6C@khusex01.khus.ad.svenskakyrkan.se> Message-ID: > Our nagios 3.2.0 installation is having major problems. > > The nagios process dies silently about 10-60 seconds after beeing > started. No record as to why in any logfiles. > > Have tried setting max debug (debug_level=-1 and debug_verbosity=2) in > nagios.cfg - nothing. > > System is a CentOS release 5.4 which has been running fine for months. > > Any ideas on how to troubleshoot this or what is going on? I'd fire it up on the command line *without* -d, so it doesn't fire up as a daemon. Run it under strace so you can hopefully see if a syscall is failing. In my environment (RedHat 5.4), this would be: strace /usr/local/nagios/bin/nagios /usr/local/nagios/etc/nagios.cfg HTH, Benny -- "Blow with all your might into the bean stuffed straw." -- Spam message, 2006-11 ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Feb 5 18:45:37 2010 From: marc at ena.com (Marc Powell) Date: Fri, 5 Feb 2010 11:45:37 -0600 Subject: Nagios 3.2.0 process dies silently - help! In-Reply-To: <241F4872A72F0244B0A81A8C852682C3933B6C@khusex01.khus.ad.svenskakyrkan.se> References: <241F4872A72F0244B0A81A8C852682C3933B6C@khusex01.khus.ad.svenskakyrkan.se> Message-ID: On Feb 5, 2010, at 10:41 AM, Tony Johansson wrote: > Hello, > > Our nagios 3.2.0 installation is having major problems. > The nagios process dies silently about 10-60 seconds after beeing started. No record as to why in any logfiles. > > Have tried setting max debug (debug_level=-1 and debug_verbosity=2) in nagios.cfg - nothing. > > System is a CentOS release 5.4 which has been running fine for months. > > Any ideas on how to troubleshoot this or what is going on? Try running it in the foreground (without -d). If you don't see anything interesting when it dies, run it in the foreground through strace (strace -fFs512 /path/to/nagios -c /path/to/nagios.cfg). Verify you haven't run out of disk space or anything simple like that. If you're running SElinux, verify that there are no errors related to it in /var/log/messages. Is there anything special about the install or the machine it's running on? Are any of the nagios directories mounted from remote machines? -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From standalone.sysadmin at gmail.com Fri Feb 5 18:52:36 2010 From: standalone.sysadmin at gmail.com (Matt Simmons) Date: Fri, 5 Feb 2010 12:52:36 -0500 Subject: Problem with check_by_ssh In-Reply-To: <9eaab8b81002050712m75df21cbh61eaf9175c08eee0@mail.gmail.com> References: <9eaab8b81002050712m75df21cbh61eaf9175c08eee0@mail.gmail.com> Message-ID: <5bcb62b61002050952t7af84fe7m18eb42366a83a9bb@mail.gmail.com> My first guess would be that there is some sort of environmental variable being set when you log in that wouldn't be set when you aren't (ie, when Nagios triggers it). Are you using keys that have passphrases, maybe? Can you give us a look at the contents of check_by_ssh? --Matt On Fri, Feb 5, 2010 at 10:12 AM, mirko stefanelli wrote: > Hi to all, > > We run nagios on Ubuntu server with 2Gb ramm kernel version > 2.6.24-23-server. Nagios version is 3.2.0. > > We have configured all our server (Linux REL) to execute our custom plugin > by check_by_ssh. The problem is that for some server this check fails and > the output is: > > CRITICAL - Plugin timed out while executing system call > > If we try to do same command, executed with check_by_shh, from user nagios > on cli the command work fine! > > Any idea? > > Regards, > Mirko. > > ------------------------------------------------------------------------------ > The Planet: dedicated and managed hosting, cloud storage, colocation > Stay online with enterprise data centers and the best network in the > business > Choose flexible plans and management services without long-term contracts > Personal 24x7 support from experience hosting pros just a phone call away. > http://p.sf.net/sfu/theplanet-com > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- LITTLE GIRL: But which cookie will you eat FIRST? COOKIE MONSTER: Me think you have misconception of cookie-eating process. ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Feb 5 19:49:22 2010 From: marc at ena.com (Marc Powell) Date: Fri, 5 Feb 2010 12:49:22 -0600 Subject: Nagios 3.2.0 process dies silently - help! In-Reply-To: <4B6C6261.80103@svenskakyrkan.se> References: <241F4872A72F0244B0A81A8C852682C3933B6C@khusex01.khus.ad.svenskakyrkan.se> <4B6C6261.80103@svenskakyrkan.se> Message-ID: Please always respond on list so that others now, and in the future, can benefit from your experience. More below... On Feb 5, 2010, at 12:24 PM, Tony Johansson wrote: > Nothing special with the install, everything in the same machine. > Ran strace as suggested: > strace -fFs512 /usr/local/nagios/bin/nagios /usr/local/nagios/etc/nagios.cfg > > [pid 32731] write(3, "[1265393566.503713] [016.2] [pid=32731] Processed service performance data file output: 1265393559||AHS||C: Drive Space||c:\\ - total: 15.86 Gb - used: 7.60 Gb (48%) - free 8.26 Gb (52%)||c:\\ Used Space=7.60Gb;14.27;15.54;0.00;15.86\n", 232) = 232 > [pid 32731] _llseek(3, 0, [657557], SEEK_CUR) = 0 > [pid 32731] write(6, "1265393559||AHS||C: Drive Space||c:\\ - total: 15.86 Gb - used: 7.60 Gb (48%) - free 8.26 Gb (52%)||c:\\ Used Space=7.60Gb;14.27;15.54;0.00;15.86\n", 144) = -1 EFBIG (File too large) > [pid 32731] --- SIGXFSZ (File size limit exceeded) @ 0 (0) --- > [pid 32732] +++ killed by SIGXFSZ +++ > > "File size limit exceeded" seems to be the cause, ideas to remedy this? Looks like your performance data file has exceeded the size limit supported by your OS (probably 2GB). You need to remove it or truncate it. Look at the value of service_perfdata_file in nagios.cfg to determine what/were that file is. -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tony.johansson at svenskakyrkan.se Fri Feb 5 19:39:47 2010 From: tony.johansson at svenskakyrkan.se (Tony Johansson) Date: Fri, 05 Feb 2010 19:39:47 +0100 Subject: Nagios 3.2.0 process dies silently - help! In-Reply-To: References: <241F4872A72F0244B0A81A8C852682C3933B6C@khusex01.khus.ad.svenskakyrkan.se> Message-ID: <4B6C65F3.2000400@svenskakyrkan.se> Marc Powell skrev: > On Feb 5, 2010, at 10:41 AM, Tony Johansson wrote: > > >> Hello, >> >> Our nagios 3.2.0 installation is having major problems. >> The nagios process dies silently about 10-60 seconds after beeing started. No record as to why in any logfiles. >> >> Have tried setting max debug (debug_level=-1 and debug_verbosity=2) in nagios.cfg - nothing. >> >> System is a CentOS release 5.4 which has been running fine for months. >> >> Any ideas on how to troubleshoot this or what is going on? >> > > > Try running it in the foreground (without -d). If you don't see anything interesting when it dies, run it in the foreground through strace (strace -fFs512 /path/to/nagios -c /path/to/nagios.cfg). > > Verify you haven't run out of disk space or anything simple like that. If you're running SElinux, verify that there are no errors related to it in /var/log/messages. > > Is there anything special about the install or the machine it's running on? Are any of the nagios directories mounted from remote machines? > > -- > Marc > > Hello all, Nothing special with the install, everything in the same machine. Ran strace as suggested: strace -fFs512 /usr/local/nagios/bin/nagios /usr/local/nagios/etc/nagios.cfg [pid 32731] write(3, "[1265393566.503713] [016.2] [pid=32731] Processed service performance data file output: 1265393559||AHS||C: Drive Space||c:\\ - total: 15.86 Gb - used: 7.60 Gb (48%) - free 8.26 Gb (52%)||c:\\ Used Space=7.60Gb;14.27;15.54;0.00;15.86\n", 232) = 232 [pid 32731] _llseek(3, 0, [657557], SEEK_CUR) = 0 [pid 32731] write(6, "1265393559||AHS||C: Drive Space||c:\\ - total: 15.86 Gb - used: 7.60 Gb (48%) - free 8.26 Gb (52%)||c:\\ Used Space=7.60Gb;14.27;15.54;0.00;15.86\n", 144) = -1 EFBIG (File too large) [pid 32731] --- SIGXFSZ (File size limit exceeded) @ 0 (0) --- [pid 32732] +++ killed by SIGXFSZ +++ "File size limit exceeded" seems to be the cause Disk space is plenty: df -h Filesystem Size Used Avail Use% Mounted on /dev/mapper/VolGroup00-LogVol00 68G 28G 38G 43% / /dev/sda1 99M 30M 65M 32% /boot tmpfs 506M 0 506M 0% /dev/shm Also, I did try renaming retention.dat, status.dat and moving files out of checkresults earlier with no result. Seems like /var/spool/nagios/perfdata.log is 2G while /var/spool/nagios/perfdata.log is a mere 11K I've tried renaming the file and started nagios which now seems to run ok. Looks like I need to set up log rotation or what is the best way to handle perfdata.log? Many thanks, Tony ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From fevinkagen at gmail.com Sat Feb 6 06:23:33 2010 From: fevinkagen at gmail.com (fevin Kagen) Date: Sat, 6 Feb 2010 00:23:33 -0500 Subject: Hyperlink in Acknowledgement Comment Message-ID: Hi- I'm using nagios in conjunction w/ OTRS. All in all, it works great. However, we would like to replace the simple ticket number in the acknowledgement comment with a hyperlink to the actual ticket. Any ideas on how to do this? I've found the "Nagios::Acknowledge::HTTP::URL: " variable in the OTRS settings, but I can't seem to add a hyperlink since the "<" character is automatically removed. Thanks! fevin -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Sat Feb 6 17:10:14 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Sat, 06 Feb 2010 08:10:14 -0800 Subject: Hyperlink in Acknowledgement Comment In-Reply-To: References: Message-ID: <4B6D9466.3040006@hp.com> fevin Kagen wrote: > Hi- > I'm using nagios in conjunction w/ OTRS. All in all, it works great. > However, we would like to replace the simple ticket number in the > acknowledgement comment with a hyperlink to the actual ticket. Any > ideas on how to do this? I've found the > "Nagios::Acknowledge::HTTP::URL: " variable in the OTRS settings, but > I can't seem to add a hyperlink since the "<" character is > automatically removed. > Thanks! > fevin http://nagios.sourceforge.net/docs/3_0/configcgi.html#escape_html_tags ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From admin at dougware.net Mon Feb 8 05:36:03 2010 From: admin at dougware.net (Doug Eubanks) Date: Sun, 7 Feb 2010 23:36:03 -0500 Subject: 'Service is not scheduled to be checked...' & 'Host is not scheduled to be checked...' Message-ID: <8c87cb5a1002072036p50d14466tfb046b1e4f9e78e4@mail.gmail.com> I've used Nagios for a few years, but my new installation is really racking my brain. I was testing with my first host and first service, but they are never checked or scheduled to be checked. I want both active and passive checks, because I'm experimenting with my own distributed monitoring system (which is a first for me). I'm including the important parts of my config files. # cat /etc/nagios/nagios.cfg # Written by NagiosMainExporter from Lilac Configurator 1.0.3 on February 7, 2010, 3:19 pm log_file=/var/log/nagios/nagios.log temp_file=/var/nagios/nagios.tmp status_file=/var/nagios/status.dat status_update_interval=10 nagios_user=nagios nagios_group=nagios enable_notifications=1 execute_service_checks=1 accept_passive_service_checks=1 enable_event_handlers=1 log_rotation_method=d log_archive_path=/var/log/nagios/archives check_external_commands=1 command_check_interval=-1 command_file=/var/nagios/rw/nagios.cmd lock_file=/var/run/nagios.pid retain_state_information=1 state_retention_file=/var/nagios/retention.dat retention_update_interval=60 use_retained_program_state=1 use_syslog=1 log_notifications=1 log_service_retries=1 log_host_retries=1 log_event_handlers=1 log_initial_states=0 log_external_commands=1 log_passive_checks=1 external_command_buffer_slots=4096 sleep_time=0.25 service_interleave_factor=s max_concurrent_checks=0 service_reaper_frequency=10 interval_length=60 use_aggressive_host_checking=1 enable_flap_detection=1 low_service_flap_threshold=5 high_service_flap_threshold=20 low_host_flap_threshold=5 high_host_flap_threshold=20 soft_state_dependencies=0 service_check_timeout=60 host_check_timeout=30 event_handler_timeout=30 notification_timeout=30 ocsp_timeout=5 perfdata_timeout=5 obsess_over_services=1 process_performance_data=0 check_for_orphaned_services=1 check_service_freshness=1 date_format=us illegal_object_name_chars=()= illegal_macro_output_chars=`~$&|' admin_email=admin at dougware.net admin_pager=pagenagios at localhost execute_host_checks=1 service_inter_check_delay_method=s use_retained_scheduling_info=1 accept_passive_host_checks=1 max_service_check_spread=30 host_inter_check_delay_method=s max_host_check_spread=30 auto_reschedule_checks=1 auto_rescheduling_interval=30 auto_rescheduling_window=180 obsess_over_hosts=1 check_host_freshness=1 host_freshness_check_interval=60 service_freshness_check_interval=60 use_regexp_matching=0 use_true_regexp_matching=0 event_broker_options=-1 daemon_dumps_core=0 object_cache_file=/var/nagios/objects.cache precached_object_file=/var/nagios/objects.precache retained_host_attribute_mask=0 retained_service_attribute_mask=0 retained_process_host_attribute_mask=0 retained_process_service_attribute_mask=0 retained_contact_host_attribute_mask=0 retained_contact_service_attribute_mask=0 check_result_reaper_frequency=2 max_check_result_reaper_time=30 check_result_path=/var/nagios/spool/checkresults max_check_result_file_age=3600 translate_passive_host_checks=1 passive_host_checks_are_soft=1 enable_predictive_host_dependency_checks=1 enable_predictive_service_dependency_checks=1 cached_host_check_horizon=15 cached_service_check_horizon=15 use_large_installation_tweaks=1 enable_environment_macros=1 additional_freshness_latency=15 enable_embedded_perl=1 use_embedded_perl_implicitly=1 p1_file=/usr/bin/p1.pl debug_file=/var/log/nagios/nagios.debug debug_level=0 debug_verbosity=1 max_debug_file_size=1000000 resource_file=/etc/nagios/resource.cfg cfg_dir=/etc/nagios/objects cat /etc/nagios/objects/services.cfg # Written by NagiosServiceExporter from Lilac Configurator 1.0.3 on February 7, 2010, 3:19 pm define service { host_name Server1 service_description SSH initial_state o check_command check_ssh max_check_attempts 3 normal_check_interval 2 retry_interval 1 first_notification_delay 5 active_checks_enabled 1 passive_checks_enabled 1 check_period 24x7 parallelize_check 1 obsess_over_service 1 check_freshness 1 freshness_threshold 120 notification_interval 15 notification_period 24x7 notifications_enabled 1 display_name SSH notification_options w,c,r,s } cat /etc/nagios/objects/hosts.cfg # Written by NagiosHostExporter from Lilac Configurator 1.0.3 on February 7, 2010, 3:19 pm define host { host_name Server1 initial_state o check_command check_ping retry_interval 1 first_notification_delay 15 max_check_attempts 3 check_interval 2 passive_checks_enabled 1 check_period 24x7 obsess_over_host 1 check_freshness 1 freshness_threshold 120 active_checks_enabled 1 event_handler_enabled 0 notification_interval 15 notification_period 24x7 notifications_enabled 1 failure_prediction_enabled 0 alias server1.dougware.net display_name Server1 address server1.dougware.net notification_options d,u,r,s contact_groups DougWare hostgroups DougWare } I'm sure that I have a really simple issue, but I just can't put my finger on it. Thanks for the help! Sincerely, Doug Eubanks admin at dougware.net (919) 201-8750 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lennart.karssen at snow.nl Mon Feb 8 09:40:55 2010 From: lennart.karssen at snow.nl (L.C. Karssen) Date: Mon, 8 Feb 2010 09:40:55 +0100 (CET) Subject: Nagios web GUI, contact groups, hosts and services Message-ID: <50402.213.75.32.135.1265618455.squirrel@imap.snow.nl> Dear list, I'm presently working on a Nagios 3.2.0 setup that monitors approximately 1000 hosts and about 5000 services. The setup doesn't make use of Nagios' notification system, instead people at a control center (NOC) use the Nagios web interface to alert the appropriate people in case of an alert. The Nagios configuration is based on a set of host (group) templates, where services are assigned to host groups. For example: the 'check_swap' service definition is associated with the host group 'all unix hosts'. The problem I'm confronted with is that the people at the NOC don't need to see all services on a given host. For example, they shouldn't call the sysadmin at night if an SSL certificate check goes into critical state because the certificate is only valid for ten more days. So we want to remove that service from their view. In the present situation contact groups (used to determine which servers are visible to which department) are added to each specific host, but according to the Nagios docs (http://nagios.sourceforge.net/docs/3_0/cgiauth.html) a contact group can see _all_ services on a given host if it is listed as a contract group for the host. So I decided to remove the NOC contact group from the individual host definitions and to assign the NOC contact group only to specific services. This leads me to another problem. Some service checks (i.e. host groups) are used in one department only. This works fine. However, some other service checks (like check_swap for the 'all unix hosts' hostgroup) are shared by all departments, but some departments don't want the NOC to see check_swap alerts whereas others do want to pass these alerts to NOC. It would be possible to make services with slightly different names (e.g. check_swap_dept1, check_swap_dept2), each with the correct contact group. However, that seems to be a needless increase of complexity. Another approach would be to make host groups for each department and somehow change the service contact group for each host group. Unfortunately I haven't been able to get that to work. Any suggestions would be highly appreciated. Lennart Karssen. ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Sascha.Runschke at gfkl.com Mon Feb 8 14:18:34 2010 From: Sascha.Runschke at gfkl.com (Sascha.Runschke at gfkl.com) Date: Mon, 8 Feb 2010 14:18:34 +0100 Subject: Antwort: Re: Nagios 3.2.0 process dies silently - help! In-Reply-To: <4B6C65F3.2000400@svenskakyrkan.se> References: <241F4872A72F0244B0A81A8C852682C3933B6C@khusex01.khus.ad.svenskakyrkan.se> <4B6C65F3.2000400@svenskakyrkan.se> Message-ID: Tony Johansson schrieb am 05.02.2010 19:39:47: > [pid 32731] write(6, "1265393559||AHS||C: Drive Space||c:\\ - total: > 15.86 Gb - used: 7.60 Gb (48%) - free 8.26 Gb (52%)||c:\\ Used > Space=7.60Gb;14.27;15.54;0.00;15.86\n", 144) = -1 EFBIG (File too large) > [pid 32731] --- SIGXFSZ (File size limit exceeded) @ 0 (0) --- > [pid 32732] +++ killed by SIGXFSZ +++ > > "File size limit exceeded" seems to be the cause > Disk space is plenty: > df -h > Filesystem Size Used Avail Use% Mounted on > /dev/mapper/VolGroup00-LogVol00 > 68G 28G 38G 43% / > /dev/sda1 99M 30M 65M 32% /boot > tmpfs 506M 0 506M 0% /dev/shm > > Also, I did try renaming retention.dat, status.dat and moving files out > of checkresults earlier with no result. > > Seems like /var/spool/nagios/perfdata.log is 2G while > /var/spool/nagios/perfdata.log is a mere 11K > I've tried renaming the file and started nagios which now seems to run ok. > Looks like I need to set up log rotation or what is the best way to > handle perfdata.log? 2 GiB is the maximum filesize for an ext3 on x86_x64 platforms. There is no need for nagios to "handle" the perfdata.log. Nagios only writes perfdata if you request it to do so. This data is _only_ interpreted by external tools, like pnp. They care about truncating the perfdata once they parsed it. Since you obviously don't use any perfdata tool - why do you write perfdata logs at all? ;) S GFKL Financial Services AG Vorstand: J?rgen Baltes, Dr. Tom Haverkamp Vorsitzender des Aufsichtsrats: Wilhelm Plumpe Sitz: Limbecker Platz 1, 45127 Essen, Amtsgericht Essen, HRB 13522 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From peter.klausner at gmail.com Mon Feb 8 15:49:11 2010 From: peter.klausner at gmail.com (Peter Klausner) Date: Mon, 8 Feb 2010 15:49:11 +0100 Subject: How to set hard state from passive service check? Message-ID: <4b231bf01002080649m4130a690mfad1291994467bc4@mail.gmail.com> According to the docs (and in my set-up) a passive service check result of non-OK sets a SOFT state. So you need max_check_attempts passive and/or active checks until it changes to HARD. I want passive checks to set a HARD state immediately. Active checks should apply the max_check_attempts setting. Is there a way to achieve this? I found passive_host_checks_are_soft, but it applies only to hosts. Thanks, Peter Klausner ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From shadhin71 at gmail.com Mon Feb 8 16:41:59 2010 From: shadhin71 at gmail.com (shadih rahman) Date: Mon, 8 Feb 2010 10:41:59 -0500 Subject: Nagios web GUI, contact groups, hosts and services In-Reply-To: <50402.213.75.32.135.1265618455.squirrel@imap.snow.nl> References: <50402.213.75.32.135.1265618455.squirrel@imap.snow.nl> Message-ID: <6db4a4201002080741h31c18978r5bdc35eef41e7e25@mail.gmail.com> Karssen, Your problem appears to be process not the tool. I am not suggesting how to conduct business in your organization but here is my suggestion. I would reach out to noc management and put together a wiki or how to for each unique possible critical alert scenario. The I would add the "notes_url" parameter of Nagios to point to that particular wiki page. Only thing noc has to do click on the notes_url page and they will know exactly what to do with the alert. Thanks On Mon, Feb 8, 2010 at 3:40 AM, L.C. Karssen wrote: > Dear list, > > I'm presently working on a Nagios 3.2.0 setup that monitors approximately > 1000 hosts and about 5000 services. The setup doesn't make use of Nagios' > notification system, instead people at a control center (NOC) use the > Nagios web interface to alert the appropriate people in case of an alert. > > The Nagios configuration is based on a set of host (group) templates, > where services are assigned to host groups. For example: the 'check_swap' > service definition is associated with the host group 'all unix hosts'. > > The problem I'm confronted with is that the people at the NOC don't need > to see all services on a given host. For example, they shouldn't call the > sysadmin at night if an SSL certificate check goes into critical state > because the certificate is only valid for ten more days. So we want to > remove that service from their view. In the present situation contact > groups (used to determine which servers are visible to which department) > are added to each specific host, but according to the Nagios docs > (http://nagios.sourceforge.net/docs/3_0/cgiauth.html) a contact group can > see _all_ services on a given host if it is listed as a contract group for > the host. > So I decided to remove the NOC contact group from the individual host > definitions and to assign the NOC contact group only to specific services. > > This leads me to another problem. Some service checks (i.e. host groups) > are used in one department only. This works fine. > However, some other service checks (like check_swap for the 'all unix > hosts' hostgroup) are shared by all departments, but some departments > don't want the NOC to see check_swap alerts whereas others do want to pass > these alerts to NOC. It would be possible to make services with slightly > different names (e.g. check_swap_dept1, check_swap_dept2), each with the > correct contact group. However, that seems to be a needless increase of > complexity. > Another approach would be to make host groups for each department and > somehow change the service contact group for each host group. > Unfortunately I haven't been able to get that to work. > > > Any suggestions would be highly appreciated. > > > Lennart Karssen. > > > > ------------------------------------------------------------------------------ > The Planet: dedicated and managed hosting, cloud storage, colocation > Stay online with enterprise data centers and the best network in the > business > Choose flexible plans and management services without long-term contracts > Personal 24x7 support from experience hosting pros just a phone call away. > http://p.sf.net/sfu/theplanet-com > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Cordially, Shadhin Rahman -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Mon Feb 8 16:44:36 2010 From: marc at ena.com (Marc Powell) Date: Mon, 8 Feb 2010 09:44:36 -0600 Subject: How to set hard state from passive service check? In-Reply-To: <4b231bf01002080649m4130a690mfad1291994467bc4@mail.gmail.com> References: <4b231bf01002080649m4130a690mfad1291994467bc4@mail.gmail.com> Message-ID: <64E88483-4D86-4876-BC1A-F6E1200DAA5A@ena.com> On Feb 8, 2010, at 8:49 AM, Peter Klausner wrote: > I want passive checks to set a HARD state immediately. Active checks should apply > the max_check_attempts setting. > > Is there a way to achieve this? Not that I know of without creating a separate service definition. Passive service checks are treated exactly the same as active service checks in this regard. > I found passive_host_checks_are_soft, but it applies only to hosts. Any passive host check results in an immediate hard state. This option makes them more service-like for those who want/need that functionality. There are historical reasons why host checks are treated this way that never applied to services... -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From fevinkagen at gmail.com Mon Feb 8 16:53:04 2010 From: fevinkagen at gmail.com (fevin Kagen) Date: Mon, 8 Feb 2010 10:53:04 -0500 Subject: Hyperlink in Acknowledgement Comment In-Reply-To: <4B6D9466.3040006@hp.com> References: <4B6D9466.3040006@hp.com> Message-ID: Thanks, Patrick. I saw that, but it doesn't seem to make a my difference. I did notice that using a named pipe does write a hyperlink by default. Since the "http" option does not, I'm wondering if it is possible,. Does anyone have this working via http? Thanks! On Sat, Feb 6, 2010 at 11:10 AM, Morris, Patrick wrote: > fevin Kagen wrote: > >> Hi- >> I'm using nagios in conjunction w/ OTRS. All in all, it works great. >> However, we would like to replace the simple ticket number in the >> acknowledgement comment with a hyperlink to the actual ticket. Any ideas on >> how to do this? I've found the "Nagios::Acknowledge::HTTP::URL: " variable >> in the OTRS settings, but I can't seem to add a hyperlink since the "<" >> character is automatically removed. Thanks! >> fevin >> > > http://nagios.sourceforge.net/docs/3_0/configcgi.html#escape_html_tags > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Mon Feb 8 17:16:42 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Mon, 08 Feb 2010 08:16:42 -0800 Subject: Hyperlink in Acknowledgement Comment In-Reply-To: References: <4B6D9466.3040006@hp.com> Message-ID: <4B7038EA.2060402@hp.com> fevin Kagen wrote: > Thanks, Patrick. I saw that, but it doesn't seem to make a my > difference. I did notice that using a named pipe does write a > hyperlink by default. Since the "http" option does not, I'm wondering > if it is possible,. Does anyone have this working via http? > Thanks! > > On Sat, Feb 6, 2010 at 11:10 AM, Morris, Patrick > > wrote: > > fevin Kagen wrote: > > Hi- > I'm using nagios in conjunction w/ OTRS. All in all, it works > great. However, we would like to replace the simple ticket > number in the acknowledgement comment with a hyperlink to the > actual ticket. Any ideas on how to do this? I've found the > "Nagios::Acknowledge::HTTP::URL: " variable in the OTRS > settings, but I can't seem to add a hyperlink since the "<" > character is automatically removed. Thanks! > fevin > > > http://nagios.sourceforge.net/docs/3_0/configcgi.html#escape_html_tags > > Seeing the command you're doing this in might help. I suspect that you're not so much getting hit by a character stripping issue as by a quoting one, and maybe the "<" character is being interpreted as an input redirection. I'm not too familiar with OTRS, but it should definitely be possible to put that link in a Nagios notification command (and, in fact, we do that here). ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From fevinkagen at gmail.com Mon Feb 8 17:55:55 2010 From: fevinkagen at gmail.com (fevin Kagen) Date: Mon, 8 Feb 2010 11:55:55 -0500 Subject: Hyperlink in Acknowledgement Comment In-Reply-To: <4B7038EA.2060402@hp.com> References: <4B6D9466.3040006@hp.com> <4B7038EA.2060402@hp.com> Message-ID: I've tried all sorts of combinations, but what I have now is this: http://xxx.xxx.xxx.xxx/nagios/cgi-bin/cmd.cgi?cmd_typ=&cmd_mod=2&host=&service=&sticky_ack=on&send_notification=on&persistent=on&com_data=""&btnSubmit=Commit I know it can be done via the nagios.cmd file, but since these are two separate machines, I'm trying to do it via a http command. The above command results in the following comment: "a href='http://xxx.xxx.xxx.xxx/otrs/index.pl?Action=AgentZoom -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lennart.karssen at snow.nl Mon Feb 8 19:32:16 2010 From: lennart.karssen at snow.nl (L.C. Karssen) Date: Mon, 08 Feb 2010 19:32:16 +0100 Subject: Nagios web GUI, contact groups, hosts and services In-Reply-To: <6db4a4201002080741h31c18978r5bdc35eef41e7e25@mail.gmail.com> References: <50402.213.75.32.135.1265618455.squirrel@imap.snow.nl> <6db4a4201002080741h31c18978r5bdc35eef41e7e25@mail.gmail.com> Message-ID: <1265653936.4124.3.camel@sidonia.karssen.org> Hi Shadhin Rahman, Thanks for your reply. I hadn't mentioned in my e-mail, but adding the notes_url is indeed the option I had in mind as a last resort. I like your suggestion of linking to a wiki. If no others solution comes up, I guess that's what we'll do. Regards, Lennart Karssen. On Mon, 2010-02-08 at 10:41 -0500, shadih rahman wrote: > Karssen, > > Your problem appears to be process not the tool. I am not > suggesting how to conduct business in your organization but here is my > suggestion. > > I would reach out to noc management and put together a wiki or how > to for each unique possible critical alert scenario. The I would add > the "notes_url" parameter of Nagios to point to that particular wiki > page. > > Only thing noc has to do click on the notes_url page and they will > know exactly what to do with the alert. Thanks > > On Mon, Feb 8, 2010 at 3:40 AM, L.C. Karssen > wrote: > Dear list, > > I'm presently working on a Nagios 3.2.0 setup that monitors > approximately > 1000 hosts and about 5000 services. The setup doesn't make use > of Nagios' > notification system, instead people at a control center (NOC) > use the > Nagios web interface to alert the appropriate people in case > of an alert. > > The Nagios configuration is based on a set of host (group) > templates, > where services are assigned to host groups. For example: the > 'check_swap' > service definition is associated with the host group 'all unix > hosts'. > > The problem I'm confronted with is that the people at the NOC > don't need > to see all services on a given host. For example, they > shouldn't call the > sysadmin at night if an SSL certificate check goes into > critical state > because the certificate is only valid for ten more days. So we > want to > remove that service from their view. In the present situation > contact > groups (used to determine which servers are visible to which > department) > are added to each specific host, but according to the Nagios > docs > (http://nagios.sourceforge.net/docs/3_0/cgiauth.html) a > contact group can > see _all_ services on a given host if it is listed as a > contract group for > the host. > So I decided to remove the NOC contact group from the > individual host > definitions and to assign the NOC contact group only to > specific services. > > This leads me to another problem. Some service checks (i.e. > host groups) > are used in one department only. This works fine. > However, some other service checks (like check_swap for the > 'all unix > hosts' hostgroup) are shared by all departments, but some > departments > don't want the NOC to see check_swap alerts whereas others do > want to pass > these alerts to NOC. It would be possible to make services > with slightly > different names (e.g. check_swap_dept1, check_swap_dept2), > each with the > correct contact group. However, that seems to be a needless > increase of > complexity. > Another approach would be to make host groups for each > department and > somehow change the service contact group for each host group. > Unfortunately I haven't been able to get that to work. > > > Any suggestions would be highly appreciated. > > > Lennart Karssen. > > > ------------------------------------------------------------------------------ > The Planet: dedicated and managed hosting, cloud storage, > colocation > Stay online with enterprise data centers and the best network > in the business > Choose flexible plans and management services without > long-term contracts > Personal 24x7 support from experience hosting pros just a > phone call away. > http://p.sf.net/sfu/theplanet-com > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS > when reporting any issue. > ::: Messages without supporting info will risk being sent > to /dev/null > > > > -- > Cordially, > Shadhin Rahman ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From work at paul.dubuc.org Mon Feb 8 19:59:29 2010 From: work at paul.dubuc.org (Paul M. Dubuc) Date: Mon, 08 Feb 2010 13:59:29 -0500 Subject: How to access user-defined service variables in a command object Message-ID: <4B705F11.7000009@paul.dubuc.org> I'm trying to integrate the use of an internally developed alarm generation command into our Nagios configuration. So I want to define an Nagios command object that calls this command with arguments specific to the service that is generating the status condition that generates the alarm. One of the arguments is an alarm number. I can set this number in the service definition as a user defined variable: define service{ ... __ALARM_NUMBER 123 } Is it possible to access this variable in the command definition using on-demand macros? I tried to do this in the following way, but it doesn't seem to work: define command{ command_name notify-service-by-alarm command_line /usr/local/bin/sendalarm $HOSTALIAS$ $_SERVICE_ALARM_NUMBER:HOSTNAME:SERVICEDESC$ $SERVICESTATE$ $SERVICEDESC$ $SERVICEOUTPUT$ } Is there an alternative? Thanks, Paul M. Dubuc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From work at paul.dubuc.org Mon Feb 8 22:14:37 2010 From: work at paul.dubuc.org (Paul M. Dubuc) Date: Mon, 08 Feb 2010 16:14:37 -0500 Subject: How to access user-defined service variables in a command object In-Reply-To: <4B705F11.7000009@paul.dubuc.org> References: <4B705F11.7000009@paul.dubuc.org> Message-ID: <4B707EBD.7090405@paul.dubuc.org> I should have made more clear what I am trying to do below. I know I can access the service __ALARM_NUMBER from the command definition by giving the literal host_name and service description like this (I've updated the service definition in my previous example to illustrate): $_SERVICE_ALARM_NUMBER:localhost:DUMMY but I would like the command definition to be able to do this using the macro names $HOSTNAME$ and $SERVICEDESC$ so that one command definition works for all services that use it for notification. Is there a way to do this? I would not like to have to define a separate command and contact group for every alarm number. Also, I'm using Nagios 3.2.0. Thanks, Paul Dubuc Paul M. Dubuc wrote: > I'm trying to integrate the use of an internally developed alarm > generation command into our Nagios configuration. So I want to define > an Nagios command object that calls this command with arguments specific > to the service that is generating the status condition that generates > the alarm. One of the arguments is an alarm number. I can set this > number in the service definition as a user defined variable: > > define service{ > host_name localhost > service_description DUMMY > ... > __ALARM_NUMBER 123 > } > > Is it possible to access this variable in the command definition using > on-demand macros? I tried to do this in the following way, but it > doesn't seem to work: > > define command{ > command_name notify-service-by-alarm > command_line /usr/local/bin/sendalarm $HOSTALIAS$ > $_SERVICE_ALARM_NUMBER:HOSTNAME:SERVICEDESC$ $SERVICESTATE$ > $SERVICEDESC$ $SERVICEOUTPUT$ > } > > Is there an alternative? > > Thanks, > > Paul M. Dubuc > ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From martinm at op5.org Mon Feb 8 22:39:52 2010 From: martinm at op5.org (Martin Melin) Date: Mon, 8 Feb 2010 22:39:52 +0100 Subject: Scheduled downtime for 1 host and its services In-Reply-To: <5b3c9e44de4aae2026afc688a3383778@smetj.net> References: <5b3c9e44de4aae2026afc688a3383778@smetj.net> Message-ID: <5f0bcb7c1002081339x640def7ewec3f17902a859611@mail.gmail.com> On Tue, Feb 2, 2010 at 4:44 PM, Jelle Smet wrote: > Hi List, > > I'm using Nagios 3.2.0 and have a question about scheduled downtimes which > I can't find in the docs. > > If I schedule downtime for a host, does this automatically schedule > downtime for all the host services too? > No, but notifications are suppressed for services whose hosts are in scheduled downtime. If so, why isn't there the ZZzzz icon next to these services? > Because they are not technically in scheduled downtime. > Will scheduled downtime for a host also make sure this doesn't impact the > availability report of the services? > No. > > Thanks in advance! > > Jelle > Scheduling downtime in vanilla Nagios can be a major pain. It is much easier if you define host and service groups though, because then you can use the extinfo.cgi to for example "schedule downtime for all services in this servicegroup, and all their hosts too". -- Martin Melin ____________________________ op5 AB http://www.op5.com http://www.op5.org/ http://www.op5.com/op5/products/network-monitor/nagios/ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From work at paul.dubuc.org Mon Feb 8 22:56:18 2010 From: work at paul.dubuc.org (Paul M. Dubuc) Date: Mon, 08 Feb 2010 16:56:18 -0500 Subject: How to access user-defined service variables in a command object In-Reply-To: <4B707EBD.7090405@paul.dubuc.org> References: <4B705F11.7000009@paul.dubuc.org> <4B707EBD.7090405@paul.dubuc.org> Message-ID: <4B708882.50306@paul.dubuc.org> Sorry to have bothered the list. I was making the problem too hard because I was confused by what I'd read about on demand macros in Barth's book (p. 632). Using $_SERVICE_ALARM_NUMBER$ works in the command definition. I don't know why I didn't try that first. For some reason I thought you had to specify the host and service description to get the value of the variable. Paul Dubuc Paul M. Dubuc wrote: > I should have made more clear what I am trying to do below. I know I can > access the service __ALARM_NUMBER from the command definition by giving the > literal host_name and service description like this (I've updated the service > definition in my previous example to illustrate): > > $_SERVICE_ALARM_NUMBER:localhost:DUMMY > > but I would like the command definition to be able to do this using the macro > names $HOSTNAME$ and $SERVICEDESC$ so that one command definition works for > all services that use it for notification. Is there a way to do this? I > would not like to have to define a separate command and contact group for > every alarm number. > > Also, I'm using Nagios 3.2.0. > > Thanks, > Paul Dubuc > > Paul M. Dubuc wrote: >> I'm trying to integrate the use of an internally developed alarm >> generation command into our Nagios configuration. So I want to define >> an Nagios command object that calls this command with arguments specific >> to the service that is generating the status condition that generates >> the alarm. One of the arguments is an alarm number. I can set this >> number in the service definition as a user defined variable: >> >> define service{ > > host_name localhost > > service_description DUMMY >> ... >> __ALARM_NUMBER 123 >> } >> >> Is it possible to access this variable in the command definition using >> on-demand macros? I tried to do this in the following way, but it >> doesn't seem to work: >> >> define command{ >> command_name notify-service-by-alarm >> command_line /usr/local/bin/sendalarm $HOSTALIAS$ >> $_SERVICE_ALARM_NUMBER:HOSTNAME:SERVICEDESC$ $SERVICESTATE$ >> $SERVICEDESC$ $SERVICEOUTPUT$ >> } >> >> Is there an alternative? >> >> Thanks, >> >> Paul M. Dubuc >> > > ------------------------------------------------------------------------------ > The Planet: dedicated and managed hosting, cloud storage, colocation > Stay online with enterprise data centers and the best network in the business > Choose flexible plans and management services without long-term contracts > Personal 24x7 support from experience hosting pros just a phone call away. > http://p.sf.net/sfu/theplanet-com > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ron at tvnz.co.nz Mon Feb 8 23:08:12 2010 From: ron at tvnz.co.nz (Ron Wilson) Date: Tue, 9 Feb 2010 11:08:12 +1300 Subject: problems with Distributed alerting Message-ID: <7D85034F6BBBD540AF52C59CED3AA1E801C71227@AKVXCH02.tvnzad.tvnz.co.nz> We have set up a distributed ngaios 3.02 system using Nagiosql with several slaves and one master. The master is responsible for all alerting. However when we disable a notification service on the master ngaios and then do a reload of any of the slave servers it overwrites the status of the disabled services. I am looking for some ideas how to avoid this. Is it possible to say extract the status flags somehow before we do a slave to master update so that we can then re-apply the status flags immediately after update. Or is there an easier way to handle this situation. I am aware of the caveat of not restarting ngaios but just reloading but the disabled notifications seem to get replaced regardless of restart or reload ________________________________ -- Ron Wilson Systems Engineer Television New Zealand P.O. Box 3819 Auckland, New Zealand Phone 649-916-7560 "I've noticed that the press tends to be quite accurate, except when they're writing on a subject I know something about." (Keith F. Lynch) ________________________________ ========================================================== For more information on the Television New Zealand Group, visit us online at tvnz.co.nz ========================================================== CAUTION: This e-mail and any attachment(s) contain information that is intended to be read only by the named recipient(s). This information is not to be used or stored by any other person and/or organisation. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Mon Feb 8 23:33:49 2010 From: marc at ena.com (Marc Powell) Date: Mon, 8 Feb 2010 16:33:49 -0600 Subject: problems with Distributed alerting In-Reply-To: <7D85034F6BBBD540AF52C59CED3AA1E801C71227@AKVXCH02.tvnzad.tvnz.co.nz> References: <7D85034F6BBBD540AF52C59CED3AA1E801C71227@AKVXCH02.tvnzad.tvnz.co.nz> Message-ID: On Feb 8, 2010, at 4:08 PM, Ron Wilson wrote: > We have set up a distributed ngaios 3.02 system using Nagiosql with several slaves and one master. The master is responsible for all alerting. However when we disable a notification service on the master ngaios and then do a reload of any of the slave servers it overwrites the status of the disabled services. I am looking for some ideas how to avoid this. Is it possible to say extract the status flags somehow before we do a slave to master update so that we can then re-apply the status flags immediately after update. Or is there an easier way to handle this situation. I am aware of the caveat of not restarting ngaios but just reloading but the disabled notifications seem to get replaced regardless of restart or reload I'm not certain I fully understand what you mean by 'reload of any of the slaves overwrites the status of the disabled services' on the host? It sounds though like you don't have retain_nonstatus_information enabled... -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Feb 9 00:41:34 2010 From: marc at ena.com (Marc Powell) Date: Mon, 8 Feb 2010 17:41:34 -0600 Subject: problems with Distributed alerting In-Reply-To: <7D85034F6BBBD540AF52C59CED3AA1E801C7122A@AKVXCH02.tvnzad.tvnz.co.nz> References: <7D85034F6BBBD540AF52C59CED3AA1E801C71227@AKVXCH02.tvnzad.tvnz.co.nz> <7D85034F6BBBD540AF52C59CED3AA1E801C7122A@AKVXCH02.tvnzad.tvnz.co.nz> Message-ID: Please always respond on list. More below... On Feb 8, 2010, at 4:48 PM, Ron Wilson wrote: > Thanks for reply. I am a little confused between Retain_Status and > Retain_NonStatus > The help screen is not very helpful It's in the documentation. > If I want to make sure that any host or service on the master server > never gets changed when distributes servers force a reload etc which one > do I need to set? This is where my confusion lies. There should be no way that a distributed server can make that kind of change on the central server, reload, restart or whatever. Assuming you've followed the distributed documentation, the only thing they can do is submit a check result unless you've created some special program that propagates other changes. I think you need to describe more clearly how you're set up, how your configs are created and distributed and how you're sending distributed results back to the central server. -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From queenie245 at gmail.com Tue Feb 9 09:56:46 2010 From: queenie245 at gmail.com (Queenie de Melo) Date: Tue, 9 Feb 2010 14:26:46 +0530 Subject: Error while cross compiling -error: parse error before "lt_ptr" Message-ID: <344a1c5e1002090056l7a33946ay5ec7210938ead47c@mail.gmail.com> I have downloaded nagios 3.2.0 and iam trying to cross compile for xscale big endian platform. It configured fine as shown below but while compiling, i get Error. I configured as *./configure host=xscale CC=xscale_be-gcc* > checking for a BSD-compatible install... /usr/bin/install -c > checking build system type... i686-pc-linux-gnu > checking host system type... xscale-unknown-none > checking for xscale-gcc... xscale_be-gcc > checking for C compiler default output file name... a.out > checking whether the C compiler works... yes > checking whether we are cross compiling... yes > checking for suffix of executables... > checking for suffix of object files... o > checking whether we are using the GNU C compiler... yes > checking whether xscale_be-gcc accepts -g... yes > checking for xscale_be-gcc option to accept ANSI C... none needed > checking whether make sets $(MAKE)... yes > checking for strip... /usr/bin/strip > checking how to run the C preprocessor... xscale_be-gcc -E > checking for egrep... grep -E > checking for ANSI C header files... yes > checking whether time.h and sys/time.h may both be included... yes > checking for sys/wait.h that is POSIX.1 compatible... yes > and so on................ then When i do a "make all', I get the error as below > In file included from ../include/nebcallbacks.h:29, > from broker.c:30: > ../include/nebmodules.h:80: error: parse error before "lt_ptr" > ../include/nebmodules.h:80: warning: no semicolon at end of struct or union > ../include/nebmodules.h:81: warning: data definition has no type or storage > class > ../include/nebmodules.h:91: error: parse error before '}' token > ../include/nebmodules.h:91: warning: data definition has no type or storage > class > In file included from broker.c:30: > ../include/nebcallbacks.h:81: error: parse error before '*' token > In file included from broker.c:32: > ../include/nebmods.h:54: error: parse error before '*' token > ../include/nebmods.h:57: error: parse error before '*' token Iam using xscale_be-gcc (GCC) 3.3.1 "make cgis" compiles fine without any errors. Please help me how to proceed. > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From queenie245 at gmail.com Tue Feb 9 10:04:26 2010 From: queenie245 at gmail.com (Queenie de Melo) Date: Tue, 9 Feb 2010 14:34:26 +0530 Subject: Fwd: Error while cross compiling -error: parse error before "lt_ptr" In-Reply-To: <344a1c5e1002090056l7a33946ay5ec7210938ead47c@mail.gmail.com> References: <344a1c5e1002090056l7a33946ay5ec7210938ead47c@mail.gmail.com> Message-ID: <344a1c5e1002090104p124cdef2m83898a9c20df4505@mail.gmail.com> I have downloaded nagios 3.2.0 and iam trying to cross compile for xscale big endian platform. It configured fine as shown below but while compiling, i get Error. I configured as *./configure host=xscale CC=xscale_be-gcc* > checking for a BSD-compatible install... /usr/bin/install -c > checking build system type... i686-pc-linux-gnu > checking host system type... xscale-unknown-none > checking for xscale-gcc... xscale_be-gcc > checking for C compiler default output file name... a.out > checking whether the C compiler works... yes > checking whether we are cross compiling... yes > checking for suffix of executables... > checking for suffix of object files... o > checking whether we are using the GNU C compiler... yes > checking whether xscale_be-gcc accepts -g... yes > checking for xscale_be-gcc option to accept ANSI C... none needed > checking whether make sets $(MAKE)... yes > checking for strip... /usr/bin/strip > checking how to run the C preprocessor... xscale_be-gcc -E > checking for egrep... grep -E > checking for ANSI C header files... yes > checking whether time.h and sys/time.h may both be included... yes > checking for sys/wait.h that is POSIX.1 compatible... yes > and so on................ then When i do a "make all', I get the error as below > In file included from ../include/nebcallbacks.h:29, > from broker.c:30: > ../include/nebmodules.h:80: error: parse error before "lt_ptr" > ../include/nebmodules.h:80: warning: no semicolon at end of struct or union > ../include/nebmodules.h:81: warning: data definition has no type or storage > class > ../include/nebmodules.h:91: error: parse error before '}' token > ../include/nebmodules.h:91: warning: data definition has no type or storage > class > In file included from broker.c:30: > ../include/nebcallbacks.h:81: error: parse error before '*' token > In file included from broker.c:32: > ../include/nebmods.h:54: error: parse error before '*' token > ../include/nebmods.h:57: error: parse error before '*' token Iam using xscale_be-gcc (GCC) 3.3.1 "make cgis" compiles fine without any errors. Please help me how to proceed. > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From EWScott at scotborders.gov.uk Tue Feb 9 10:59:08 2010 From: EWScott at scotborders.gov.uk (Scott, Ewan) Date: Tue, 9 Feb 2010 09:59:08 +0000 Subject: How can I get more information - eventlog descriptions - in my notification emails In-Reply-To: <1EBC9A40-E73F-4218-BA85-3114D846C3DE@ena.com> References: <771645F4ADF2C9449B7E6AE630C720018BC4BD233A@HQ-MAIL-01.scotborders.gov.uk> <1EBC9A40-E73F-4218-BA85-3114D846C3DE@ena.com> Message-ID: <771645F4ADF2C9449B7E6AE630C720018BC4BD233F@HQ-MAIL-01.scotborders.gov.uk> Marc Thanks for this. By returning to using $SERVICEOUTPUT$ and placing this towards the beginning of the printf output - see below - I now get the EventLog text into the notification email. I lose the "Additional Info:" text literal at the end of the email so presumably there is a fixed size for the output which cannot be extended. However, for what I'm trying to do, I think it is sufficient. Regards Ewan define command{ command_name notify-service-by-email command_line /usr/bin/printf "%b" "***** Nagios *****\nNotification Type: $NOTIFICATIONTYPE$\nTEXT: $SERVICEOUTPUT$\nService: $SERVICEDESC$\nHost: $HOSTALIAS$\nAddress: $HOSTADDRESS$\nState: $SERVICESTATE$\nDate/Time: $LONGDATETIME$\nAdditional Info:" | /usr/bin/mail -s "** $NOTIFICATIONTYPE$ Service Alert: $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **" $CONTACTEMAIL$ } -----Original Message----- From: Marc Powell [mailto:marc at ena.com] Sent: 05 February 2010 16:51 To: Scott, Ewan Cc: Nagios Mailinglist Subject: Re: [Nagios-users] How can I get more information - eventlog descriptions - in my notification emails On Feb 5, 2010, at 10:25 AM, Scott, Ewan wrote: > Hi > > I'm looking to get more information in my notificiation emails. Specifically, I'd like to get the text of Windows server Eventlogs into the body of the email. This information is coming across to the nagios server and can be found in the log file. > See example below: > Feb 5 15:08:36 sbc-omega-test nagios: SERVICE NOTIFICATION: nagiosadmin;hq-test-01;EventLog TempDB start chk;WARNING;notify-service-by-email;informational: MSSQLSERVER: Starting up database ReportServerTempDB. (384), eventlog: 384 warning > I had hoped I could resolve this by changing the $SERVICEOUTPUT$ to $LONGSERVICEOUTPUT$ in the notify-service-by-email command definition (see below). > # 'notify-service-by-email' command definition > define command{ > command_name notify-service-by-email > command_line /usr/bin/printf "%b" "***** Nagios *****\n\nNotification > Type: $NOTIFICATIONTYPE$\n\nService: $SERVICEDESC$\nHost: $HOSTALIAS$\nAddress: > $HOSTADDRESS$\nState: $SERVICESTATE$\n\nDate/Time: $LONGDATETIME$\n\nAdditional > Info:\n\n$LONGSERVICEOUTPUT$" | /usr/bin/mail -s "** $NOTIFICATIONTYPE$ Service > Alert: $HOSTALIAS$/$SERVICEDESC$ is $SERVICESTATE$ **" $CONTACTEMAIL$ > } > However this did not give any change in the output. That's all I've changed on the server. Can anyone give me pointers as to what else I need to change? $SERVICEOUTPUT$ is the macro that you want. It looks like the plugin is only outputting one line of text. Make sure it is/was typed correctly in the notification command above. -- http://nagios.sourceforge.net/docs/3_0/macrolist.html#serviceoutput $SERVICEOUTPUT$ The first line of text output from the last service check (i.e. "Ping OK"). $LONGSERVICEOUTPUT$ The full text output (aside from the first line) from the last service check. (e.g. everything beyond the first line of text). -- Marc ********************************************************************** This email and any files transmitted with it are privileged, confidential and subject to copyright. Any unauthorised use or disclosure of any part of this email is prohibited. If you are not the intended recipient please inform the sender immediately; you should then delete the email and remove any copies from your system. The views or opinions expressed in this communication may not necessarily be those of Scottish Borders Council. Please be advised that Scottish Borders Council's incoming and outgoing email is subject to regular monitoring and any email may require to be disclosed by the Council under the provisions of the Freedom of Information (Scotland) Act 2002. ********************************************************************** ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Feb 9 12:38:03 2010 From: marc at ena.com (Marc Powell) Date: Tue, 9 Feb 2010 05:38:03 -0600 Subject: How can I get more information - eventlog descriptions - in my notification emails In-Reply-To: <771645F4ADF2C9449B7E6AE630C720018BC4BD233F@HQ-MAIL-01.scotborders.gov.uk> References: <771645F4ADF2C9449B7E6AE630C720018BC4BD233A@HQ-MAIL-01.scotborders.gov.uk> <1EBC9A40-E73F-4218-BA85-3114D846C3DE@ena.com> <771645F4ADF2C9449B7E6AE630C720018BC4BD233F@HQ-MAIL-01.scotborders.gov.uk> Message-ID: <259787FA-3890-4143-B3F2-68DEE4B254D2@ena.com> On Feb 9, 2010, at 3:59 AM, Scott, Ewan wrote: > Marc > Thanks for this. > > By returning to using $SERVICEOUTPUT$ and placing this towards the beginning of the printf output - see below - I now get the EventLog text into the notification email. I lose the "Additional Info:" text literal at the end of the email so presumably there is a fixed size for the output which cannot be extended. However, for what I'm trying to do, I think it is sufficient. I can't explain why it's happening but there is no fixed limit within nagios that would cause this behavior that I am aware of... -- Marc ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From reachtarak at hotmail.com Tue Feb 9 14:06:02 2010 From: reachtarak at hotmail.com (Tarak Mukherjee) Date: Tue, 9 Feb 2010 18:36:02 +0530 Subject: User Authentication using LDAP in Nagios Message-ID: Dear Friends, Is there any way to Authenticate user using LDAP in nagios. /\ Ricky _________________________________________________________________ Hotmail: Powerful Free email with security by Microsoft. https://signup.live.com/signup.aspx?id=60969 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jpratt at norwich.edu Tue Feb 9 14:15:15 2010 From: jpratt at norwich.edu (James Pratt) Date: Tue, 9 Feb 2010 08:15:15 -0500 Subject: User Authentication using LDAP in Nagios In-Reply-To: References: Message-ID: <24B6509E4191AF44B60A24EAA3B4AD494B5426@nuexchg.norwich.edu> >> -----Original Message----- >> From: Tarak Mukherjee [mailto:reachtarak at hotmail.com] >> Sent: Tuesday, February 09, 2010 8:06 AM >> To: nagios-users at lists.sourceforge.net >> Subject: [Nagios-users] User Authentication using LDAP in Nagios >> >> Dear Friends, >> >> Is there any way to Authenticate user using LDAP in nagios. >> >> >> >> >> /\ >> Ricky >> Yes - http://wiki.nagios.org/index.php/Windows_Authentication cheers, james ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From benny at bennyvision.com Tue Feb 9 15:08:06 2010 From: benny at bennyvision.com (C. Bensend) Date: Tue, 9 Feb 2010 08:08:06 -0600 Subject: check_openmanage not using my custom temperature thresholds Message-ID: <4f19a3d373e8831c63799057029b49c2.squirrel@webmail.stinkweasel.net> Hey folks, I am trying to use custom temperature thresholds with one of my servers, and it doesn't seem to take them into account. The full command (as defined in NSC.ini for NSClient++): command[check_openmanage]=check_openmanage.exe -e -p -w 0=50 -c 0=54 -b bat_charge=ALL/ctrl_fw=ALL/ctrl_driver=ALL --omreport F:\dellopenmanage\oma\bin\omreport.exe Per http://folk.uio.no/trondham/software/check_openmanage.html and the man page, I'm pretty sure that's supposed to set the temperature probe 0's warning threshold to 50C and critical to 54C. However, I'm still getting a non-OK for temp probe 0: Temperature Probe 0 [System Board Ambient Temp] is too high at 43 C ------ SYSTEM: PowerEdge 2900, SN: 4PVXSK1 Just to be sure this wasn't a glitch with the display of the temp probe #, I've tried 0=50,1=50,2=50 and 0=54,1=54,2=54 but it still complains. Am I missing something? I've looked at this until I'm crosseyed, and I'm pretty sure I'm using it correctly. Is there a hardcoded threshold in there that I'm not aware of? Thanks much! Benny -- "Show me on the doll where the marketing touched you." -- "Mally" on Fazed.net ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Tue Feb 9 16:05:32 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Tue, 09 Feb 2010 16:05:32 +0100 Subject: check_openmanage not using my custom temperature thresholds In-Reply-To: <4f19a3d373e8831c63799057029b49c2.squirrel@webmail.stinkweasel.net> (C. Bensend's message of "Tue, 9 Feb 2010 08:08:06 -0600") References: <4f19a3d373e8831c63799057029b49c2.squirrel@webmail.stinkweasel.net> Message-ID: <15tmxziwitf.fsf@tux.uio.no> "C. Bensend" writes: > Hey folks, > > I am trying to use custom temperature thresholds with one of my > servers, and it doesn't seem to take them into account. > > The full command (as defined in NSC.ini for NSClient++): > > command[check_openmanage]=check_openmanage.exe -e -p -w 0=50 -c 0=54 -b > bat_charge=ALL/ctrl_fw=ALL/ctrl_driver=ALL --omreport > F:\dellopenmanage\oma\bin\omreport.exe > > Per http://folk.uio.no/trondham/software/check_openmanage.html and > the man page, I'm pretty sure that's supposed to set the temperature > probe 0's warning threshold to 50C and critical to 54C. However, I'm > still getting a non-OK for temp probe 0: > > Temperature Probe 0 [System Board Ambient Temp] is too high at 43 C > ------ SYSTEM: PowerEdge 2900, SN: 4PVXSK1 > > Just to be sure this wasn't a glitch with the display of the temp > probe #, I've tried 0=50,1=50,2=50 and 0=54,1=54,2=54 but it still > complains. > > Am I missing something? I've looked at this until I'm crosseyed, > and I'm pretty sure I'm using it correctly. Is there a hardcoded > threshold in there that I'm not aware of? Hi Benny, Openmanage has its own limits. From a random M600 server here, the limits for ambient temperature is # omreport chassis temps Temperature Probes Information ------------------------------------ Main System Chassis Temperatures: Ok ------------------------------------ Index : 0 Status : Ok Probe Name : System Board Ambient Temp Reading : 16.0 C Minimum Warning Threshold : 8.0 C Maximum Warning Threshold : 42.0 C Minimum Failure Threshold : 3.0 C Maximum Failure Threshold : 47.0 C To be honest, I've never considered the possibility of anyone wanting to set custom temperatures *higher* than the OMSA maximum. I allways assumed that people wanted to use the custom limits to set the max temperature *lower* than the default limits. Clearly I was wrong :) What happens in your case is that the OMSA limits kicks in. It is possible to adjust the OMSA warning limits, e.g. # omconfig chassis temps index=0 maxwarnthresh=45 Temperature probe warning threshold(s) set successfully. It is not possible to adjust the critical (failure) limits like this, only the warning limits can be set manually. Also, I believe that when a server hits the critical limit, in the interest of self preservation it shuts itself down. The plugin could be made to ignore the OMSA warning limit if the custom limit is set beyond it, but I'm not sure that we want this in general. What do you think? Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From benny at bennyvision.com Tue Feb 9 16:15:21 2010 From: benny at bennyvision.com (C. Bensend) Date: Tue, 9 Feb 2010 09:15:21 -0600 Subject: check_openmanage not using my custom temperature thresholds In-Reply-To: <15tmxziwitf.fsf@tux.uio.no> References: <4f19a3d373e8831c63799057029b49c2.squirrel@webmail.stinkweasel.net> <15tmxziwitf.fsf@tux.uio.no> Message-ID: <17baf01c2ad8bbda9a81350c40bb58d7.squirrel@webmail.stinkweasel.net> > Openmanage has its own limits. From a random M600 server here, the > limits for ambient temperature is > > # omreport chassis temps > Temperature Probes Information > > ------------------------------------ > Main System Chassis Temperatures: Ok > ------------------------------------ > > Index : 0 > Status : Ok > Probe Name : System Board Ambient Temp > Reading : 16.0 C > Minimum Warning Threshold : 8.0 C > Maximum Warning Threshold : 42.0 C > Minimum Failure Threshold : 3.0 C > Maximum Failure Threshold : 47.0 C > > To be honest, I've never considered the possibility of anyone wanting to > set custom temperatures *higher* than the OMSA maximum. I allways > assumed that people wanted to use the custom limits to set the max > temperature *lower* than the default limits. Clearly I was wrong :) Yes and no... I don't *want* to tweak this, but we have a single server in a remote office in a wiring closet that just barely hits 43C each morning when people start showing up and things start working a bit. It causes a lot of nuisance alerts, so we were playing with massaging the thresholds a bit. > What happens in your case is that the OMSA limits kicks in. It is > possible to adjust the OMSA warning limits, e.g. > > # omconfig chassis temps index=0 maxwarnthresh=45 > Temperature probe warning threshold(s) set successfully. > > It is not possible to adjust the critical (failure) limits like this, > only the warning limits can be set manually. Also, I believe that when a > server hits the critical limit, in the interest of self preservation it > shuts itself down. > > The plugin could be made to ignore the OMSA warning limit if the custom > limit is set beyond it, but I'm not sure that we want this in general. > What do you think? Now that I know what's going on (and how to adjust the OMSA threshold if need be), I'd say keep it where it is. However, if these details were mentioned on the page: http://folk.uio.no/trondham/software/check_openmanage.html it would have saved me a lot of time, hair, and such. Could this be added? Thanks so much! Benny -- "Show me on the doll where the marketing touched you." -- "Mally" on Fazed.net ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Tue Feb 9 18:02:44 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Tue, 09 Feb 2010 18:02:44 +0100 Subject: check_openmanage not using my custom temperature thresholds In-Reply-To: <17baf01c2ad8bbda9a81350c40bb58d7.squirrel@webmail.stinkweasel.net> (C. Bensend's message of "Tue, 9 Feb 2010 09:15:21 -0600") References: <4f19a3d373e8831c63799057029b49c2.squirrel@webmail.stinkweasel.net> <15tmxziwitf.fsf@tux.uio.no> <17baf01c2ad8bbda9a81350c40bb58d7.squirrel@webmail.stinkweasel.net> Message-ID: <15t8wb2wde3.fsf@tux.uio.no> "C. Bensend" writes: > Now that I know what's going on (and how to adjust the OMSA threshold > if need be), I'd say keep it where it is. However, if these details > were mentioned on the page: > > http://folk.uio.no/trondham/software/check_openmanage.html > > it would have saved me a lot of time, hair, and such. Could this be > added? Yes, I have updated the documentation: http://folk.uio.no/trondham/software/check_openmanage.html#custom-temperature-thresholds Hopefully this will clarify things for other users. BTW, thanks for reporting this, the documentation was ambiguous and in need of an update :) Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From AHKAPLAN at PARTNERS.ORG Tue Feb 9 18:45:48 2010 From: AHKAPLAN at PARTNERS.ORG (Kaplan, Andrew H.) Date: Tue, 9 Feb 2010 12:45:48 -0500 Subject: Improving performace of the check_folder_size script Message-ID: Hi there -- I am running Nagios 3.1.2 with the nrpe 2.12 client. One of the scripts that I am using is the check_folder_size script. The Nagios server is on one subnet, while the client is on another. The connection between the two is via a 10GB WAN connection. The script is configured on the client to send warnings when the folder in question reaches 500 gigabytes. The entire disk that the folder is on is four terabytes. The script appears to be doing its job pretty well, but I notice that occasionally I will get timeout issues with the script. I suspect the configuration of the script on the Nagios server is the cause, and I wanted to get feedback as to the best settings to have for the script. The current configuration is shown below: # Service definition define service{ use generic-service ; Name of service template to use host_name service_description Folder Size Active 1 is_volatile 0 check_period 24x7 max_check_attempts 1 normal_check_interval 1440 retry_check_interval 720 contact_groups linux-admins notification_interval 1440 notification_period 24x7 notification_options w,u,c,r check_command check_nrpe!check_folder_size1 } Does anyone have any suggestions as to what settings would be better here? Thanks. The information in this e-mail is intended only for the person to whom it is addressed. If you believe this e-mail was sent to you in error and the e-mail contains patient information, please contact the Partners Compliance HelpLine at http://www.partners.org/complianceline . If the e-mail was sent to you in error but does not contain patient information, please contact the sender and properly dispose of the e-mail. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From david at davidkrider.com Tue Feb 9 19:53:13 2010 From: david at davidkrider.com (David Krider) Date: Tue, 09 Feb 2010 13:53:13 -0500 Subject: Passive checks show "red" in tactical overview Message-ID: <1265741593.15037.8.camel@workstation> I have many services that are passively monitored. They have a check_command defined, but, like my SNMP trap catcher service, it only fires when it's time to check freshness. All these services show up as "X Services Disabled" in the "Active Checks" area of the tactical overview screen, as though this was a problem. However, this is normal, and I don't want to see any "red" areas on the overview. (I want _any_ red on that screen to mean, "PANIC!") Can anyone tell me how to make Nagios ignore this? Also, these same services have flapping detection disabled, so, same thing for that as well. Or, rather, is the only way to get rid of this to go ahead and enable the active_check command, probably on a lengthy interval time? Thanks, dk ------------------------------------------------------------------------------ The Planet: dedicated and managed hosting, cloud storage, colocation Stay online with enterprise data centers and the best network in the business Choose flexible plans and management services without long-term contracts Personal 24x7 support from experience hosting pros just a phone call away. http://p.sf.net/sfu/theplanet-com _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From matt at baerconsult.com Wed Feb 10 08:43:46 2010 From: matt at baerconsult.com (Matt Baer) Date: Wed, 10 Feb 2010 01:43:46 -0600 (CST) Subject: Check for Web Page Changes Message-ID: <13481008.880.1265787826047.JavaMail.root@mail.baerconsult.com> I want to check certain web pages on my servers to see if they change at all, such as a comment is left or someone has hacked my site. Is there a way to have Nagios check it's last polled HTML code and compare it to the next poll? Or something to that effect? -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jonathan.wheeler at stfc.ac.uk Wed Feb 10 09:18:56 2010 From: jonathan.wheeler at stfc.ac.uk (jonathan.wheeler at stfc.ac.uk) Date: Wed, 10 Feb 2010 08:18:56 -0000 Subject: FW: Check for Web Page Changes Message-ID: From: Matt Baer [mailto:matt at baerconsult.com] Sent: 10 February 2010 07:44 > I want to check certain web pages on my servers to see if they change at all, such as a > comment is left or someone has hacked my site.? Is there a way to have Nagios check it's > last polled HTML code and compare it to the next poll?? Or something to that effect? The check_log plugin might do what you want if you choose the check string carefully. Note the caveats in the documentation about max_attempts etc. Jonathan Wheeler e-Science Centre Rutherford Appleton Laboratory -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: ATT1442263.txt URL: -------------- next part -------------- An embedded and charset-unspecified text was scrubbed... Name: ATT1442264.txt URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From reachtarak at hotmail.com Wed Feb 10 10:09:31 2010 From: reachtarak at hotmail.com (Tarak Mukherjee) Date: Wed, 10 Feb 2010 14:39:31 +0530 Subject: latency & response tme Message-ID: Dear Friends, How do I get the response time for the particular website using nagios. As well as the latency report. will it be possible is nagios. /\ Ricky _________________________________________________________________ Hotmail: Free, trusted and rich email service. https://signup.live.com/signup.aspx?id=60969 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios.raja at gmail.com Wed Feb 10 10:11:52 2010 From: nagios.raja at gmail.com (raja nagios) Date: Wed, 10 Feb 2010 14:41:52 +0530 Subject: nagios help Message-ID: <48dff22c1002100111n5664f691j962e0297f9833886@mail.gmail.com> Hi, I have been facing the same problem. I have installed Nagios 3.o on RHEL5. everything is ok. Now when i open my web interface in the browser. it is showing like this. Please find the attachment. There you can see ? marc. why am i getting ? mark there. Thanks in advance. yours, Raja antony -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Screenshot.png Type: image/png Size: 149372 bytes Desc: not available URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From christian.masopust at siemens.com Wed Feb 10 10:17:08 2010 From: christian.masopust at siemens.com (Masopust, Christian) Date: Wed, 10 Feb 2010 10:17:08 +0100 Subject: nagios help In-Reply-To: <48dff22c1002100111n5664f691j962e0297f9833886@mail.gmail.com> References: <48dff22c1002100111n5664f691j962e0297f9833886@mail.gmail.com> Message-ID: <60721B67EAF0994EAFFB561767B700140460F84C@nets13ha.ww300.siemens.net> This is because you don't have defined an icon for the hosts in hostsextinfo... christian -- "I sense much NT in you, NT leads to Blue Screen. Blue Screen leads to downtime, downtime leads to suffering. NT is the path to the darkside." - Unknown Unix Jedi ________________________________ From: raja nagios [mailto:nagios.raja at gmail.com] Sent: Wednesday, February 10, 2010 10:12 AM To: Nagios-users at lists.sourceforge.net Subject: [Nagios-users] nagios help Hi, I have been facing the same problem. I have installed Nagios 3.o on RHEL5. everything is ok. Now when i open my web interface in the browser. it is showing like this. Please find the attachment. There you can see ? marc. why am i getting ? mark there. Thanks in advance. yours, Raja antony -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From queenie245 at gmail.com Wed Feb 10 10:26:06 2010 From: queenie245 at gmail.com (Queenie de Melo) Date: Wed, 10 Feb 2010 14:56:06 +0530 Subject: nagios 3.2.0- Cross compilation -xscale plaftorm Message-ID: <344a1c5e1002100126q3d4c0bacsbe7fd378eb5f317d@mail.gmail.com> > > I have downloaded nagios 3.2.0 and iam trying to cross compile for xscale > big endian platform. > > It configured fine as shown below but while compiling, i get Error. > > I configured as > ./configure host=xscale CC=xscale_be-gcc > checking for a BSD-compatible install... /usr/bin/install -c > checking build system type... i686-pc-linux-gnu > checking host system type... xscale-unknown-none > checking for xscale-gcc... xscale_be-gcc > checking for C compiler default output file name... a.out > checking whether the C compiler works... yes > checking whether we are cross compiling... yes > checking for suffix of executables... > checking for suffix of object files... o > checking whether we are using the GNU C compiler... yes > checking whether xscale_be-gcc accepts -g... yes > checking for xscale_be-gcc option to accept ANSI C... none needed > checking whether make sets $(MAKE)... yes > checking for strip... /usr/bin/strip > checking how to run the C preprocessor... xscale_be-gcc -E > > and so on................ > > then When i do a "make all', > I get the error as below > > In file included from ../include/nebcallbacks.h:29, > from broker.c:30: > ../include/nebmodules.h:80: error: parse error before "lt_ptr" > ../include/nebmodules.h:80: warning: no semicolon at end of struct or union > ../include/nebmodules.h:81: warning: data definition has no type or storage > class > ../include/nebmodules.h:91: error: parse error before '}' token > ../include/nebmodules.h:91: warning: data definition has no type or storage > class > In file included from broker.c:30: > ../include/nebcallbacks.h:81: error: parse error before '*' token > In file included from broker.c:32: > ../include/nebmods.h:54: error: parse error before '*' token > ../include/nebmods.h:57: error: parse error before '*' token > > Iam using xscale_be-gcc (GCC) 3.3.1 > > "make cgis" compiles fine without any errors. > > Please help me how to proceed. > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Wed Feb 10 17:10:14 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Wed, 10 Feb 2010 08:10:14 -0800 Subject: latency & response tme In-Reply-To: References: Message-ID: <4B72DA66.1090007@hp.com> Tarak Mukherjee wrote: > Dear Friends, > > How do I get the response time for the particular website using > nagios. As well as the latency report. will it be possible is nagios. See check_http ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From durket at highwire.stanford.edu Wed Feb 10 16:52:50 2010 From: durket at highwire.stanford.edu (Michael Durket) Date: Wed, 10 Feb 2010 07:52:50 -0800 Subject: Scheduled downtime doesn't clear? Message-ID: <5F48703F-0559-44C1-B183-6CA9AE40DD72@highwire.stanford.edu> We're running Nagios 2.5. Every night we take hundreds of services being monitored by Nagios down for a quick restart. The restart process takes no more than 15 minutes and just before each service goes down, a script sends a scheduled flexible downtime request to Nagios (for a 15 minute flexible window). We've noticed that, sometimes, Nagios doesn't cancel the downtime at the end of the window, and days later we have services that are listed as still being in scheduled downtime (even though the window time is long past). Has this been found and/or fixed in a later release of Nagios? ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Feb 10 22:42:19 2010 From: marc at ena.com (Marc Powell) Date: Wed, 10 Feb 2010 15:42:19 -0600 Subject: Improving performace of the check_folder_size script In-Reply-To: References: Message-ID: <9E7BC440-3147-4945-8FED-498AE5CDBA3E@ena.com> On Feb 9, 2010, at 11:45 AM, Kaplan, Andrew H. wrote: > Hi there -- > > I am running Nagios 3.1.2 with the nrpe 2.12 client. One of the scripts that I am using is the > check_folder_size script. When asking about a non-standard plugin, it's very useful to also include where you got that plugin. How it does what it does is very relevant to your question. > The script is configured on > the client to send warnings when the folder in question reaches 500 gigabytes. How does it determine this? > The script appears to be doing its job pretty well, but I notice that occasionally I will get > timeout issues with the script. I suspect the configuration of the script on the Nagios server is > the cause, and I wanted to get feedback as to the best settings to have for the script. Much more likely is that the script is using something like '/bin/du' to calculate the size of the directory and that it is sometimes taking longer than you've told nagios to wait to do so. That doesn't really surprise me for a directory of that size. > The current > configuration is shown below: > > # Service definition > define service{ > check_command check_nrpe!check_folder_size1 > } > > Does anyone have any suggestions as to what settings would be better here? Thanks. Look at your check_nrpe command definition. If it specifies a timeout value (-to), increase it as needed. If it doesn't, add one. Also check the value of service_check_timeout in nagios.cfg. It should be higher than the check_nrpe timeout. An even better solution is to make that directory it's own partition. Then you can use check_disk, which will be nearly instantaneous, to determine the size of the directory. -- Marc ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cyruspy at gmail.com Wed Feb 10 23:30:43 2010 From: cyruspy at gmail.com (Ciro Iriarte) Date: Wed, 10 Feb 2010 19:30:43 -0300 Subject: NRPE compilation on HPUX Message-ID: Hi, i'm getting issues compiling NRPE on HPUX 11.31, do you now what missing library can cause this? ------------- utils.c: ld: Unsatisfied symbol "fromhost" in file nrpe.o 1 errors. *** Error exit code 1 Stop. *** Error exit code 1 Stop. ------------- Regards, -- Ciro Iriarte http://cyruspy.wordpress.com -- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From perldork at webwizarddesign.com Thu Feb 11 05:35:23 2010 From: perldork at webwizarddesign.com (Max) Date: Wed, 10 Feb 2010 23:35:23 -0500 Subject: NRPE compilation on HPUX In-Reply-To: References: Message-ID: On Wed, Feb 10, 2010 at 5:30 PM, Ciro Iriarte wrote: > Hi, i'm getting issues compiling NRPE on HPUX 11.31, do you now what > missing library can cause this? Colleague of mine used the NRPE package here: http://mayoxide.com/naghpux/ and said it works well. - max ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From cyruspy at gmail.com Thu Feb 11 05:50:29 2010 From: cyruspy at gmail.com (Ciro Iriarte) Date: Thu, 11 Feb 2010 01:50:29 -0300 Subject: NRPE compilation on HPUX In-Reply-To: References: Message-ID: 2010/2/11 Max : > On Wed, Feb 10, 2010 at 5:30 PM, Ciro Iriarte wrote: >> Hi, i'm getting issues compiling NRPE on HPUX 11.31, do you now what >> missing library can cause this? > > Colleague of mine used the NRPE package here: > > http://mayoxide.com/naghpux/ > > and said it works well. > > - max > I used his build notes, and got that errors... Regards, -- Ciro Iriarte http://cyruspy.wordpress.com -- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From reachtarak at hotmail.com Thu Feb 11 09:32:08 2010 From: reachtarak at hotmail.com (Tarak Mukherjee) Date: Thu, 11 Feb 2010 14:02:08 +0530 Subject: latency & response tme In-Reply-To: <4B72DA66.1090007@hp.com> References: , <4B72DA66.1090007@hp.com> Message-ID: Hi List, I don't think check_http will server the requirement, check_http will attempt to open an HTTP connection with the host. Successful connects return STATE_OK, refusals and timeouts return STATE_CRITICAL other errors return STATE_UNKNOWN. I want the response / latency time, mean to say that How long time it's taking to load the page/URL. /\ Ricky > Date: Wed, 10 Feb 2010 08:10:14 -0800 > From: patrick.morris at hp.com > To: reachtarak at hotmail.com > CC: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] latency & response tme > > Tarak Mukherjee wrote: > > Dear Friends, > > > > How do I get the response time for the particular website using > > nagios. As well as the latency report. will it be possible is nagios. > > See check_http _________________________________________________________________ Hotmail: Powerful Free email with security by Microsoft. https://signup.live.com/signup.aspx?id=60969 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From flyinvap at orange.fr Thu Feb 11 12:02:49 2010 From: flyinvap at orange.fr (Flyinvap) Date: Thu, 11 Feb 2010 12:02:49 +0100 Subject: latency & response tme In-Reply-To: References: <4B72DA66.1090007@hp.com> Message-ID: <20100211120249.20c376d3@orange.fr> Le Thu, 11 Feb 2010 14:02:08 +0530, Tarak Mukherjee a ?crit : > I don't think check_http will server the requirement, check_http will > attempt to open an HTTP connection with the host. Successful connects > return STATE_OK, refusals and timeouts return STATE_CRITICAL other > errors return STATE_UNKNOWN. Really ? $ /usr/lib/nagios/plugins/check_http -h [..] -w, --warning=DOUBLE Response time to result in warning status (seconds) -c, --critical=DOUBLE Response time to result in critical status (seconds) ? Response time ? is nearly what is wrote in your subject. -- Flyinvap ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ralf.prengel at comline.de Thu Feb 11 13:01:53 2010 From: ralf.prengel at comline.de (prengel) Date: Thu, 11 Feb 2010 13:01:53 +0100 Subject: Monitoring citrix xen Server 5.5 and vmware esx4i In-Reply-To: <60721B67EAF0994EAFFB561767B700140460F84C@nets13ha.ww300.siemens.net> References: <48dff22c1002100111n5664f691j962e0297f9833886@mail.gmail.com> <60721B67EAF0994EAFFB561767B700140460F84C@nets13ha.ww300.siemens.net> Message-ID: <4B73F1B1.2090605@comline.de> Hallo, any hints how to monitore citrix xen and esx4i as detailed as possible? Thanks Ralf Prengel Ralf Prengel Manager Customer Care Comline AG Hauert 8 D-44227 Dortmund/Germany Fon +49 231 97575 904 Fax +49 231 97575 257 Mobil +49 151 10831 157 EMail Ralf.Prengel at comline.de www.comline.de Vorstand Stephan Schilling, Erwin Leonhardi Aufsichtsrat Dr. Franz Schoser (Vorsitzender) HR Dortmund B 14570 USt.-ID-Nr. DE 124727422 ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Thu Feb 11 13:48:49 2010 From: nagios at flatto.net (Assaf Flatto) Date: Thu, 11 Feb 2010 12:48:49 +0000 Subject: Service Availability Report issue Message-ID: <4B73FCB1.7090501@flatto.net> Hello I have an odd issue with the Service Availability Report . The setup is as follows , i have a distributed nagios (3.2.0 from source) with a server running 6 http queries from and external network and submitting the result to the central server . The results are arriving fine and functionality is working on . When I go to view the Service Availability for the services monitored by the remote server , for 5 out of the 6 i get the proper report ( up ,down, success etc') , but for the 6th , it only entries like this : 11-01-2010 16:05:23 11-01-2010 16:05:23 0d 0h 0m 0s PROGRAM (RE)START Program start 11-01-2010 16:05:23 11-01-2010 16:07:18 0d 0h 1m 55s PROGRAM END Abnormal program termination All the services configuration is identical ( with the exception of the IP of course) , but this is the only one to display this behaviour , any one has any ideas why ? Thanks Assaf -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Feb 11 14:03:57 2010 From: marc at ena.com (Marc Powell) Date: Thu, 11 Feb 2010 07:03:57 -0600 Subject: latency & response tme In-Reply-To: References: , <4B72DA66.1090007@hp.com> Message-ID: On Feb 11, 2010, at 2:32 AM, Tarak Mukherjee wrote: > Hi List, > > I don't think check_http will server the requirement, check_http will attempt to open an HTTP connection with the host. Successful connects return STATE_OK, refusals and timeouts return STATE_CRITICAL other errors return STATE_UNKNOWN. > > I want the response / latency time, mean to say that How long time it's taking to load the page/URL. $ ~nagios/libexec/check_http -H www.google.com HTTP OK HTTP/1.0 200 OK - 6929 bytes in 0.067 seconds |time=0.067043s;;;0.000000 size=6929B;;;0 0.067 seconds is the response time. -- Marc ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From johannes at dagemark.com Thu Feb 11 15:15:55 2010 From: johannes at dagemark.com (Johannes Dagemark) Date: Thu, 11 Feb 2010 15:15:55 +0100 Subject: Monitoring citrix xen Server 5.5 and vmware esx4i In-Reply-To: <4B73F1B1.2090605@comline.de> References: <48dff22c1002100111n5664f691j962e0297f9833886@mail.gmail.com> <60721B67EAF0994EAFFB561767B700140460F84C@nets13ha.ww300.siemens.net> <4B73F1B1.2090605@comline.de> Message-ID: <4B74111B.3020407@dagemark.com> Hi At least for vmware you can check out http://git.op5.org/git/?p=nagios/op5plugins.git;a=blob_plain;f=check_esx3.pl;hb=HEAD I think it should work for esx4i as well Cheers Johannes Dagemark On 2010-02-11 13:01, prengel wrote: > Hallo, > > any hints how to monitore citrix xen and esx4i as detailed as possible? > > Thanks > > Ralf Prengel > Ralf Prengel > Manager > Customer Care > Comline AG > Hauert 8 > D-44227 Dortmund/Germany > > Fon +49 231 97575 904 > Fax +49 231 97575 257 > Mobil +49 151 10831 157 > EMail Ralf.Prengel at comline.de > > > www.comline.de > Vorstand Stephan Schilling, Erwin Leonhardi > Aufsichtsrat Dr. Franz Schoser (Vorsitzender) HR Dortmund B 14570 USt.-ID-Nr. DE 124727422 > > > ------------------------------------------------------------------------------ > SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, > Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW > http://p.sf.net/sfu/solaris-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Mvh Johannes Dagemark JD Consulting (op5 service partner) Phone: +46733709024 Email: johannes at dagemark.com ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ralf.prengel at comline.de Thu Feb 11 15:27:45 2010 From: ralf.prengel at comline.de (prengel) Date: Thu, 11 Feb 2010 15:27:45 +0100 Subject: Monitoring citrix xen Server 5.5 and vmware esx4i In-Reply-To: <4B74111B.3020407@dagemark.com> References: <48dff22c1002100111n5664f691j962e0297f9833886@mail.gmail.com> <60721B67EAF0994EAFFB561767B700140460F84C@nets13ha.ww300.siemens.net> <4B73F1B1.2090605@comline.de> <4B74111B.3020407@dagemark.com> Message-ID: <4B7413E1.9060601@comline.de> Am 11.02.2010 15:15, schrieb Johannes Dagemark: > Hi > > At least for vmware you can check out > http://git.op5.org/git/?p=nagios/op5plugins.git;a=blob_plain;f=check_esx3.pl;hb=HEAD > > I think it should work for esx4i as well > There are some perl-Version problem using the vmware-vSphere CLI 4.0.0 build-161974. Ralf Ralf Prengel Manager Customer Care Comline AG Hauert 8 D-44227 Dortmund/Germany Fon +49 231 97575 904 Fax +49 231 97575 257 Mobil +49 151 10831 157 EMail Ralf.Prengel at comline.de www.comline.de Vorstand Stephan Schilling, Erwin Leonhardi Aufsichtsrat Dr. Franz Schoser (Vorsitzender) HR Dortmund B 14570 USt.-ID-Nr. DE 124727422 ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Koen.Verhaeghe at meucci-solutions.com Thu Feb 11 14:10:46 2010 From: Koen.Verhaeghe at meucci-solutions.com (Verhaeghe, Koen) Date: Thu, 11 Feb 2010 14:10:46 +0100 Subject: check_openmanage and net-snmp v3 Message-ID: <701DEDF6795DA64DAA315656332BE29102C07377@webmail.meucci.be> Hi All, does anyone have an explanation for this: when using check_openmanage with snmp v3, the script exits because some OIDs do not exist for a type of server. (e.g. '1.3.6.1.4.1.674.10893.1.20.130.4.1.9' => 'arrayDiskEnclosureID' for PowerEdge 860). output: ./check_openmanage -H xx.xx.xx.xx -P 3 --authprotocol md5 -U xxxx --authpassword xxxxx --privpassword xxxxx --privprotocol des -p multiline -t 120 -o 3 -b ctrl_fw=all/ctrl_driver=all/ctrl_stdr=all SNMP ERROR [storage / pdisk]: The requested entries are empty or do not exist. When enabling the windows snmp service again and disabling the net-snmp v3, I get the correct output: ./check_openmanage -H xx.xx.xx.xx -P 2 -C xxxxxx -p multiline -t 120 -o 3 -b ctrl_fw=all/ctrl_driver=all/ctrl_stdr=all Physical Disk 0:1 [Ata WDC WD800JD-75MSA3, 0GB] on ctrl 0 needs attention: Failed Logical drive 0 'Windows Disk 0' [RAID-1, 73.57 GB] on ctrl 0 needs attention: Degraded|'fan_1_bmc_cpu#fan'=3225RPM;0;0 'fan_2_bmc_dimm_fan'=3150RPM;0;0 'temp_0_bmc_planar'=31C;48;53 tested with: OMSA version: 5.1 and 6.2 Net-snmp (x86) versions 5.4.2.1 and 5.5 NET::SNMP 6.0.0 on the nagios server Any ideas? I've tried commenting out the OIDs that do not exist (and all related script steps) but then the output gives 'OK', but I know there is a degraded disk... ./check_openmanage -H xx.xx.xx.xx -P 3 --authprotocol md5 -U xxxx --authpassword xxxxx --privpassword xxxxx --privprotocol des -p multiline -t 120 -o 3 -b ctrl_fw=all/ctrl_driver=all/ctrl_stdr=all OK - System: 'PowerEdge 860', SN: 'J478F3J', hardware working fine, 1 logical drives, 2 physical drives ----- BIOS='A05 10/04/2007', DRAC4='1.60', BMC='1.75' ----- Ctrl 0 [SAS 5/iR Adapter]: Fw='00.10.51.00.06.12.05.00', Dr='1.21.08.00' ----- OpenManage Server Administrator (OMSA) version: '5.1.0'|'temp_0_bmc_planar'=30C;48;53 On other types of servers I get a similar error for [cooling] (e.g on a 2950) Any help would be greatly appreciated. Thanks, Koen ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Koen.Verhaeghe at meucci-solutions.com Thu Feb 11 16:25:03 2010 From: Koen.Verhaeghe at meucci-solutions.com (Verhaeghe, Koen) Date: Thu, 11 Feb 2010 16:25:03 +0100 Subject: check_openmanage and net-snmp v3 Message-ID: <701DEDF6795DA64DAA315656332BE29102C073FF@webmail.meucci.be> Hi All, does anyone have an explanation for this: when using check_openmanage with snmp v3, the script exits because some OIDs do not exist for a type of server. (e.g. '1.3.6.1.4.1.674.10893.1.20.130.4.1.9' => 'arrayDiskEnclosureID' for PowerEdge 860). output: ./check_openmanage -H xx.xx.xx.xx -P 3 --authprotocol md5 -U xxxx --authpassword xxxxx --privpassword xxxxx --privprotocol des -p multiline -t 120 -o 3 -b ctrl_fw=all/ctrl_driver=all/ctrl_stdr=all SNMP ERROR [storage / pdisk]: The requested entries are empty or do not exist. When enabling the windows snmp service again and disabling the net-snmp v3, I get the correct output: ./check_openmanage -H xx.xx.xx.xx -P 2 -C xxxxxx -p multiline -t 120 -o 3 -b ctrl_fw=all/ctrl_driver=all/ctrl_stdr=all Physical Disk 0:1 [Ata WDC WD800JD-75MSA3, 0GB] on ctrl 0 needs attention: Failed Logical drive 0 'Windows Disk 0' [RAID-1, 73.57 GB] on ctrl 0 needs attention: Degraded|'fan_1_bmc_cpu#fan'=3225RPM;0;0 'fan_2_bmc_dimm_fan'=3150RPM;0;0 'temp_0_bmc_planar'=31C;48;53 tested with: OMSA version: 5.1 and 6.2 Net-snmp (x86) versions 5.4.2.1 and 5.5 NET::SNMP 6.0.0 on the nagios server Any ideas? I've tried commenting out the OIDs that do not exist (and all related script steps) but then the output gives 'OK', but I know there is a degraded disk... ./check_openmanage -H xx.xx.xx.xx -P 3 --authprotocol md5 -U xxxx --authpassword xxxxx --privpassword xxxxx --privprotocol des -p multiline -t 120 -o 3 -b ctrl_fw=all/ctrl_driver=all/ctrl_stdr=all OK - System: 'PowerEdge 860', SN: 'J478F3J', hardware working fine, 1 logical drives, 2 physical drives ----- BIOS='A05 10/04/2007', DRAC4='1.60', BMC='1.75' ----- Ctrl 0 [SAS 5/iR Adapter]: Fw='00.10.51.00.06.12.05.00', Dr='1.21.08.00' ----- OpenManage Server Administrator (OMSA) version: '5.1.0'|'temp_0_bmc_planar'=30C;48;53 On other types of servers I get a similar error for [cooling] (e.g on a 2950) Any help would be greatly appreciated. Thanks, Koen ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Thu Feb 11 16:46:24 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Thu, 11 Feb 2010 16:46:24 +0100 Subject: check_openmanage and net-snmp v3 In-Reply-To: <701DEDF6795DA64DAA315656332BE29102C07377@webmail.meucci.be> (Koen Verhaeghe's message of "Thu, 11 Feb 2010 14:10:46 +0100") References: <701DEDF6795DA64DAA315656332BE29102C07377@webmail.meucci.be> Message-ID: <15tvde3srlb.fsf@tux.uio.no> "Verhaeghe, Koen" writes: > Hi All, > > does anyone have an explanation for this: > when using check_openmanage with snmp v3, the script exits because some > OIDs do not exist for a type of server. > (e.g. '1.3.6.1.4.1.674.10893.1.20.130.4.1.9' => 'arrayDiskEnclosureID' > for PowerEdge 860). > > output: > ./check_openmanage -H xx.xx.xx.xx -P 3 --authprotocol md5 -U xxxx > --authpassword xxxxx --privpassword xxxxx --privprotocol des -p > multiline -t 120 -o 3 -b ctrl_fw=all/ctrl_driver=all/ctrl_stdr=all > SNMP ERROR [storage / pdisk]: The requested entries are empty or do not > exist. > > When enabling the windows snmp service again and disabling the net-snmp > v3, I get the correct output: > > ./check_openmanage -H xx.xx.xx.xx -P 2 -C xxxxxx -p multiline -t 120 > -o 3 -b ctrl_fw=all/ctrl_driver=all/ctrl_stdr=all > Physical Disk 0:1 [Ata WDC WD800JD-75MSA3, 0GB] on ctrl 0 needs > attention: Failed > Logical drive 0 'Windows Disk 0' [RAID-1, 73.57 GB] on ctrl 0 needs > attention: Degraded|'fan_1_bmc_cpu#fan'=3225RPM;0;0 > 'fan_2_bmc_dimm_fan'=3150RPM;0;0 > 'temp_0_bmc_planar'=31C;48;53 > > tested with: > OMSA version: 5.1 and 6.2 > Net-snmp (x86) versions 5.4.2.1 and 5.5 > NET::SNMP 6.0.0 on the nagios server > > Any ideas? > I've tried commenting out the OIDs that do not exist (and all related > script steps) but then the output gives 'OK', but I know there is a > degraded disk... > > ./check_openmanage -H xx.xx.xx.xx -P 3 --authprotocol md5 -U xxxx > --authpassword xxxxx --privpassword xxxxx --privprotocol des -p > multiline -t 120 -o 3 -b ctrl_fw=all/ctrl_driver=all/ctrl_stdr=all > OK - System: 'PowerEdge 860', SN: 'J478F3J', hardware working fine, 1 > logical drives, 2 physical drives > ----- BIOS='A05 10/04/2007', DRAC4='1.60', BMC='1.75' > ----- Ctrl 0 [SAS 5/iR Adapter]: Fw='00.10.51.00.06.12.05.00', > Dr='1.21.08.00' > ----- OpenManage Server Administrator (OMSA) version: > '5.1.0'|'temp_0_bmc_planar'=30C;48;53 > > On other types of servers I get a similar error for [cooling] (e.g on a > 2950) Hi Koen, I'm the author of that plugin. To be honest, I've never actually tested the SNMPv3 stuff. I just pass the options to Net::SNMP and let it handle it, and hope that it works. You are the first to report SNMPv3 troubles, and I assume that the SNMPv3 users are a minority. I'm always interested in fixing bugs, but I'm unable to reproduce this problem. I see that you're checking a Windows box. I have none of those to play with, but I have set up SNMPv3 on a RHEL5 box. Checking the RHEL5 host via SNMPv3 works just fine: $ ./check_openmanage -H myhost -P 3 --authprotocol md5 -U xxxx \ --authpassword xxxx --privpassword xxxx --privprotocol des Controller 0 [SAS 6/iR Integrated]: Driver '3.04.07rh' is out of date Windows + OMSA + SNMP has had some problems in the past, but at least for SNMPv2c and SNMPv1 these issues should be resolved with OMSA 5.5.0.1 and later versions. It seems there are still issues with SNMPv3. In the past, there have been problems with SNMP and using the Net::SNMP function get_entries() vs. get_table(). The former is preferred because it is faster, since we're not interested in all the OIDs. This is especially true for servers with many physical disks. I have created a test version that fetches the cooling OIDs with get_table() instead of get_entries() if SNMPv3 is used. This version is available here: http://folk.uio.no/trondham/tmp/check_openmanage-snmpv3test Can you try this version on the servers where checking the cooling devices fail? (It's a bit more complicated for physical drives). PS. Please upgrade to OMSA version 5.5.0.1 or later. Previous versions are known to perform badly with SNMP on Windows. Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mark.frost1 at pepsi.com Thu Feb 11 17:30:37 2010 From: mark.frost1 at pepsi.com (Frost, Mark {PBG}) Date: Thu, 11 Feb 2010 11:30:37 -0500 Subject: NRPE compilation on HPUX In-Reply-To: References: Message-ID: -----Original Message----- From: Ciro Iriarte [mailto:cyruspy at gmail.com] Sent: Wednesday, February 10, 2010 5:31 PM To: nagios List Subject: [Nagios-users] NRPE compilation on HPUX Hi, i'm getting issues compiling NRPE on HPUX 11.31, do you know what missing library can cause this? ------------- utils.c: ld: Unsatisfied symbol "fromhost" in file nrpe.o 1 errors. *** Error exit code 1 Stop. *** Error exit code 1 Stop. ------------- Regards, -- Ciro Iriarte Ciro, It's kind of strange, but I know I had the problem you're having at one point, but I no longer seem to have it on HPUX 11.31 (IA64). I wonder if there was some OS patch that was applied that took care of that. The OS level on our servers is shown as "HP-UX B.11.31 U ia64 1200835863" if that helps. swlist does show that we have an installed tcp wrapper package, however: HPUX-TCPWRAP B.11.31 HPUX TCPWrapper daemon 'nm' shows that there is a fromhost() entity in /usr/lib/libwrap.sl (libwrap.1) Unless I'm mistaken, your issue is with the libwrap library which I think exists on HPUX, but isn't complete as far as NRPE is concerned. I never spent a lot of time trying to find a generalized solution to this, but instead kind of hacked something into nrpe.c where I think I just commented out some of the #ifdef HAVE_LIBWRAP code segments. Note that the fromhost() call you're missing is in one of the HAVE_LIBRWRAP sections. I believe that NRPE expects that . We don't use TCP wrappers anyway, so it's not a loss. I first encountered this issue with PA-RISC HPUX and I believe the way I got around was to download the latest tcp wrapper code from the internet, then pulled out the function that I was missing (I don't recall if it was from host My current issue with NRPE on HPUX is twofold -- the lack of some syslog capabilities, and that the configure script doesn't correctly find our SSL libraries. I have made the following 2 hacky changes to make ours work: *** configure.old Mon Mar 31 13:49:11 2008 --- configure Thu Feb 11 11:07:10 2010 *************** *** 6676,6686 **** fi done ! if test x_$found_ssl != x_yes; then ! { { echo "$as_me:$LINENO: error: Cannot find ssl libraries" >&5 ! echo "$as_me: error: Cannot find ssl libraries" >&2;} ! { (exit 1); exit 1; }; } ! else printf "SSL libraries found in $ssllibdir\n"; LDFLAGS="$LDFLAGS -L$ssllibdir"; --- 6676,6686 ---- fi done ! # if test x_$found_ssl != x_yes; then ! # { { echo "$as_me:$LINENO: error: Cannot find ssl libraries" >&5 ! #echo "$as_me: error: Cannot find ssl libraries" >&2;} ! # { (exit 1); exit 1; }; } ! # else printf "SSL libraries found in $ssllibdir\n"; LDFLAGS="$LDFLAGS -L$ssllibdir"; *************** *** 6701,6707 **** fi # awk to strip off meta data at bottom of dhparam output $sslbin dhparam -C 512 | awk '/^-----/ {exit} {print}' > include/dh.h ! fi fi echo "$as_me:$LINENO: checking for Kerberos include files" >&5 --- 6701,6707 ---- fi # awk to strip off meta data at bottom of dhparam output $sslbin dhparam -C 512 | awk '/^-----/ {exit} {print}' > include/dh.h ! # fi fi echo "$as_me:$LINENO: checking for Kerberos include files" >&5 *** src/nrpe.c.old Mon Mar 31 14:02:59 2008 --- src/nrpe.c Thu Feb 11 11:18:34 2010 *************** *** 613,622 **** log_facility=LOG_UUCP; else if(!strcmp(varvalue,"cron")) log_facility=LOG_CRON; ! else if(!strcmp(varvalue,"authpriv")) log_facility=LOG_AUTHPRIV; else if(!strcmp(varvalue,"ftp")) ! log_facility=LOG_FTP; else if(!strcmp(varvalue,"local0")) log_facility=LOG_LOCAL0; else if(!strcmp(varvalue,"local1")) --- 613,622 ---- log_facility=LOG_UUCP; else if(!strcmp(varvalue,"cron")) log_facility=LOG_CRON; ! /* else if(!strcmp(varvalue,"authpriv")) log_facility=LOG_AUTHPRIV; else if(!strcmp(varvalue,"ftp")) ! log_facility=LOG_FTP; */ else if(!strcmp(varvalue,"local0")) log_facility=LOG_LOCAL0; else if(!strcmp(varvalue,"local1")) Hope that's helpful. Mark -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mirko.stefanelli at gmail.com Thu Feb 11 17:26:44 2010 From: mirko.stefanelli at gmail.com (mirko stefanelli) Date: Thu, 11 Feb 2010 17:26:44 +0100 Subject: [SOLVED]Re: Problem with check_by_ssh In-Reply-To: <5bcb62b61002050952t7af84fe7m18eb42366a83a9bb@mail.gmail.com> References: <9eaab8b81002050712m75df21cbh61eaf9175c08eee0@mail.gmail.com> <5bcb62b61002050952t7af84fe7m18eb42366a83a9bb@mail.gmail.com> Message-ID: <9eaab8b81002110826r973e2a0o7e8171df2e516aae@mail.gmail.com> Hi to all, I have solved the problem by setting -t parameter in template command. By defaul plugin time out is set to 10 second. Some our server needs more time to execute control. At the moment I have set -t at 20 seconds and it's work fine. Regards, Mirko. 2010/2/5 Matt Simmons > My first guess would be that there is some sort of environmental > variable being set when you log in that wouldn't be set when you > aren't (ie, when Nagios triggers it). Are you using keys that have > passphrases, maybe? Can you give us a look at the contents of > check_by_ssh? > > --Matt > > > On Fri, Feb 5, 2010 at 10:12 AM, mirko stefanelli > wrote: > > Hi to all, > > > > We run nagios on Ubuntu server with 2Gb ramm kernel version > > 2.6.24-23-server. Nagios version is 3.2.0. > > > > We have configured all our server (Linux REL) to execute our custom > plugin > > by check_by_ssh. The problem is that for some server this check fails and > > the output is: > > > > CRITICAL - Plugin timed out while executing system call > > > > If we try to do same command, executed with check_by_shh, from user > nagios > > on cli the command work fine! > > > > Any idea? > > > > Regards, > > Mirko. > > > > > ------------------------------------------------------------------------------ > > The Planet: dedicated and managed hosting, cloud storage, colocation > > Stay online with enterprise data centers and the best network in the > > business > > Choose flexible plans and management services without long-term contracts > > Personal 24x7 support from experience hosting pros just a phone call > away. > > http://p.sf.net/sfu/theplanet-com > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > reporting > > any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > > > > > -- > > LITTLE GIRL: But which cookie will you eat FIRST? > COOKIE MONSTER: Me think you have misconception of cookie-eating process. > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From h.majnoonian at farasatcom.net Thu Feb 11 22:19:14 2010 From: h.majnoonian at farasatcom.net (Hamed Majnoonian) Date: Fri, 12 Feb 2010 00:49:14 +0330 Subject: I can't browse CGI files in browser Message-ID: <42406.5438834534$1265956711@news.gmane.org> Hi! I have set up Nagios 3 on FreeBSD 8 and Apache 2. I haven't set up any authentication on my web server. When I run my browser I can see the front page of Nagios, but whenever I try to click on a report page, browser tells me that it can save the CGI file (as a download) and I can't browse the content of the report on my browser. Thanks Hamed -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From reachtarak at hotmail.com Fri Feb 12 07:56:19 2010 From: reachtarak at hotmail.com (Tarak Mukherjee) Date: Fri, 12 Feb 2010 12:26:19 +0530 Subject: latency & response tme In-Reply-To: References: , , <4B72DA66.1090007@hp.com>, , Message-ID: ohh gr8, what about , webinject integration in Nagios. /\ Ricky > From: marc at ena.com > Date: Thu, 11 Feb 2010 07:03:57 -0600 > To: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] latency & response tme > > > On Feb 11, 2010, at 2:32 AM, Tarak Mukherjee wrote: > > > Hi List, > > > > I don't think check_http will server the requirement, check_http will attempt to open an HTTP connection with the host. Successful connects return STATE_OK, refusals and timeouts return STATE_CRITICAL other errors return STATE_UNKNOWN. > > > > I want the response / latency time, mean to say that How long time it's taking to load the page/URL. > > $ ~nagios/libexec/check_http -H www.google.com > HTTP OK HTTP/1.0 200 OK - 6929 bytes in 0.067 seconds |time=0.067043s;;;0.000000 size=6929B;;;0 > > 0.067 seconds is the response time. > > -- > Marc > > > ------------------------------------------------------------------------------ > SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, > Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW > http://p.sf.net/sfu/solaris-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null _________________________________________________________________ Hotmail: Trusted email with Microsoft?s powerful SPAM protection. https://signup.live.com/signup.aspx?id=60969 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From reachtarak at hotmail.com Fri Feb 12 07:59:18 2010 From: reachtarak at hotmail.com (Tarak Mukherjee) Date: Fri, 12 Feb 2010 12:29:18 +0530 Subject: check_mysql_health Message-ID: Hi , I have used check_mysql_health ( http://labs.consol.de/lang/en/nagios/check_mysql_health/) in my nagios server. When i am running the checks from the command line , it works fine. ./check_mysql_health -H xxx.xxx.xxx.xxx --username abcd --password xyz --mode tablecache-hitrate OK - table cache hitrate 98.54%, 1.31% filled | tablecache_hitrate=98.54%;99:;95: tablecache_fillrate=1.31% but when it?s displaying the output on Nagios front end , that time i?m getting the below error?. ? **ePN failed to compile /usr/local/nagios/libexec/check_mysql_health: ?Missing right curly or square bracket at (eval 23) line 3116, at end of line ? Please help /\ Ricky _________________________________________________________________ Hotmail: Free, trusted and rich email service. https://signup.live.com/signup.aspx?id=60969 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Fri Feb 12 08:27:01 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Thu, 11 Feb 2010 23:27:01 -0800 Subject: I can't browse CGI files in browser In-Reply-To: <000301caab5f$e0f9a3a0$a2eceae0$@majnoonian@farasatcom.net> References: <000301caab5f$e0f9a3a0$a2eceae0$@majnoonian@farasatcom.net> Message-ID: <4B7502C5.70607@hp.com> Sounds like you haven't configured your webserver correctly. Make sure you've enabled running CGIs (using the ExecCGI directive, assuming you're using apache) from the CGI directory. Hamed Majnoonian wrote: > > Hi! > > > > I have set up Nagios 3 on FreeBSD 8 and Apache 2. I haven?t set up > any authentication on my web server. When I run my browser I can see > the front page of Nagios, but whenever I try to click on a report > page, browser tells me that it can save the CGI file (as a download) > and I can?t browse the content of the report on my browser. > > > > Thanks > > Hamed > ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Roland.Sorgenfrei at spb.de Fri Feb 12 09:01:38 2010 From: Roland.Sorgenfrei at spb.de (Roland.Sorgenfrei at spb.de) Date: Fri, 12 Feb 2010 09:01:38 +0100 Subject: Antwort: Nagios Integration with IBM TEC ( Tivoli ) In-Reply-To: References: Message-ID: <6411_1265961608_4B750A88_6411_45_1_OF68E89988.1A3060BB-ONC12576C8.002B985C-C12576C8.002BFB29@nbg.sdv.spb.de> Hello Steve, we use the TEC POSTEMSG for sending of events from Nagios to the TEC. A big benefit is the postXmsg - cache mechanism. When the TEC system is not reachable, all events are cached on the NAGIOS system. Regards, Roland Von: steve f An: Datum: 11.01.2010 15:24 Betreff: [Nagios-users] Nagios Integration with IBM TEC ( Tivoli ) Hello, We have IBM TEC ( Tivoli Enterprise Console) for alot of our current monitoring. Does anyone have any experience / history integrating Nagios with IBM TEC? We use postd messages to get the info from the clients back to TEC. Can Nagios read the postd messages? I guess I could somehow use a passive check to do this correct ?? Would Nagios be considered a viable replacement for TEC? I have not been able to find any Marketing type of info as far as Nagios as a replacement for other monitoring solutions. Thanks, Steve Hotmail: Powerful Free email with security by Microsoft. Get it now. ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From skymen4 at hotmail.com Fri Feb 12 10:18:23 2010 From: skymen4 at hotmail.com (Renaud Vanderhagen) Date: Fri, 12 Feb 2010 10:18:23 +0100 Subject: Ndo2DB and Check for network traffic Message-ID: Hello world ! What backup ndo2db exactly in ndo database ? Is it possible to backup the nagios.log ? Where can I find a "map" of the ndo database ? What do you use to monitor network traffic on servers and switchs interfaces ? I found check_iftraffic.pl and check_centreon_snmp_traffic but I use Nagios and not Centreon... Thanks a lot ! _________________________________________________________________ Nouveau Windows 7 : Trouvez le PC qui vous convient. En savoir plus. http://windows.microsoft.com/shop -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Koen.Verhaeghe at meucci-solutions.com Fri Feb 12 10:28:26 2010 From: Koen.Verhaeghe at meucci-solutions.com (Verhaeghe, Koen) Date: Fri, 12 Feb 2010 10:28:26 +0100 Subject: check_openmanage and net-snmp v3 References: <701DEDF6795DA64DAA315656332BE29102C07377@webmail.meucci.be> <15tvde3srlb.fsf@tux.uio.no> Message-ID: <701DEDF6795DA64DAA315656332BE29102C074AE@webmail.meucci.be> Hi Trond, Thanks for your reply and the new script. These are the results: With windows SNMP (v2) it works: ./check_openmanagetest -H xx.xx.xx.xx -C xxxxx -p multiline -t 120 -o 3 -b ctrl_fw=all/ctrl_driver=all/ctrl_stdr=all OK - System: 'PowerEdge 2950', SN: '7Z3DR3J', hardware working fine, 1 logical drives, 2 physical drives ----- BIOS='2.2.6 02/05/2008', DRAC5='1.33', BMC='2.05' ----- Ctrl 0 [PERC 5/i Integrated]: Fw='5.2.1-0067', Dr='1.21.00.32', Storport: '5.2.3790.3959' ----- Encl 0:0:0 [Backplane]: Fw='1.05' ----- OpenManage Server Administrator (OMSA) version: '6.2.0'|'fan_1_system_board_fan_1_rpm'=5175RPM;0;0 'fan_2_system_board_fan_2_rpm'=5250RPM;0;0 'fan_3_system_board_fan_3_rpm'=5250RPM;0;0 'fan_4_system_board_fan_4_rpm'=5175RPM;0;0 'pwr_mon_0_ps_1_current_1'=0.4A;0;0 'pwr_mon_1_ps_2_current_2'=0.4A;0;0 'pwr_mon_2_system_board_system_level'=184W;916;964 'temp_0_system_board_ambient'=16C;42;47 With net-snmp v3 (version 5.4.2.1) on the same server, diabling the windows snmp, I get: ./check_openmanagetest -H xx.xx.xx.xx -P 3 --authprotocol md5 -U xxxxxx --authpassword xxxxxxx --privpassword xxxxxx --privprotocol des -p multiline -t 120 -o 3 -b ctrl_fw=all/ctrl_driver=all/ctrl_stdr=all SNMP ERROR [processors]: Received genError(5) error-status at error-index 3. This normally indicates a too low version of OMSA, but I am using 6.2.0. Any ideas? Regards, Koen -----Original Message----- From: Trond Hasle Amundsen [mailto:t.h.amundsen at usit.uio.no] Sent: 2010-02-11 16:46 To: Verhaeghe, Koen Cc: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] check_openmanage and net-snmp v3 "Verhaeghe, Koen" writes: > Hi All, > > does anyone have an explanation for this: > when using check_openmanage with snmp v3, the script exits because > some OIDs do not exist for a type of server. > (e.g. '1.3.6.1.4.1.674.10893.1.20.130.4.1.9' => 'arrayDiskEnclosureID' > for PowerEdge 860). > > output: > ./check_openmanage -H xx.xx.xx.xx -P 3 --authprotocol md5 -U xxxx > --authpassword xxxxx --privpassword xxxxx --privprotocol des -p > multiline -t 120 -o 3 -b ctrl_fw=all/ctrl_driver=all/ctrl_stdr=all > SNMP ERROR [storage / pdisk]: The requested entries are empty or do > not exist. > > When enabling the windows snmp service again and disabling the > net-snmp v3, I get the correct output: > > ./check_openmanage -H xx.xx.xx.xx -P 2 -C xxxxxx -p multiline -t > 120 -o 3 -b ctrl_fw=all/ctrl_driver=all/ctrl_stdr=all > Physical Disk 0:1 [Ata WDC WD800JD-75MSA3, 0GB] on ctrl 0 needs > attention: Failed > Logical drive 0 'Windows Disk 0' [RAID-1, 73.57 GB] on ctrl 0 needs > attention: Degraded|'fan_1_bmc_cpu#fan'=3225RPM;0;0 > 'fan_2_bmc_dimm_fan'=3150RPM;0;0 > 'temp_0_bmc_planar'=31C;48;53 > > tested with: > OMSA version: 5.1 and 6.2 > Net-snmp (x86) versions 5.4.2.1 and 5.5 NET::SNMP 6.0.0 on the nagios > server > > Any ideas? > I've tried commenting out the OIDs that do not exist (and all related > script steps) but then the output gives 'OK', but I know there is a > degraded disk... > > ./check_openmanage -H xx.xx.xx.xx -P 3 --authprotocol md5 -U xxxx > --authpassword xxxxx --privpassword xxxxx --privprotocol des -p > multiline -t 120 -o 3 -b ctrl_fw=all/ctrl_driver=all/ctrl_stdr=all > OK - System: 'PowerEdge 860', SN: 'J478F3J', hardware working fine, 1 > logical drives, 2 physical drives > ----- BIOS='A05 10/04/2007', DRAC4='1.60', BMC='1.75' > ----- Ctrl 0 [SAS 5/iR Adapter]: Fw='00.10.51.00.06.12.05.00', > Dr='1.21.08.00' > ----- OpenManage Server Administrator (OMSA) version: > '5.1.0'|'temp_0_bmc_planar'=30C;48;53 > > On other types of servers I get a similar error for [cooling] (e.g on > a > 2950) Hi Koen, I'm the author of that plugin. To be honest, I've never actually tested the SNMPv3 stuff. I just pass the options to Net::SNMP and let it handle it, and hope that it works. You are the first to report SNMPv3 troubles, and I assume that the SNMPv3 users are a minority. I'm always interested in fixing bugs, but I'm unable to reproduce this problem. I see that you're checking a Windows box. I have none of those to play with, but I have set up SNMPv3 on a RHEL5 box. Checking the RHEL5 host via SNMPv3 works just fine: $ ./check_openmanage -H myhost -P 3 --authprotocol md5 -U xxxx \ --authpassword xxxx --privpassword xxxx --privprotocol des Controller 0 [SAS 6/iR Integrated]: Driver '3.04.07rh' is out of date Windows + OMSA + SNMP has had some problems in the past, but at least for SNMPv2c and SNMPv1 these issues should be resolved with OMSA 5.5.0.1 and later versions. It seems there are still issues with SNMPv3. In the past, there have been problems with SNMP and using the Net::SNMP function get_entries() vs. get_table(). The former is preferred because it is faster, since we're not interested in all the OIDs. This is especially true for servers with many physical disks. I have created a test version that fetches the cooling OIDs with get_table() instead of get_entries() if SNMPv3 is used. This version is available here: http://folk.uio.no/trondham/tmp/check_openmanage-snmpv3test Can you try this version on the servers where checking the cooling devices fail? (It's a bit more complicated for physical drives). PS. Please upgrade to OMSA version 5.5.0.1 or later. Previous versions are known to perform badly with SNMP on Windows. Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kve at bankdata.dk Fri Feb 12 10:38:46 2010 From: kve at bankdata.dk (Karsten Vendler) Date: Fri, 12 Feb 2010 10:38:46 +0100 Subject: Vedr.: Antwort: Nagios Integration with IBM TEC ( Tivoli ) In-Reply-To: <6411_1265961608_4B750A88_6411_45_1_OF68E89988.1A3060BB-ONC12576C8.002B985C-C12576C8.002BFB29@nbg.sdv.spb.de> References: <6411_1265961608_4B750A88_6411_45_1_OF68E89988.1A3060BB-ONC12576C8.002B985C-C12576C8.002BFB29@nbg.sdv.spb.de> Message-ID: Hi Roland Is that TEC POSTEMSG a plugin, and/or can othesr get a hold on that, where to get it? Venlig hilsen / Best regards Karsten Vendler IT-tekniker errits? bygade 102 ? dk-7000 fredericia dir. tlf.: +45 7924 2828 www.bankdata.dk Roland.Sorgenfrei at spb.de 12-02-2010 09:21 e89c441c-6210-0efa-d033-05729e303305 Til steve f cc nagios-users at lists.sourceforge.net Emne [Nagios-users] Antwort: Nagios Integration with IBM TEC ( Tivoli ) Hello Steve, we use the TEC POSTEMSG for sending of events from Nagios to the TEC. A big benefit is the postXmsg - cache mechanism. When the TEC system is not reachable, all events are cached on the NAGIOS system. Regards, Roland Von: steve f An: Datum: 11.01.2010 15:24 Betreff: [Nagios-users] Nagios Integration with IBM TEC ( Tivoli ) Hello, We have IBM TEC ( Tivoli Enterprise Console) for alot of our current monitoring. Does anyone have any experience / history integrating Nagios with IBM TEC? We use postd messages to get the info from the clients back to TEC. Can Nagios read the postd messages? I guess I could somehow use a passive check to do this correct ?? Would Nagios be considered a viable replacement for TEC? I have not been able to find any Marketing type of info as far as Nagios as a replacement for other monitoring solutions. Thanks, Steve Hotmail: Powerful Free email with security by Microsoft. Get it now. ------------------------------------------------------------------------------ This SF.Net email is sponsored by the Verizon Developer Community Take advantage of Verizon's best-in-class app development support A streamlined, 14 day to market process makes app distribution fast and easy Join now and get one step closer to millions of Verizon customers http://p.sf.net/sfu/verizon-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/gif Size: 1656 bytes Desc: not available URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Fri Feb 12 12:26:46 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Fri, 12 Feb 2010 12:26:46 +0100 Subject: check_openmanage and net-snmp v3 In-Reply-To: <701DEDF6795DA64DAA315656332BE29102C074AE@webmail.meucci.be> (Koen Verhaeghe's message of "Fri, 12 Feb 2010 10:28:26 +0100") References: <701DEDF6795DA64DAA315656332BE29102C07377@webmail.meucci.be> <15tvde3srlb.fsf@tux.uio.no> <701DEDF6795DA64DAA315656332BE29102C074AE@webmail.meucci.be> Message-ID: <15taavesnih.fsf@tux.uio.no> "Verhaeghe, Koen" writes: > Thanks for your reply and the new script. > These are the results: > > With windows SNMP (v2) it works: Yep, that was expected :) > With net-snmp v3 (version 5.4.2.1) on the same server, diabling the > windows snmp, I get: > ./check_openmanagetest -H xx.xx.xx.xx -P 3 --authprotocol md5 -U xxxxxx > --authpassword xxxxxxx --privpassword xxxxxx --privprotocol des -p > multiline -t 120 -o 3 -b ctrl_fw=all/ctrl_driver=all/ctrl_stdr=all > SNMP ERROR [processors]: Received genError(5) error-status at > error-index 3. Hmm.. was this on one of the servers that previously has problems fetching the cooling OIDs? I believe it would be better to make this work with the standard Windows SNMP service, which is what most people would use. Where the results any different without net-snmp? > This normally indicates a too low version of OMSA, but I am using 6.2.0. With SNMPv2 on Windows, that usually is the case, yes. I have a new test version for you: http://folk.uio.no/trondham/tmp/check_openmanage-snmpv3test2 This version uses get_table() for fetching OIDs for CPUs and physical drives as well as cooling devices. Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Eliot.Picken at wenaas.co.uk Fri Feb 12 12:48:20 2010 From: Eliot.Picken at wenaas.co.uk (Eliot.Picken at wenaas.co.uk) Date: Fri, 12 Feb 2010 11:48:20 +0000 Subject: AUTO: Eliot Picken is out of the office (returning 15/02/2010) Message-ID: I am out of the office until 15/02/2010. I am currently out of the office and will not have access to email. Your email has not been forwarded Best regards Eliot Note: This is an automated response to your message "Re: [Nagios-users] check_openmanage and net-snmp v3" sent on 2/12/2010 11:26:46 AM. This is the only notification you will receive while this person is away. ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From skymen4 at hotmail.com Fri Feb 12 13:02:06 2010 From: skymen4 at hotmail.com (Renaud Vanderhagen) Date: Fri, 12 Feb 2010 13:02:06 +0100 Subject: Ndo2DB and Check for network traffic Message-ID: Hello world ! What backup ndo2db exactly in ndo database ? Is it possible to backup the nagios.log ? Where can I find a "map" of the ndo database ? What do you use to monitor network traffic on servers and switchs interfaces ? I found check_iftraffic.pl and check_centreon_snmp_traffic but I use Nagios and not Centreon... Sorry for my poor english :( Thanks a lot ! _________________________________________________________________ Nouveau Windows 7 : Trouvez le PC qui vous convient. En savoir plus. http://windows.microsoft.com/shop -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Feb 12 13:40:23 2010 From: marc at ena.com (Marc Powell) Date: Fri, 12 Feb 2010 06:40:23 -0600 Subject: latency & response tme In-Reply-To: References: , , <4B72DA66.1090007@hp.com>, , Message-ID: <83D45A81-A583-4A9F-BCD2-4AE326C0FFB5@ena.com> On Feb 12, 2010, at 12:56 AM, Tarak Mukherjee wrote: > ohh gr8, > > what about , webinject integration in Nagios. http://lmgtfy.com/?q=nagios+webinject -- Marc ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Feb 12 13:45:40 2010 From: marc at ena.com (Marc Powell) Date: Fri, 12 Feb 2010 06:45:40 -0600 Subject: check_mysql_health In-Reply-To: References: Message-ID: <54E6178A-316A-414A-97C9-A80C068AE764@ena.com> On Feb 12, 2010, at 12:59 AM, Tarak Mukherjee wrote: > Hi , > > > > > I have used check_mysql_health ( http://labs.consol.de/lang/en/nagios/check_mysql_health/) in my nagios server. > but when it?s displaying the output on Nagios front end , that time i?m getting the below error?. > > > ? **ePN failed to compile /usr/local/nagios/libexec/check_mysql_health: ?Missing right curly or square bracket at (eval 23) line 3116, at end of line ? It appears that the author of this plugin did not write it to the specifications required to be compatible with the embedded perl interpreter. You need to either re-write it to be compatible or disable use of the ePN for this plugin. See the bottom of the following link for more information -- http://nagios.sourceforge.net/docs/3_0/embeddedperl.html -- Marc ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Feb 12 13:51:37 2010 From: marc at ena.com (Marc Powell) Date: Fri, 12 Feb 2010 06:51:37 -0600 Subject: Ndo2DB and Check for network traffic In-Reply-To: References: Message-ID: <15671F60-D7D8-4CD6-8192-B16C0944A3C5@ena.com> On Feb 12, 2010, at 3:18 AM, Renaud Vanderhagen wrote: > > Hello world ! > > What backup ndo2db exactly in ndo database ? > Where can I find a "map" of the ndo database ? See the NDOUtils_DB_Model.pdf included with the program or online at http://nagios.sourceforge.net/docs/ndoutils/NDOUtils_DB_Model.pdf. > Is it possible to backup the nagios.log ? Much or all of it is stored in the Historical Tables. > What do you use to monitor network traffic on servers and switchs interfaces ? Cricket. -- Marc ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Fri Feb 12 15:26:15 2010 From: nagios at flatto.net (Assaf Flatto) Date: Fri, 12 Feb 2010 14:26:15 +0000 Subject: Service Availability Report issue Message-ID: <4B756507.8040204@flatto.net> Hello I have an odd issue with the Service Availability Report . The setup is as follows , i have a distributed nagios (3.2.0 from source) with a server running 6 http queries from and external network and submitting the result to the central server . The results are arriving fine and functionality is working on . When I go to view the Service Availability for the services monitored by the remote server , for 5 out of the 6 i get the proper report ( up ,down, success etc') , but for the 6th , it only entries like this : 11-01-2010 16:05:23 11-01-2010 16:05:23 0d 0h 0m 0s PROGRAM (RE)START Program start 11-01-2010 16:05:23 11-01-2010 16:07:18 0d 0h 1m 55s PROGRAM END Abnormal program termination All the services configuration is identical ( with the exception of the IP of course) , but this is the only one to display this behaviour , any one has any ideas why ? Thanks Assaf ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nccomp at gmail.com Fri Feb 12 16:09:15 2010 From: nccomp at gmail.com (Andrew Davis) Date: Fri, 12 Feb 2010 10:09:15 -0500 Subject: Acknowledgement text in emails... Message-ID: <4B756F1B.7030408@gmail.com> For months I've been acknowledging alerts and including a reason why, etc but I just realized that the acknowledgment emails don't include the reasons that I take the time to include... and thus, my boss sees my ack'ing them, but doesn't know if its a non-critical issue, etc. Is there an option I can add to include the acknowledgment reasons in the acknowledgment emails that go out? -- A. Davis Email: nccomp at gmail.com "There is no limit to what a man can accomplish if he doesn't care who gets the credit." - Ronald Reagan -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From fevinkagen at gmail.com Fri Feb 12 16:10:02 2010 From: fevinkagen at gmail.com (fevin Kagen) Date: Fri, 12 Feb 2010 10:10:02 -0500 Subject: Hyperlink in Acknowledgement Comment In-Reply-To: References: <4B6D9466.3040006@hp.com> <4B7038EA.2060402@hp.com> Message-ID: Does anyone know if this is possible? Thanks- On Mon, Feb 8, 2010 at 11:55 AM, fevin Kagen wrote: > I've tried all sorts of combinations, but what I have now is this: > > http://xxx.xxx.xxx.xxx/nagios/cgi-bin/cmd.cgi?cmd_typ=&cmd_mod=2&host=&service=&sticky_ack=on&send_notification=on&persistent=on&com_data=" href='http://xxx.xxx.xxx.xxx/otrs/index.pl?Action=AgentZoom&TicketID= > '"&btnSubmit=Commit > > I know it can be done via the nagios.cmd file, but since these are two > separate machines, I'm trying to do it via a http command. > > The above command results in the following comment: > > "a href='http://xxx.xxx.xxx.xxx/otrs/index.pl?Action=AgentZoom > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Fri Feb 12 16:11:39 2010 From: nagios at flatto.net (Assaf Flatto) Date: Fri, 12 Feb 2010 15:11:39 +0000 Subject: changing the status map Message-ID: <4B756FAB.8000307@flatto.net> Hello I want to have the map start in another "map" entity then the " Nagios Process " ( for instance the nagios host itself ) . Is there a way to impact the CGI to do so ? I am not a coder , so my understanding of the code is limited , so if i remove the code in the statusmap.c from line 1599 to 1631 to eliminate the "nagios process" entry in the map - will that cause the Nagios server to become the centre of the map? or will it break the CGI completely ? Thanks Assaf ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Fri Feb 12 16:36:47 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Fri, 12 Feb 2010 07:36:47 -0800 Subject: Acknowledgement text in emails... In-Reply-To: <4B756F1B.7030408@gmail.com> References: <4B756F1B.7030408@gmail.com> Message-ID: <4B75758F.6010505@hp.com> Andrew Davis wrote: > For months I've been acknowledging alerts and including a reason why, > etc but I just realized that the acknowledgment emails don't include > the reasons that I take the time to include... and thus, my boss sees > my ack'ing them, but doesn't know if its a non-critical issue, etc. Is > there an option I can add to include the acknowledgment reasons in the > acknowledgment emails that go out? http://nagios.sourceforge.net/docs/2_0/macros.html#serviceackcomment ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Feb 12 16:50:08 2010 From: marc at ena.com (Marc Powell) Date: Fri, 12 Feb 2010 09:50:08 -0600 Subject: Hyperlink in Acknowledgement Comment In-Reply-To: References: <4B6D9466.3040006@hp.com> <4B7038EA.2060402@hp.com> Message-ID: On Feb 12, 2010, at 9:10 AM, fevin Kagen wrote: > Does anyone know if this is possible? Thanks- Yes, I expect that it is but you need to give some real examples of the command{} definitions you're using and your exact workflow. How you're trying to do this is very unclear. IMHO, you just need to use the ticket number as the ack comment and then insert that into an appropriately formatted notification command. > > On Mon, Feb 8, 2010 at 11:55 AM, fevin Kagen wrote: > I've tried all sorts of combinations, but what I have now is this: > > http://xxx.xxx.xxx.xxx/nagios/cgi-bin/cmd.cgi?cmd_typ=&cmd_mod=2&host=&service=&sticky_ack=on&send_notification=on&persistent=on&com_data=""&btnSubmit=Commit > > > > > I know it can be done via the nagios.cmd file, but since these are two separate machines, I'm trying to do it via a http command. > > The above command results in the following comment: > > "a href='http://xxx.xxx.xxx.xxx/otrs/index.pl?Action=AgentZoom -- Marc ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Koen.Verhaeghe at meucci-solutions.com Fri Feb 12 17:02:42 2010 From: Koen.Verhaeghe at meucci-solutions.com (Verhaeghe, Koen) Date: Fri, 12 Feb 2010 17:02:42 +0100 Subject: check_openmanage and net-snmp v3 References: <701DEDF6795DA64DAA315656332BE29102C07377@webmail.meucci.be><15tvde3srlb.fsf@tux.uio.no><701DEDF6795DA64DAA315656332BE29102C074AE@webmail.meucci.be> <15taavesnih.fsf@tux.uio.no> Message-ID: <701DEDF6795DA64DAA315656332BE29102C07658@webmail.meucci.be> Hi Trond, The script is working, at least, it does not give any errors anymore. I even get "Physical Disk 0:1 [Ata WDC WD800JD-75MSA3, 0GB] on ctrl 0 needs attention: Failure Predicted" as expected. I was expecting also an errormessage from the Virtual disks, as they are degraded, but that's not there. Moreover, I know some of our servers have problems with power supplies or memory, so I changed a section in the below mentioned script like you did for the disks and others, just to test: #my $result = $snmp_session->get_entries(-columns => [keys %ps_oid]); ######################################################################## ## # SNMPv3 test ######################################################################## ## my $result = q{}; if ($opt{protocol} == 3) { my $powerDeviceTable = '1.3.6.1.4.1.674.10892.1.600.12.1'; $result = $snmp_session->get_table(-baseoid => $powerDeviceTable); } else { $result = $snmp_session->get_entries(-columns => [keys %ps_oid]); } ######################################################################## ## ######################################################################## ## And now I do get the expected error: "Power Supply 1 [AC] needs attention: Presence detected, Failure detected, AC lost" I think it is safe to say that, when using net-snmp v3, the get_entries method is not giving the expected result. Maybe other people have experience with that. Many thanks for your help. Regards, Koen -----Original Message----- From: Trond Hasle Amundsen [mailto:t.h.amundsen at usit.uio.no] Sent: 2010-02-12 12:27 To: Verhaeghe, Koen Cc: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] check_openmanage and net-snmp v3 "Verhaeghe, Koen" writes: > Thanks for your reply and the new script. > These are the results: > > With windows SNMP (v2) it works: Yep, that was expected :) > With net-snmp v3 (version 5.4.2.1) on the same server, diabling the > windows snmp, I get: > ./check_openmanagetest -H xx.xx.xx.xx -P 3 --authprotocol md5 -U > xxxxxx --authpassword xxxxxxx --privpassword xxxxxx --privprotocol des > -p multiline -t 120 -o 3 -b ctrl_fw=all/ctrl_driver=all/ctrl_stdr=all > SNMP ERROR [processors]: Received genError(5) error-status at > error-index 3. Hmm.. was this on one of the servers that previously has problems fetching the cooling OIDs? I believe it would be better to make this work with the standard Windows SNMP service, which is what most people would use. Where the results any different without net-snmp? > This normally indicates a too low version of OMSA, but I am using 6.2.0. With SNMPv2 on Windows, that usually is the case, yes. I have a new test version for you: http://folk.uio.no/trondham/tmp/check_openmanage-snmpv3test2 This version uses get_table() for fetching OIDs for CPUs and physical drives as well as cooling devices. Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From standalone.sysadmin at gmail.com Fri Feb 12 17:47:23 2010 From: standalone.sysadmin at gmail.com (Matt Simmons) Date: Fri, 12 Feb 2010 11:47:23 -0500 Subject: Acknowledgement text in emails... In-Reply-To: <4B756F1B.7030408@gmail.com> References: <4B756F1B.7030408@gmail.com> Message-ID: <5bcb62b61002120847p364b7a2aoc29f5a60e1815131@mail.gmail.com> As Patrick Morris suggested, it's possible to include the acknowledgement text using the $SERVICEACKCOMMENT$ macro. Just insert it into the proper place in your "notify-host-by-email" command definition. --Matt On Fri, Feb 12, 2010 at 10:09 AM, Andrew Davis wrote: > For months I've been acknowledging alerts and including a reason why, etc > but I just realized that the acknowledgment emails don't include the reasons > that I take the time to include... and thus, my boss sees my ack'ing them, > but doesn't know if its a non-critical issue, etc. Is there an option I can > add to include the acknowledgment reasons in the acknowledgment emails that > go out? > > -- > > > A. Davis > Email: nccomp at gmail.com > > "There is no limit to what a man can accomplish > if he doesn't care who gets the credit." - Ronald Reagan > > ------------------------------------------------------------------------------ > SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, > Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW > http://p.sf.net/sfu/solaris-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- LITTLE GIRL: But which cookie will you eat FIRST? COOKIE MONSTER: Me think you have misconception of cookie-eating process. ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nccomp at gmail.com Fri Feb 12 18:21:47 2010 From: nccomp at gmail.com (Andrew Davis) Date: Fri, 12 Feb 2010 12:21:47 -0500 Subject: Acknowledgement text in emails... In-Reply-To: <5bcb62b61002120847p364b7a2aoc29f5a60e1815131@mail.gmail.com> References: <4B756F1B.7030408@gmail.com> <5bcb62b61002120847p364b7a2aoc29f5a60e1815131@mail.gmail.com> Message-ID: <4B758E2B.9030107@gmail.com> Thanks all. I was able to get the $SERVICEACKCOMMENT$ and $HOSTACKCOMMENT$ options added. Now I'm trying to have it tell me *who* ack'd it. Any thoughts? A. Davis Email: nccomp at gmail.com "There is no limit to what a man can accomplish if he doesn't care who gets the credit." - Ronald Reagan On 2/12/10 11:47 AM, Matt Simmons wrote: > As Patrick Morris suggested, it's possible to include the > acknowledgement text using the $SERVICEACKCOMMENT$ macro. Just insert > it into the proper place in your "notify-host-by-email" command > definition. > > --Matt > > > On Fri, Feb 12, 2010 at 10:09 AM, Andrew Davis wrote: > >> For months I've been acknowledging alerts and including a reason why, etc >> but I just realized that the acknowledgment emails don't include the reasons >> that I take the time to include... and thus, my boss sees my ack'ing them, >> but doesn't know if its a non-critical issue, etc. Is there an option I can >> add to include the acknowledgment reasons in the acknowledgment emails that >> go out? >> >> -- >> >> >> A. Davis >> Email: nccomp at gmail.com >> >> "There is no limit to what a man can accomplish >> if he doesn't care who gets the credit." - Ronald Reagan >> >> ------------------------------------------------------------------------------ >> SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, >> Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW >> http://p.sf.net/sfu/solaris-dev2dev >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when reporting >> any issue. >> ::: Messages without supporting info will risk being sent to /dev/null >> >> > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nccomp at gmail.com Fri Feb 12 18:23:56 2010 From: nccomp at gmail.com (Andrew Davis) Date: Fri, 12 Feb 2010 12:23:56 -0500 Subject: How do I know *who* ack'd an alert? Message-ID: <4B758EAC.2040202@gmail.com> This is a follow-up to my question about including comments in ack emails. I just tested and ack'd an issue and got an email: ***** Nagios ***** Notification Type: ACKNOWLEDGEMENT Host: KVM6 State: DOWN Address: 10.5.12.75 Info: CRITICAL - 10.1.3.75: Host unreachable @ 10.10.45.170. rta nan, lost 100% Date/Time: Fri Feb 12 12:17:58 EST 2010 Additional Info: Comment: Brought down intentionally to test comments in ack emails - From IP:10.5.7.22 How do I see who ack'd this message? Better put, what macro do I use to show who ack'd an alert? -- A. Davis Email: nccomp at gmail.com "There is no limit to what a man can accomplish if he doesn't care who gets the credit." - Ronald Reagan -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mmelin at gmail.com Fri Feb 12 18:29:06 2010 From: mmelin at gmail.com (Martin Melin) Date: Fri, 12 Feb 2010 18:29:06 +0100 Subject: How do I know *who* ack'd an alert? In-Reply-To: <4B758EAC.2040202@gmail.com> References: <4B758EAC.2040202@gmail.com> Message-ID: Come on. You got the link to the macro list in the previous thread, couldn't you be bothered to even look at that? Hint: the macro names end with AUTHOR. Regards Martin Melin On Fri, Feb 12, 2010 at 6:23 PM, Andrew Davis wrote: > This is a follow-up to my question about including comments in ack emails. I > just tested and ack'd an issue and got an email: > > ***** Nagios ***** > > Notification Type: ACKNOWLEDGEMENT > Host: KVM6 > State: DOWN > Address: 10.5.12.75 > Info: CRITICAL - 10.1.3.75: Host unreachable @ 10.10.45.170. rta nan, lost > 100% > > Date/Time: Fri Feb 12 12:17:58 EST 2010 > > Additional Info: > > Comment: > > Brought down intentionally to test comments in ack emails - From > IP:10.5.7.22 > > How do I see who ack'd this message? Better put, what macro do I use to show > who ack'd an alert? > > -- > > > A. Davis > Email: nccomp at gmail.com > > "There is no limit to what a man can accomplish > if he doesn't care who gets the credit." - Ronald Reagan > > ------------------------------------------------------------------------------ > SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, > Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW > http://p.sf.net/sfu/solaris-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nccomp at gmail.com Fri Feb 12 18:32:16 2010 From: nccomp at gmail.com (Andrew Davis) Date: Fri, 12 Feb 2010 12:32:16 -0500 Subject: How do I know *who* ack'd an alert? In-Reply-To: References: <4B758EAC.2040202@gmail.com> Message-ID: <4B7590A0.801@gmail.com> Thanks for the hint. I did, indeed search the page. I didn't, however, think to look for the string "AUTHOR". I was looking at "ACK", "ADMIN", etc... didn't think to look for AUTHOR. Thanks much. :) A. Davis Email: nccomp at gmail.com "There is no limit to what a man can accomplish if he doesn't care who gets the credit." - Ronald Reagan On 2/12/10 12:29 PM, Martin Melin wrote: > Come on. You got the link to the macro list in the previous thread, > couldn't you be bothered to even look at that? > > Hint: the macro names end with AUTHOR. > > Regards > Martin Melin > > On Fri, Feb 12, 2010 at 6:23 PM, Andrew Davis wrote: > >> This is a follow-up to my question about including comments in ack emails. I >> just tested and ack'd an issue and got an email: >> >> ***** Nagios ***** >> >> Notification Type: ACKNOWLEDGEMENT >> Host: KVM6 >> State: DOWN >> Address: 10.5.12.75 >> Info: CRITICAL - 10.1.3.75: Host unreachable @ 10.10.45.170. rta nan, lost >> 100% >> >> Date/Time: Fri Feb 12 12:17:58 EST 2010 >> >> Additional Info: >> >> Comment: >> >> Brought down intentionally to test comments in ack emails - From >> IP:10.5.7.22 >> >> How do I see who ack'd this message? Better put, what macro do I use to show >> who ack'd an alert? >> >> -- >> >> >> A. Davis >> Email: nccomp at gmail.com >> >> "There is no limit to what a man can accomplish >> if he doesn't care who gets the credit." - Ronald Reagan >> >> ------------------------------------------------------------------------------ >> SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, >> Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW >> http://p.sf.net/sfu/solaris-dev2dev >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when reporting >> any issue. >> ::: Messages without supporting info will risk being sent to /dev/null >> >> -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From israel at frontierflying.com Fri Feb 12 19:20:23 2010 From: israel at frontierflying.com (Israel Brewster) Date: Fri, 12 Feb 2010 09:20:23 -0900 Subject: [OT]Network speed test Message-ID: <9D61395A-8130-4B42-8D9D-C0399F17A56B@frontierflying.com> This isn't a nagios question so much as a general network diagnostic question, but since nagios is all about network monitoring and diagnostics, I figured it was likely someone here would know the answer. I am looking for a tool for testing network speeds on our "local" network, similar to internet speed sites. The difference being that I need something I can install somewhere on our local network so we can get site-to-site results between our main hub and our outstations. The idea being to test the actual speed we are getting over our private links. This wouldn't necessarily need to integrate with Nagios however, as it would just be for periodic spot checks - thus the OT label. Thanks. ----------------------------------------------- Israel Brewster Computer Support Technician II Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 ----------------------------------------------- -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- A non-text attachment was scrubbed... Name: Israel Brewster.vcf Type: text/directory Size: 417 bytes Desc: not available URL: -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Fri Feb 12 20:01:57 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Fri, 12 Feb 2010 11:01:57 -0800 Subject: [OT]Network speed test In-Reply-To: <9D61395A-8130-4B42-8D9D-C0399F17A56B@frontierflying.com> References: <9D61395A-8130-4B42-8D9D-C0399F17A56B@frontierflying.com> Message-ID: <4B75A5A5.6080000@hp.com> Israel Brewster wrote: > This isn't a nagios question so much as a general network diagnostic > question, but since nagios is all about network monitoring and > diagnostics, I figured it was likely someone here would know the > answer. I am looking for a tool for testing network speeds on our > "local" network, similar to internet speed sites. The difference being > that I need something I can install somewhere on our local network so > we can get site-to-site results between our main hub and our > outstations. The idea being to test the actual speed we are getting > over our private links. This wouldn't necessarily need to integrate > with Nagios however, as it would just be for periodic spot checks - > thus the OT label. Thanks. Most routers have a way to check current throughput. I'd start there. If you're looking for single-connection throughput, the old download-a-file-and-see-how-long-it-takes method works pretty well. ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dido at imperium.ph Fri Feb 12 19:39:40 2010 From: dido at imperium.ph (Rafael R. Sevilla) Date: Fri, 12 Feb 2010 11:39:40 -0700 Subject: [OT]Network speed test In-Reply-To: <9D61395A-8130-4B42-8D9D-C0399F17A56B@frontierflying.com> References: <9D61395A-8130-4B42-8D9D-C0399F17A56B@frontierflying.com> Message-ID: <4B75A06C.50600@imperium.ph> http://oss.oetiker.ch/smokeping/ This will help measure latency between various destinations within your network, which is one important component that affects network speed. Obviously if your links are having high latency and/or packet loss your speed will be noticeably degraded. Despite the name you can also use HTTP or FTP downloads as part of your tests as opposed to pings, and these can be used to get a better idea of how fast or how slow your network is. -- I am the soul of my code. Fire is my heart and lightning is my brain. I have debugged over a million lines of code. Unaware of loss. Nor aware of gain. Withstood pain to create many programs. Waiting for one's arrival. I have no regrets, this is the only path. My whole life was Unlimited Code Works. http://stormwyrm.blogspot.com -------------- next part -------------- A non-text attachment was scrubbed... Name: signature.asc Type: application/pgp-signature Size: 261 bytes Desc: OpenPGP digital signature URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Feb 12 20:51:28 2010 From: marc at ena.com (Marc Powell) Date: Fri, 12 Feb 2010 13:51:28 -0600 Subject: [OT]Network speed test In-Reply-To: <9D61395A-8130-4B42-8D9D-C0399F17A56B@frontierflying.com> References: <9D61395A-8130-4B42-8D9D-C0399F17A56B@frontierflying.com> Message-ID: On Feb 12, 2010, at 12:20 PM, Israel Brewster wrote: > This isn't a nagios question so much as a general network diagnostic question, but since nagios is all about network monitoring and diagnostics, I figured it was likely someone here would know the answer. I am looking for a tool for testing network speeds on our "local" network, similar to internet speed sites. The difference being that I need something I can install somewhere on our local network so we can get site-to-site results between our main hub and our outstations. http://www.speedtest.net/mini.php or http://en.wikipedia.org/wiki/Iperf, depending on your requirements. -- Marc ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From israel at frontierflying.com Fri Feb 12 21:20:41 2010 From: israel at frontierflying.com (Israel Brewster) Date: Fri, 12 Feb 2010 11:20:41 -0900 Subject: [OT]Network speed test In-Reply-To: <5bcb62b61002121103i309cc06g6008008aa5b1ce47@mail.gmail.com> References: <9D61395A-8130-4B42-8D9D-C0399F17A56B@frontierflying.com> <5bcb62b61002121103i309cc06g6008008aa5b1ce47@mail.gmail.com> Message-ID: <1A2BE0F8-9F27-408C-B855-C0412F1A5158@frontierflying.com> On Feb 12, 2010, at 10:03 AM, Matt Simmons wrote: > I think you're looking for this: http://www.speedtest.net/mini.php Exactly what I was looking for. Thanks. > > Feel free to direct general non-nagios sysadmin related questions to > http://www.serverfault.com as well. Lots of good people on that site. Good tip - I'll definitely look into it. Thanks again! > > --Matt > > > On Fri, Feb 12, 2010 at 1:20 PM, Israel Brewster > wrote: >> This isn't a nagios question so much as a general network diagnostic >> question, but since nagios is all about network monitoring and >> diagnostics, >> I figured it was likely someone here would know the answer. I am >> looking for >> a tool for testing network speeds on our "local" network, similar to >> internet speed sites. The difference being that I need something I >> can >> install somewhere on our local network so we can get site-to-site >> results >> between our main hub and our outstations. The idea being to test >> the actual >> speed we are getting over our private links. This wouldn't >> necessarily need >> to integrate with Nagios however, as it would just be for periodic >> spot >> checks - thus the OT label. Thanks. >> ----------------------------------------------- >> Israel Brewster >> Computer Support Technician II >> Frontier Flying Service Inc. >> 5245 Airport Industrial Rd >> Fairbanks, AK 99709 >> (907) 450-7250 x293 >> ----------------------------------------------- >> >> >> >> ------------------------------------------------------------------------------ >> SOLARIS 10 is the OS for Data Centers - provides features such as >> DTrace, >> Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW >> http://p.sf.net/sfu/solaris-dev2dev >> _______________________________________________ >> Nagios-users mailing list >> Nagios-users at lists.sourceforge.net >> https://lists.sourceforge.net/lists/listinfo/nagios-users >> ::: Please include Nagios version, plugin version (-v) and OS when >> reporting >> any issue. >> ::: Messages without supporting info will risk being sent to /dev/ >> null >> > > > > -- > > LITTLE GIRL: But which cookie will you eat FIRST? > COOKIE MONSTER: Me think you have misconception of cookie-eating > process. ----------------------------------------------- Israel Brewster Computer Support Technician II Frontier Flying Service Inc. 5245 Airport Industrial Rd Fairbanks, AK 99709 (907) 450-7250 x293 ----------------------------------------------- -------------- next part -------------- A non-text attachment was scrubbed... Name: Israel Brewster.vcf Type: text/directory Size: 417 bytes Desc: not available URL: -------------- next part -------------- -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lennart.kvam at softronic.se Mon Feb 15 09:04:29 2010 From: lennart.kvam at softronic.se (lennart.kvam at softronic.se) Date: Mon, 15 Feb 2010 09:04:29 +0100 Subject: check_mem plugin for Solaris10 with perfdata? Message-ID: <437A4C7678CF8348801182B50762E87E0238B874@STSSTOMAIL02.softronic.se> Hello everyone! Does anybody have or knows where i can get a "check_mem" for solaris 10, both with and without Zfs. And I really need the plugin to generate perfdata. Thanks in advanced RD Lennart __\/__ . / ^ ^ \ . |\| (o)(o) |/| #------------------OOOo----O0----oOOO --------------# # Lennart Kvam # # Softronic AB, Ringv?gen 100, 118 60 Stockholm # # Tel: 0708-813279 Vx:08-51909000 # # lennart.kvam at softronic.se # #____________________ Oooo. ___________ # .oooO ( ) ( ) ) / \ ( (_/ \_) -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lennart.kvam at softronic.se Mon Feb 15 09:46:45 2010 From: lennart.kvam at softronic.se (lennart.kvam at softronic.se) Date: Mon, 15 Feb 2010 09:46:45 +0100 Subject: check_mem plugin for Solaris10 with perfdata? Message-ID: <437A4C7678CF8348801182B50762E87E0238B879@STSSTOMAIL02.softronic.se> Hello everyone! Does anybody have or knows where i can get a "check_mem" for solaris 10, both with and without Zfs. And I really need the plugin to generate perfdata. Thanks in advanced RD Lennart __\/__ . / ^ ^ \ . |\| (o)(o) |/| #------------------OOOo----O0----oOOO --------------# # Lennart Kvam # # Softronic AB, Ringv?gen 100, 118 60 Stockholm # # Tel: 0708-813279 Vx:08-51909000 # # lennart.kvam at softronic.se # #____________________ Oooo. ___________ # .oooO ( ) ( ) ) / \ ( (_/ \_) -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lennart.kvam at softronic.se Mon Feb 15 09:50:35 2010 From: lennart.kvam at softronic.se (lennart.kvam at softronic.se) Date: Mon, 15 Feb 2010 09:50:35 +0100 Subject: check_mem plugin for Solaris10 with perfdata? Message-ID: <437A4C7678CF8348801182B50762E87E0238B87A@STSSTOMAIL02.softronic.se> Hello everyone! Does anybody have or knows where i can get a "check_mem" for solaris 10, both with and without Zfs. And I really need the plugin to generate perfdata. Thanks in advanced RD Lennart __\/__ . / ^ ^ \ . |\| (o)(o) |/| #------------------OOOo----O0----oOOO --------------# # Lennart Kvam # # Softronic AB, Ringv?gen 100, 118 60 Stockholm # # Tel: 0708-813279 Vx:08-51909000 # # lennart.kvam at softronic.se # #____________________ Oooo. ___________ # .oooO ( ) ( ) ) / \ ( (_/ \_) -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pitchfork at ederdrom.de Mon Feb 15 12:53:43 2010 From: pitchfork at ederdrom.de (Joerg Linge) Date: Mon, 15 Feb 2010 12:53:43 +0100 Subject: check_mem plugin for Solaris10 with perfdata? In-Reply-To: <437A4C7678CF8348801182B50762E87E0238B87A@STSSTOMAIL02.softronic.se> References: <437A4C7678CF8348801182B50762E87E0238B87A@STSSTOMAIL02.softronic.se> Message-ID: <4B7935C7.2080506@ederdrom.de> Am 15.02.10 09:50, schrieb lennart.kvam at softronic.se: > > > Hello everyone! > > > > Does anybody have or knows where i can get a "check_mem" for solaris 10, both with and without Zfs. Like this one? http://sysadminsjourney.com/content/2009/06/05/new-and-improved-checkmempl-nagios-plugin ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Mon Feb 15 14:13:05 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Mon, 15 Feb 2010 14:13:05 +0100 Subject: check_openmanage and net-snmp v3 In-Reply-To: <701DEDF6795DA64DAA315656332BE29102C07658@webmail.meucci.be> (Koen Verhaeghe's message of "Fri, 12 Feb 2010 17:02:42 +0100") References: <701DEDF6795DA64DAA315656332BE29102C07377@webmail.meucci.be> <15tvde3srlb.fsf@tux.uio.no> <701DEDF6795DA64DAA315656332BE29102C074AE@webmail.meucci.be> <15taavesnih.fsf@tux.uio.no> <701DEDF6795DA64DAA315656332BE29102C07658@webmail.meucci.be> Message-ID: <15tpr46r6am.fsf@tux.uio.no> "Verhaeghe, Koen" writes: > The script is working, at least, it does not give any errors anymore. > I even get "Physical Disk 0:1 [Ata WDC WD800JD-75MSA3, 0GB] on ctrl 0 > needs attention: Failure Predicted" as expected. I was expecting also an > errormessage from the Virtual disks, as they are degraded, but that's > not there. If the error is just "Failure Predicted", it means that the disk is working fine for the time being and the virtual drive status is not affected. When/if the drive eventually fails the virtual drive will be degraded. > Moreover, I know some of our servers have problems with power supplies > or memory, so I changed a section in the below mentioned script like you > did for the disks and others, just to test: > > #my $result = $snmp_session->get_entries(-columns => [keys > %ps_oid]); > > ######################################################################## > ## > # SNMPv3 test > > ######################################################################## > ## > my $result = q{}; > if ($opt{protocol} == 3) { > my $powerDeviceTable = '1.3.6.1.4.1.674.10892.1.600.12.1'; > $result = $snmp_session->get_table(-baseoid => > $powerDeviceTable); > } > else { > $result = $snmp_session->get_entries(-columns => [keys > %ps_oid]); > } > > ######################################################################## > ## > > ######################################################################## > ## > > And now I do get the expected error: > "Power Supply 1 [AC] needs attention: Presence detected, Failure > detected, AC lost" > > I think it is safe to say that, when using net-snmp v3, the get_entries > method is not giving the expected result. The complete picture is still a little unclear to me. Do these problems occur only when you use net-snmp instead of Windows' native snmp agent? (I'm assuming that "net-snmp" refers to http://freshmeat.net/projects/net-snmp). I would be interested in any test results you might have using the native Windows snmp agent with SNMPv3. Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From skymen4 at hotmail.com Mon Feb 15 16:01:19 2010 From: skymen4 at hotmail.com (Renaud Vanderhagen) Date: Mon, 15 Feb 2010 16:01:19 +0100 Subject: Ndo2DB and Check for network traffic Message-ID: On Feb 12, 2010, at 3:18 AM, Renaud Vanderhagen wrote: > > Hello world ! > > What backup ndo2db exactly in ndo database ? > Where can I find a "map" of the ndo database ? See the NDOUtils_DB_Model.pdf included with the program or online at http://nagios.sourceforge.net/docs/ndoutils/NDOUtils_DB_Model.pdf. > Is it possible to backup the nagios.log ? Much or all of it is stored in the Historical Tables. > What do you use to monitor network traffic on servers and switchs interfaces ? Cricket. -- Marc -------------------------------------------------------------------------------------------------------------------------------------- Thanks for your link Marc, it is exactly what I search ! Now I understand the links between the ndo tables ! About monitoring network traffic, Cricket isn't integrated in Nagios, I search for a plugin... Do you know something for that ? Like check_centreon_snmp_traffic but for Nagios. Thanks a lot, Renaud. _________________________________________________________________ Pour le printemps, Windows Live vous propose une foule des jeux rafraichissants ? d?couvrir entre amis. http://www.messengerbillboard.be/fr/play -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lennart.kvam at softronic.se Mon Feb 15 16:06:51 2010 From: lennart.kvam at softronic.se (lennart.kvam at softronic.se) Date: Mon, 15 Feb 2010 16:06:51 +0100 Subject: check_mem plugin for Solaris10 with perfdata? References: <437A4C7678CF8348801182B50762E87E0238B87A@STSSTOMAIL02.softronic.se> <4B7935C7.2080506@ederdrom.de> Message-ID: <437A4C7678CF8348801182B50762E87E0238B886@STSSTOMAIL02.softronic.se> Hello! Yes this would be nice, but it doesnt work on ufs filesystem!? Keep getting: Attempt to access disallowed key 'zfs' in a restricted hash at ./check_mem_test line 173. RD Lelle -----Original Message----- From: Joerg Linge [mailto:pitchfork at ederdrom.de] Sent: den 15 februari 2010 12:54 To: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] check_mem plugin for Solaris10 with perfdata? Am 15.02.10 09:50, schrieb lennart.kvam at softronic.se: > > > Hello everyone! > > > > Does anybody have or knows where i can get a "check_mem" for solaris 10, both with and without Zfs. Like this one? http://sysadminsjourney.com/content/2009/06/05/new-and-improved-checkmem pl-nagios-plugin ------------------------------------------------------------------------ ------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From shadhin71 at gmail.com Mon Feb 15 16:15:36 2010 From: shadhin71 at gmail.com (shadih rahman) Date: Mon, 15 Feb 2010 10:15:36 -0500 Subject: Ndo2DB and Check for network traffic In-Reply-To: <15671F60-D7D8-4CD6-8192-B16C0944A3C5@ena.com> References: <15671F60-D7D8-4CD6-8192-B16C0944A3C5@ena.com> Message-ID: <6db4a4201002150715v3a2c39aew5d62cb21affb6126@mail.gmail.com> Marc, I have similar question about cricket. We use cricket to poll network related data. However I have not found a way to poll once in nagios and use the data for cricket. Currently (for many devices), I do snmp poll twice, one for nagios, one for cricket. Do you have any suggestion as to poll once and have the data available for both system. Thanks On Fri, Feb 12, 2010 at 7:51 AM, Marc Powell wrote: > > On Feb 12, 2010, at 3:18 AM, Renaud Vanderhagen wrote: > > > > > Hello world ! > > > > What backup ndo2db exactly in ndo database ? > > Where can I find a "map" of the ndo database ? > > See the NDOUtils_DB_Model.pdf included with the program or online at > http://nagios.sourceforge.net/docs/ndoutils/NDOUtils_DB_Model.pdf. > > > Is it possible to backup the nagios.log ? > > Much or all of it is stored in the Historical Tables. > > > What do you use to monitor network traffic on servers and switchs > interfaces ? > > Cricket. > > -- > Marc > > > > ------------------------------------------------------------------------------ > SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, > Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW > http://p.sf.net/sfu/solaris-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Cordially, Shadhin Rahman -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Mon Feb 15 16:19:06 2010 From: marc at ena.com (Marc Powell) Date: Mon, 15 Feb 2010 09:19:06 -0600 Subject: Ndo2DB and Check for network traffic In-Reply-To: References: Message-ID: <071F0E54-6CD2-460C-B9AE-3DA1B484487D@ena.com> On Feb 15, 2010, at 9:01 AM, Renaud Vanderhagen wrote: > About monitoring network traffic, Cricket isn't integrated in Nagios, I search for a plugin... Do you know something for that ? Like check_centreon_snmp_traffic but for Nagios. http://exchange.nagios.org/directory/Addons/Graphing-and-Trending -- Marc ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Mon Feb 15 16:27:59 2010 From: marc at ena.com (Marc Powell) Date: Mon, 15 Feb 2010 09:27:59 -0600 Subject: Ndo2DB and Check for network traffic In-Reply-To: <6db4a4201002150715v3a2c39aew5d62cb21affb6126@mail.gmail.com> References: <15671F60-D7D8-4CD6-8192-B16C0944A3C5@ena.com> <6db4a4201002150715v3a2c39aew5d62cb21affb6126@mail.gmail.com> Message-ID: <506B1294-E345-4462-93FA-CD7A66DDD5AA@ena.com> On Feb 15, 2010, at 9:15 AM, shadih rahman wrote: > Marc, > I have similar question about cricket. We use cricket to poll network related data. However I have not found a way to poll once in nagios and use the data for cricket. Currently (for many devices), I do snmp poll twice, one for nagios, one for cricket. Do you have any suggestion as to poll once and have the data available for both system. Thanks No, I don't. The bandwidth necessary to do the polling is so minimal that there's never been a need to consider it. Additionally, cricket polls so many more things than we care about to monitor in Nagios that it doesn't make sense for us to poll just with nagios. Finally, cricket is just much better at doing that kind of polling than nagios is because that's what it was designed to do. -- Marc ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From flyinvap at orange.fr Mon Feb 15 18:19:36 2010 From: flyinvap at orange.fr (Flyinvap) Date: Mon, 15 Feb 2010 18:19:36 +0100 Subject: failover and scheduling Message-ID: <20100215181936.001d382a@orange.fr> Hi, I have to setup two nagios servers in failover mode. I read [1] and have a question. Slave host will not check any service as execute_service_check will be disable. In this case, are services scheduled but not executed or services are not scheduled _and_ not executed ? Regards. [1] http://nagios.sourceforge.net/docs/3_0/redundancy.html -- Flyinvap ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pelama at gmail.com Tue Feb 16 11:16:45 2010 From: pelama at gmail.com (Magnus Persson) Date: Tue, 16 Feb 2010 11:16:45 +0100 Subject: mirror server / slave In-Reply-To: <371a55af1002160206p56a80476jae631ecf50265f28@mail.gmail.com> References: <371a55af1002160206p56a80476jae631ecf50265f28@mail.gmail.com> Message-ID: <0F01A11C-798B-4E85-B48E-BD5C634897AB@gmail.com> > > Hi, > > > Is there a way to put an extended nagios Server (windows) from the Primary > Nagios monitor server? > > I have a situation there I would like to push al current status from the > Primary Nagios (monitor) server to an other Nagios (monitor) server. The > > server that I want to push status from Is behind a firewall and the second > one is not. > > NSCA is not a solution because the Remote Server can access the Second > Nagios server.. The only way it possible > To communicate is between the 2 Nagios servers.. > > > This Second Nagios would act like a global nagios server and have > information from many companys.. > > Any suggestions? > > Regards. > Magnus ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sean.obrien at jp.fujitsu.com Tue Feb 16 11:37:22 2010 From: sean.obrien at jp.fujitsu.com (Sean O'Brien) Date: Tue, 16 Feb 2010 19:37:22 +0900 Subject: Nagios 2.5 stalling Message-ID: <201002161037.AA00536@hunte.jp.fujitsu.com> Dear All, I currently am having a some problems with an older version of Nagios (2.5). The situation is that at random intervals and for no apparent reason Nagios just halts. It doesn't process alerts, forward information or allow access, it simply stops functioning. It is not a hardware/spec issue and most of the time functions perfectly. I was hoping that someone else had come across this issue and could guide me in the right direction. Best Regards, Sean ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ae at op5.se Tue Feb 16 12:10:31 2010 From: ae at op5.se (Andreas Ericsson) Date: Tue, 16 Feb 2010 12:10:31 +0100 Subject: Nagios 2.5 stalling In-Reply-To: <201002161037.AA00536@hunte.jp.fujitsu.com> References: <201002161037.AA00536@hunte.jp.fujitsu.com> Message-ID: <4B7A7D27.6010908@op5.se> On 02/16/2010 11:37 AM, Sean O'Brien wrote: > Dear All, > > I currently am having a some problems with an older version of Nagios (2.5). > Upgrade to latest Nagios 2.x before you report bugs, please. It's likely your problem has already been fixed, so we'd rather not waste time trying to hunt down a bug that we can't reproduce with the latest bugfix release of the version you're using. -- Andreas Ericsson andreas.ericsson at op5.se OP5 AB www.op5.se Tel: +46 8-230225 Fax: +46 8-230231 Considering the successes of the wars on alcohol, poverty, drugs and terror, I think we should give some serious thought to declaring war on peace. ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nigel.leach at uk.bnpparibas.com Tue Feb 16 13:53:59 2010 From: nigel.leach at uk.bnpparibas.com (nigel.leach at uk.bnpparibas.com) Date: Tue, 16 Feb 2010 12:53:59 +0000 Subject: ndomod: Could not open data sink Message-ID: Nagios Core 3.2.0 NDO2DB 1.4b9 MySQL 5.0.17 Redhat AS4 U6 I have the dreaded "ndomod: Could not open data sink!" problem. Nagios itself is running happily, and I am trying to integrate Nagvis. These are my Nagios processes, and socket file nagios 29946 1 0 10:50 ? 00:00:00 /usr/local/nagios/bin/ndo2db -c /usr/local/nagios/etc/ndo2db.cfg nagios 29990 1 0 10:50 ? 00:00:08 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg srwxrwx--- 1 nagios nagios 0 Feb 16 12:36 /usr/local/nagios/var/ndo.sock No Nagios data is going through to the mysql Nagios tables though. They are all created, but none have been populated. I've gone through all the numerous other posts on this topic, but nothing has worked. I can write to my mysql d/b using the db_user/db_pass in ndo2db.cfg. This is from my Nagios log (coincidently, I can never get anything written to the ndo2db.debug file). [1266317457] ndomod: Could not open data sink! I'll keep trying, but some output may get lost... [1266317457] Event broker module '/usr/local/nagios/bin/ndomod.o' initialized successfully. [1266321105] ndomod: Still unable to connect to data sink. 32292 items lost, 5000 queued items to flush. The ndo2db.cfg and nagios.cfg are below. Any help much appreciated. Regards Nigel [ndo2db.cfg] lock_file=/usr/local/nagios/var/ndo2db.lock ndo2db_user=nagios ndo2db_group=nagios socket_type=unix socket_name=/usr/local/nagios/var/ndo.sock use_ssl=0 db_servertype=mysql db_host=localhost db_port=3306 db_name=nagios db_prefix=nagios_ db_user=ndouser db_pass=xxxxxx max_timedevents_age=1440 max_systemcommands_age=10080 max_servicechecks_age=10080 max_hostchecks_age=10080 max_eventhandlers_age=44640 max_externalcommands_age=44640 debug_level=-1 debug_verbosity=1 debug_file=/usr/local/nagios/var/ndo2db.debug max_debug_file_size=1000000 [nagios.cfg] log_file=/var/log/nagios/nagios.log cfg_file=/usr/local/nagios/etc/objects/linux-remote.cfg cfg_file=/usr/local/nagios/etc/objects/commands.cfg cfg_file=/usr/local/nagios/etc/objects/contacts.cfg cfg_file=/usr/local/nagios/etc/objects/timeperiods.cfg cfg_file=/usr/local/nagios/etc/objects/templates.cfg cfg_file=/usr/local/nagios/etc/objects/localhost.cfg object_cache_file=/usr/local/nagios/var/objects.cache precached_object_file=/usr/local/nagios/var/objects.precache resource_file=/usr/local/nagios/etc/resource.cfg status_file=/usr/local/nagios/var/status.dat status_update_interval=10 nagios_user=nagios nagios_group=nagios check_external_commands=1 command_check_interval=-1 command_file=/usr/local/nagios/var/rw/nagios.cmd external_command_buffer_slots=4096 lock_file=/usr/local/nagios/var/nagios.lock temp_file=/usr/local/nagios/var/nagios.tmp temp_path=/tmp event_broker_options=-1 broker_module=/usr/local/nagios/bin/ndomod.o config_file=/usr/local/nagios/etc/ndomod.cfg log_rotation_method=d log_archive_path=/var/log/nagios/archives use_syslog=1 log_notifications=1 log_service_retries=1 log_host_retries=1 log_event_handlers=1 log_initial_states=1 log_external_commands=1 log_passive_checks=1 service_inter_check_delay_method=s max_service_check_spread=30 service_interleave_factor=s host_inter_check_delay_method=s max_host_check_spread=30 max_concurrent_checks=0 check_result_reaper_frequency=10 max_check_result_reaper_time=30 check_result_path=/usr/local/nagios/var/spool/checkresults max_check_result_file_age=3600 cached_host_check_horizon=15 cached_service_check_horizon=15 enable_predictive_host_dependency_checks=1 enable_predictive_service_dependency_checks=1 soft_state_dependencies=0 auto_reschedule_checks=0 auto_rescheduling_interval=30 auto_rescheduling_window=180 sleep_time=0.25 service_check_timeout=60 host_check_timeout=30 event_handler_timeout=30 notification_timeout=30 ocsp_timeout=5 perfdata_timeout=5 retain_state_information=1 state_retention_file=/usr/local/nagios/var/retention.dat retention_update_interval=60 use_retained_program_state=1 use_retained_scheduling_info=1 retained_host_attribute_mask=0 retained_service_attribute_mask=0 retained_process_host_attribute_mask=0 retained_process_service_attribute_mask=0 retained_contact_host_attribute_mask=0 retained_contact_service_attribute_mask=0 interval_length=60 check_for_updates=1 bare_update_check=0 use_aggressive_host_checking=0 execute_service_checks=1 accept_passive_service_checks=1 execute_host_checks=1 accept_passive_host_checks=1 enable_notifications=1 enable_event_handlers=1 process_performance_data=0 obsess_over_services=0 obsess_over_hosts=0 translate_passive_host_checks=0 passive_host_checks_are_soft=0 check_for_orphaned_services=1 check_for_orphaned_hosts=1 check_service_freshness=1 service_freshness_check_interval=60 check_host_freshness=0 host_freshness_check_interval=60 additional_freshness_latency=15 enable_flap_detection=1 low_service_flap_threshold=5.0 high_service_flap_threshold=20.0 low_host_flap_threshold=5.0 high_host_flap_threshold=20.0 date_format=us p1_file=/usr/local/nagios/bin/p1.pl enable_embedded_perl=1 use_embedded_perl_implicitly=1 illegal_object_name_chars=`~!$%^&*|'"<>?,()= illegal_macro_output_chars=`~$&|'"<> use_regexp_matching=0 use_true_regexp_matching=0 admin_email=nagios at localhost admin_pager=pagenagios at localhost daemon_dumps_core=0 use_large_installation_tweaks=0 enable_environment_macros=1 debug_level=0 debug_verbosity=1 debug_file=/usr/local/nagios/var/nagios.debug max_debug_file_size=1000000 Nigel Leach Grid Engineer | Front Office Risk Systems | BNP Paribas ___________________________________________________________ This communication is confidential, may be privileged and is meant only for the intended recipient. If you are not the intended recipient, please notify the sender by reply and delete the message from your system. Any unauthorised dissemination, distribution or copying hereof is prohibited. BNP Paribas Trust Corporation UK Limited, BNP Paribas UK Limited, BNP Paribas Commodity Futures Limited, BNP Paribas Asset Management UK Limited and Investment Fund Services Limited are authorised and regulated by the Financial Services Authority. BNP Paribas London Branch and BNP Paribas Wealth Management London Branch are authorised by the CECEI and supervised by the Commission Bancaire. BNP Paribas London Branch is authorised and subject to limited regulation by the Financial Services Authority. Details about the extent of our authorisation and regulation by the Financial Services Authority are available from us on request. BNP Paribas is also a member of the London Stock Exchange. BNP Paribas Wealth Management London Branch is subject to limited regulation by the Financial Services Authority. Details about the extent of our authorisation and regulation by the Financial Services Authority are available from us on request. BNP Paribas Securities Services London Branch is authorised by the CECEI and supervised by the AMF, and subject to limited regulation by the Financial Services Authority. Details on the extent of our regulation by the Financial Services Authority are available from us on request. BNP Paribas Securities Services is also a member of the London Stock Exchange. BNP Paribas Trust Corporation UK Limited is registered in England and Wales (registered no. 4042668) at registered office 55 Moorgate, London EC2R 6PA. BNP Paribas UK Limited is registered in England and Wales (registered no. 1488108) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Commodity Futures Limited is registered in England and Wales (registered no. 2391477) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Asset Management UK Limited is registered in England and Wales (registered no. 2474627) at registered office 10 Harewood Avenue, London NW1 6AA. Investment Fund Services Limited is registered in England and Wales (registered no. 6110770) at registered office 55 Moorgate, London EC2R 6PA. BNP Paribas London Branch is registered in England and Wales (registered no. FC13447) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Wealth Management London Branch is registered in England and Wales (registered no. FC023926) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Securities Services London Branch is registered in England and Wales (registered no. BR006393) at registered office 55 Moorgate, London, EC2R 6PA. BNP Paribas Lease Group Plc is registered in England and Wales (registered no. 2341989) at registered office Northern Cross, Basing View, Basingstoke, Hampshire RG21 4HL. ____________________________________________________________ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Tue Feb 16 14:02:43 2010 From: jim at jimavery.me.uk (Jim Avery) Date: Tue, 16 Feb 2010 13:02:43 +0000 Subject: Passive checks show "red" in tactical overview In-Reply-To: <1265741593.15037.8.camel@workstation> References: <1265741593.15037.8.camel@workstation> Message-ID: <765d77c81002160502r18eb3283hf249310f8e7a4fcb@mail.gmail.com> On 9 February 2010 18:53, David Krider wrote: > I have many services that are passively monitored. They have a > check_command defined, but, like my SNMP trap catcher service, it only > fires when it's time to check freshness. > > All these services show up as "X Services Disabled" in the "Active > Checks" area of the tactical overview screen, as though this was a > problem. However, this is normal, and I don't want to see any "red" > areas on the overview. (I want _any_ red on that screen to mean, > "PANIC!") > > Can anyone tell me how to make Nagios ignore this? Also, these same > services have flapping detection disabled, so, same thing for that as > well. No, I think this is just how Nagios works. I agree it would be nice if the tactical overview would only show those services which are in a state which is not 'as configured'. > Or, rather, is the only way to get rid of this to go ahead and enable > the active_check command, probably on a lengthy interval time? I suppose you could, so long as the active check actually returned a correct check result. Personally I just don't use the tactical overview screen very often. I'm sorry I don't suppose that helps a great deal! Cheers, Jim ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Tue Feb 16 14:08:59 2010 From: jim at jimavery.me.uk (Jim Avery) Date: Tue, 16 Feb 2010 13:08:59 +0000 Subject: Scheduled downtime doesn't clear? In-Reply-To: <5F48703F-0559-44C1-B183-6CA9AE40DD72@highwire.stanford.edu> References: <5F48703F-0559-44C1-B183-6CA9AE40DD72@highwire.stanford.edu> Message-ID: <765d77c81002160508q157222ddme75b1aeef695a965@mail.gmail.com> On 10 February 2010 15:52, Michael Durket wrote: > We're running Nagios 2.5. Every night we take hundreds of services being monitored > by Nagios down for a quick restart. The restart process takes no more than 15 minutes > and just before each service goes down, a script sends a scheduled flexible downtime > request to Nagios (for a 15 minute flexible window). > > We've noticed that, sometimes, Nagios doesn't cancel the downtime at the end of the > window, and days later we have services that are listed as still being in scheduled > downtime (even though the window time is long past). Has this been found and/or > fixed in a later release of Nagios? I used to find in Nagios 2 that I submitted dozens of scheduled downtime periods to start at the same time, it could crash Nagios. If you look in the version history for Nagios 3.x at http://www.nagios.org/development/history/core-3x you will see that in version 3.0b4 there was "Fix for segfault when cancelling active scheduled host downtime" which I guess is what fixed the problem I was seeing anyway. hth, Jim ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Tue Feb 16 14:17:40 2010 From: jim at jimavery.me.uk (Jim Avery) Date: Tue, 16 Feb 2010 13:17:40 +0000 Subject: Service Availability Report issue In-Reply-To: <4B756507.8040204@flatto.net> References: <4B756507.8040204@flatto.net> Message-ID: <765d77c81002160517k54d67a90q510ffcb98764230f@mail.gmail.com> On 12 February 2010 14:26, Assaf Flatto wrote: > Hello > > I have an odd issue with the Service Availability Report . > The setup is as follows , i have a distributed nagios (3.2.0 from > source) ?with a server running 6 http queries from and external network > and submitting the result to the central server . > > The results are arriving fine and functionality is working on . > When I go to view the Service Availability for the services monitored by > the remote server , for 5 out of the 6 i get the proper report ( up > ,down, success etc') , but for the 6th , it only entries like this : > > 11-01-2010 16:05:23 ? ? 11-01-2010 16:05:23 ? ? 0d 0h 0m 0s ? ? PROGRAM > (RE)START ? ? ? Program start > 11-01-2010 16:05:23 ? ? 11-01-2010 16:07:18 ? ? 0d 0h 1m 55s ? ?PROGRAM END > Abnormal program termination > > > All the services configuration is identical ( with the exception of the > IP of course) , but this is the only one to display this behaviour , any > one has any ideas why ? I'm not sure, no, but I guess there could be some corruption in the log files which only affects lines in the log file relating to that service. I would grep the nagios log files (under /usr/local/nagios/var/archives) for the entries relating to that service to see if there is anything different about them. Try running an availability report for a very short period (say, the last 24 hours) and see if that works. Also, I would see if there are any special characters in the service definition which might mess up the report CGI. Can you post an example of a service definition which is working ok and one which is not from your objects.cache file? hth, Jim ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Tue Feb 16 14:34:45 2010 From: jim at jimavery.me.uk (Jim Avery) Date: Tue, 16 Feb 2010 13:34:45 +0000 Subject: changing the status map In-Reply-To: <4B756FAB.8000307@flatto.net> References: <4B756FAB.8000307@flatto.net> Message-ID: <765d77c81002160534m54f17d3fp6b50c9c76d467939@mail.gmail.com> On 12 February 2010 15:11, Assaf Flatto wrote: > Hello > > I want to have the map start in another "map" entity then the " Nagios > Process " > ( for instance the nagios host itself ) . > Is there a way to impact the CGI to do so ? > > I am not a coder ?, so my understanding of the code is limited , so if i > remove the code in the statusmap.c from line 1599 to 1631 to eliminate > the "nagios process" entry in the map - will that cause the Nagios > server to become the centre of the map? or will it break the CGI > completely ? It wouldn't do any harm to try! (assuming you take a backup of your compiled statusmap binary first). I don't know 'C' either. You might find the automap in NagVis ( http://www.nagvis.org/ ) is easier to use once it is set up. I would also recommend you try NEXSM (Nagios EXtended Status Map) but it looks like the web page for it is no longer accessible. I'm not sure if this means it is no longer maintained, or whether it's just a temporary problem. Does anyone know what the story is there? hth, Jim ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Tue Feb 16 14:41:18 2010 From: jim at jimavery.me.uk (Jim Avery) Date: Tue, 16 Feb 2010 13:41:18 +0000 Subject: Nagios 2.5 stalling In-Reply-To: <201002161037.AA00536@hunte.jp.fujitsu.com> References: <201002161037.AA00536@hunte.jp.fujitsu.com> Message-ID: <765d77c81002160541t171f1630rc540b31f2e1cb52e@mail.gmail.com> On 16 February 2010 10:37, Sean O'Brien wrote: > Dear All, > > I currently am having a some problems with an older version of Nagios (2.5). > > The situation is that at random intervals and for no apparent reason Nagios just halts. It doesn't > process alerts, forward information or allow access, it simply stops functioning. > > It is not a hardware/spec issue and most of the time functions perfectly. > I was hoping that someone else had come across this issue and could guide me in the right direction. > > Best Regards, > > Sean The only thing which used to crash my Nagios 2.x install was when I tried to have dozens of scheduled down-time periods starting at the same time. It would write some information to the log file though if I remember correctly. I would echo Andreas' comment - it would be wise if possible to consider upgrading if not to the current supported version then to the latest version of Nagios 2.x. I don't think the segfault problem with scheduled downtime was ever fixed in Nagios 2.x so I would recommend an upgrade to 3.x if you can. hth, Jim ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nigel.leach at uk.bnpparibas.com Tue Feb 16 15:13:36 2010 From: nigel.leach at uk.bnpparibas.com (nigel.leach at uk.bnpparibas.com) Date: Tue, 16 Feb 2010 14:13:36 +0000 Subject: ndomod: Could not open data sink Message-ID: Nagios Core 3.2.0 NDO2DB 1.4b9 MySQL 5.0.17 Redhat AS4 U6 I have the dreaded "ndomod: Could not open data sink!" problem. Nagios itself is running happily, and I am trying to integrate Nagvis. These are my Nagios processes, and socket file nagios 29946 1 0 10:50 ? 00:00:00 /usr/local/nagios/bin/ndo2db -c /usr/local/nagios/etc/ndo2db.cfg nagios 29990 1 0 10:50 ? 00:00:08 /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg srwxrwx--- 1 nagios nagios 0 Feb 16 12:36 /usr/local/nagios/var/ndo.sock No Nagios data is going through to the mysql Nagios tables though. They are all created, but none have been populated. I've gone through all the numerous other posts on this topic, but nothing has worked. I can write to my mysql d/b using the db_user/db_pass in ndo2db.cfg. This is from my Nagios log (coincidently, I can never get anything written to the ndo2db.debug file). [1266317457] ndomod: Could not open data sink! I'll keep trying, but some output may get lost... [1266317457] Event broker module '/usr/local/nagios/bin/ndomod.o' initialized successfully. [1266321105] ndomod: Still unable to connect to data sink. 32292 items lost, 5000 queued items to flush. The ndo2db.cfg and nagios.cfg are below. Any help much appreciated. Regards Nigel Nigel Leach Grid Engineer | Front Office Risk Systems | BNP Paribas ___________________________________________________________ This communication is confidential, may be privileged and is meant only for the intended recipient. If you are not the intended recipient, please notify the sender by reply and delete the message from your system. Any unauthorised dissemination, distribution or copying hereof is prohibited. BNP Paribas Trust Corporation UK Limited, BNP Paribas UK Limited, BNP Paribas Commodity Futures Limited, BNP Paribas Asset Management UK Limited and Investment Fund Services Limited are authorised and regulated by the Financial Services Authority. BNP Paribas London Branch and BNP Paribas Wealth Management London Branch are authorised by the CECEI and supervised by the Commission Bancaire. BNP Paribas London Branch is authorised and subject to limited regulation by the Financial Services Authority. Details about the extent of our authorisation and regulation by the Financial Services Authority are available from us on request. BNP Paribas is also a member of the London Stock Exchange. BNP Paribas Wealth Management London Branch is subject to limited regulation by the Financial Services Authority. Details about the extent of our authorisation and regulation by the Financial Services Authority are available from us on request. BNP Paribas Securities Services London Branch is authorised by the CECEI and supervised by the AMF, and subject to limited regulation by the Financial Services Authority. Details on the extent of our regulation by the Financial Services Authority are available from us on request. BNP Paribas Securities Services is also a member of the London Stock Exchange. BNP Paribas Trust Corporation UK Limited is registered in England and Wales (registered no. 4042668) at registered office 55 Moorgate, London EC2R 6PA. BNP Paribas UK Limited is registered in England and Wales (registered no. 1488108) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Commodity Futures Limited is registered in England and Wales (registered no. 2391477) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Asset Management UK Limited is registered in England and Wales (registered no. 2474627) at registered office 10 Harewood Avenue, London NW1 6AA. Investment Fund Services Limited is registered in England and Wales (registered no. 6110770) at registered office 55 Moorgate, London EC2R 6PA. BNP Paribas London Branch is registered in England and Wales (registered no. FC13447) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Wealth Management London Branch is registered in England and Wales (registered no. FC023926) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Securities Services London Branch is registered in England and Wales (registered no. BR006393) at registered office 55 Moorgate, London, EC2R 6PA. BNP Paribas Lease Group Plc is registered in England and Wales (registered no. 2341989) at registered office Northern Cross, Basing View, Basingstoke, Hampshire RG21 4HL. ____________________________________________________________ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Feb 16 16:15:12 2010 From: marc at ena.com (Marc Powell) Date: Tue, 16 Feb 2010 09:15:12 -0600 Subject: Nagios 2.5 stalling In-Reply-To: <201002161037.AA00536@hunte.jp.fujitsu.com> References: <201002161037.AA00536@hunte.jp.fujitsu.com> Message-ID: On Feb 16, 2010, at 4:37 AM, Sean O'Brien wrote: > Dear All, > > I currently am having a some problems with an older version of Nagios (2.5). > > The situation is that at random intervals and for no apparent reason Nagios just halts. It doesn't > {...} allow access This seems like a clue. Please describe this in more detail. What are you trying and what happens, very specifically? -- Marc ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Feb 16 16:34:52 2010 From: marc at ena.com (Marc Powell) Date: Tue, 16 Feb 2010 09:34:52 -0600 Subject: ndomod: Could not open data sink In-Reply-To: References: Message-ID: <26E07E3A-63AB-4AD7-BC24-6244D2717182@ena.com> On Feb 16, 2010, at 8:13 AM, nigel.leach at uk.bnpparibas.com wrote: > > Nagios Core 3.2.0 > NDO2DB 1.4b9 > MySQL 5.0.17 > Redhat AS4 U6 > No Nagios data is going through to the mysql Nagios tables though. They are all created, but none have been populated. I've gone through all the numerous other posts on this topic, but nothing has worked. Hints of what you've tried will keep us from offering up repeats.. > I can write to my mysql d/b using the db_user/db_pass in ndo2db.cfg. Into the NDO tables? > This is from my Nagios log (coincidently, I can never get anything written to the ndo2db.debug file). Check /var/log/messages or wherever LOG_ERR goes to for possible clues. -- Marc p.s. posting the same question more than once in a 24 hour period could be considered to be rude. ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Tue Feb 16 17:00:30 2010 From: nagios at flatto.net (Assaf Flatto) Date: Tue, 16 Feb 2010 16:00:30 +0000 Subject: changing the status map In-Reply-To: <765d77c81002160534m54f17d3fp6b50c9c76d467939@mail.gmail.com> References: <4B756FAB.8000307@flatto.net> <765d77c81002160534m54f17d3fp6b50c9c76d467939@mail.gmail.com> Message-ID: <4B7AC11E.80306@flatto.net> Jim Avery wrote: > On 12 February 2010 15:11, Assaf Flatto wrote: > >> Hello >> >> I want to have the map start in another "map" entity then the " Nagios >> Process " >> ( for instance the nagios host itself ) . >> Is there a way to impact the CGI to do so ? >> >> I am not a coder , so my understanding of the code is limited , so if i >> remove the code in the statusmap.c from line 1599 to 1631 to eliminate >> the "nagios process" entry in the map - will that cause the Nagios >> server to become the centre of the map? or will it break the CGI >> completely ? >> > > > It wouldn't do any harm to try! (assuming you take a backup of your > compiled statusmap binary first). I don't know 'C' either. > > You might find the automap in NagVis ( http://www.nagvis.org/ ) is > easier to use once it is set up. > > I would also recommend you try NEXSM (Nagios EXtended Status Map) but > it looks like the web page for it is no longer accessible. I'm not > sure if this means it is no longer maintained, or whether it's just a > temporary problem. Does anyone know what the story is there? > > hth, > > Jim > Thanks Jim I have already have NagVis installed , how ever i wanted to find out about the map - cause i find it easier to "beautify" it with icons then the Nagvis . I will do the test and post the results . Assaf ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Tue Feb 16 17:16:37 2010 From: nagios at flatto.net (Assaf Flatto) Date: Tue, 16 Feb 2010 16:16:37 +0000 Subject: changing the status map In-Reply-To: <4B7AC11E.80306@flatto.net> References: <4B756FAB.8000307@flatto.net> <765d77c81002160534m54f17d3fp6b50c9c76d467939@mail.gmail.com> <4B7AC11E.80306@flatto.net> Message-ID: <4B7AC4E5.2070503@flatto.net> On 12 February 2010 15:11, Assaf Flatto wrote: >> >> >>> Hello >>> >>> I want to have the map start in another "map" entity then the " Nagios >>> Process " >>> ( for instance the nagios host itself ) . >>> Is there a way to impact the CGI to do so ? >>> >>> I am not a coder , so my understanding of the code is limited , so if i >>> remove the code in the statusmap.c from line 1599 to 1631 to eliminate >>> the "nagios process" entry in the map - will that cause the Nagios >>> server to become the centre of the map? or will it break the CGI >>> completely ? >>> >>> >> It wouldn't do any harm to try! (assuming you take a backup of your >> compiled statusmap binary first). I don't know 'C' either. >> >> You might find the automap in NagVis ( http://www.nagvis.org/ ) is >> easier to use once it is set up. >> >> I would also recommend you try NEXSM (Nagios EXtended Status Map) but >> it looks like the web page for it is no longer accessible. I'm not >> sure if this means it is no longer maintained, or whether it's just a >> temporary problem. Does anyone know what the story is there? >> >> hth, >> >> Jim >> >> > Thanks Jim > > I have already have NagVis installed , how ever i wanted to find out > about the map - cause i find it easier to "beautify" it with icons then > the Nagvis . > > I will do the test and post the results . > > Assaf > > I've done the test , I backed up the current statusmap.c and statusmap.cgi files , then I commented out the lines 1599-1631 and run the make cgis command . After the make is finished i copied the new statusmap.cgi to the sbin directory and accessed the map . As I wished the map is centred on the "core" network device that all other connections branch from . Assaf ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Tue Feb 16 17:18:47 2010 From: marc at ena.com (Marc Powell) Date: Tue, 16 Feb 2010 10:18:47 -0600 Subject: Service Availability Report issue In-Reply-To: <4B756507.8040204@flatto.net> References: <4B756507.8040204@flatto.net> Message-ID: <62FBBEAA-A5CD-4864-8FEB-20B4E7A5B0E1@ena.com> On Feb 12, 2010, at 8:26 AM, Assaf Flatto wrote: > 11-01-2010 16:05:23 11-01-2010 16:05:23 0d 0h 0m 0s PROGRAM > (RE)START Program start > 11-01-2010 16:05:23 11-01-2010 16:07:18 0d 0h 1m 55s PROGRAM END > Abnormal program termination > > > All the services configuration is identical ( with the exception of the > IP of course) , but this is the only one to display this behaviour , any > one has any ideas why ? Did the service change state within your reporting period? Can you find entries for it in the log files within your reporting period? -- Marc ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michael.friedrich at univie.ac.at Tue Feb 16 19:22:56 2010 From: michael.friedrich at univie.ac.at (Michael Friedrich) Date: Tue, 16 Feb 2010 19:22:56 +0100 Subject: ndomod: Could not open data sink In-Reply-To: References: Message-ID: <4B7AE280.809@univie.ac.at> nigel.leach at uk.bnpparibas.com wrote: > > Nagios Core 3.2.0 > NDO2DB 1.4b9 > MySQL 5.0.17 > Redhat AS4 U6 > > I have the dreaded "ndomod: Could not open data sink!" problem. Nagios > itself is running happily, and I am trying to integrate Nagvis. > > These are my Nagios processes, and socket file > > nagios 29946 1 0 10:50 ? 00:00:00 > /usr/local/nagios/bin/ndo2db -c /usr/local/nagios/etc/ndo2db.cfg > nagios 29990 1 0 10:50 ? 00:00:08 > /usr/local/nagios/bin/nagios -d /usr/local/nagios/etc/nagios.cfg > > srwxrwx--- 1 nagios nagios 0 Feb 16 12:36 /usr/local/nagios/var/ndo.sock > > No Nagios data is going through to the mysql Nagios tables though. > They are all created, but none have been populated. I've gone through > all the numerous other posts on this topic, but nothing has worked. > > I can write to my mysql d/b using the db_user/db_pass in ndo2db.cfg. > > This is from my Nagios log (coincidently, I can never get anything > written to the ndo2db.debug file). > > [1266317457] ndomod: Could not open data sink! I'll keep trying, but > some output may get lost... > [1266317457] Event broker module '/usr/local/nagios/bin/ndomod.o' > initialized successfully. > [1266321105] ndomod: Still unable to connect to data sink. 32292 > items lost, 5000 queued items to flush. debug level in ndo2db.cfg to highest, what says ndo2db.debug? > > The ndo2db.cfg and nagios.cfg are below. and ndomod.cfg? Regards, Michael > > Any help much appreciated. > > Regards > Nigel -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From h.majnoonian at farasatcom.net Tue Feb 16 19:53:36 2010 From: h.majnoonian at farasatcom.net (Hamed Majnoonian) Date: Tue, 16 Feb 2010 22:23:36 +0330 Subject: I can't browse CGI files in browser In-Reply-To: <4B7502C5.70607@hp.com> References: <000301caab5f$e0f9a3a0$a2eceae0$@majnoonian@farasatcom.net> <4B7502C5.70607@hp.com> Message-ID: <791.450896258716$1266346650@news.gmane.org> Thanks for answer. Actually I solved this problem about a week ago... I almost forgot to enable this on my apache configuration file "AddHandler" for cgi files. Thanks -----Original Message----- From: Morris, Patrick [mailto:patrick.morris at hp.com] Sent: Friday, February 12, 2010 10:57 AM To: Hamed Majnoonian Cc: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] I can't browse CGI files in browser Sounds like you haven't configured your webserver correctly. Make sure you've enabled running CGIs (using the ExecCGI directive, assuming you're using apache) from the CGI directory. Hamed Majnoonian wrote: > > Hi! > > > > I have set up Nagios 3 on FreeBSD 8 and Apache 2. I haven't set up > any authentication on my web server. When I run my browser I can see > the front page of Nagios, but whenever I try to click on a report > page, browser tells me that it can save the CGI file (as a download) > and I can't browse the content of the report on my browser. > > > > Thanks > > Hamed > ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From samuel.mutel at free.fr Tue Feb 16 21:05:54 2010 From: samuel.mutel at free.fr (Samuel Mutel) Date: Tue, 16 Feb 2010 21:05:54 +0100 Subject: Problem with recovery notification Message-ID: <4B7AFAA2.7000908@free.fr> Hello, I have two Nagios servers that monitor the same equipement. This two nagios send the result of check by notification to another monitoring system (OpenNMS). I use Nagios 3.2. I received the recovery notification from server 2 but I did not received recovery notification from server 1. Why ? I think that SOFT and HARD states are the problem but I am not sur. In the second server 2 the status of service is HARD - OK so the notification is sent but on server 1, the service is SOFT - OK !!! Here is the log of Nagios : Service Alert of server 1 : [1266299592] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;1;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at 'https://ip_address/sdk/vimService.wsdl' [1266299885] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;2;CHECK_ESX3.PL CRITICAL - Error connecting to server at 'https://ip_address/sdk/webService': Perhaps host is not a Virtual Center or ESX server [1266299925] SERVICE ALERT: test-server;CPU;CRITICAL;HARD;3;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at 'https://ip_address/sdk/vimService.wsdl' [1266303080] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;1;CHECK_ESX3.PL CRITICAL - Error connecting to server at 'https://ip_address/sdk/webService': Perhaps host is not a Virtual Center or ESX server [1266303380] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;2;CHECK_ESX3.PL CRITICAL - Error connecting to server at 'https://ip_address/sdk/webService': Perhaps host is not a Virtual Center or ESX server [1266308485] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;1;(Service Check Timed Out) [1266308500] SERVICE ALERT: test-server;CPU;OK;SOFT;2;CHECK_ESX3.PL OK - "test-server" cpu usage=2.29 % Service Notification of server 1 : [1266299925] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at https://ip_address/sdk/vimService.wsdl [1266300645] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at https://ip_address/sdk/vimService.wsdl [1266301385] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at https://ip_address/sdk/vimService.wsdl [1266301720] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at https://ip_address/sdk/vimService.wsdl [1266303575] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error connecting to server at https://ip_address/sdk/webService: Perhaps host is not a Virtual Center or ESX server [1266304175] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error connecting to server at https://ip_address/sdk/webService: Perhaps host is not a Virtual Center or ESX server [1266304810] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at https://ip_address/sdk/vimService.wsdl [1266305270] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at https://ip_address/sdk/vimService.wsdl [1266305975] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error connecting to server at https://ip_address/sdk/webService: Perhaps host is not a Virtual Center or ESX server Service Alert of server 2 : [1266299856] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;1;(Service Check Timed Out) [1266300161] SERVICE ALERT: test-server;CPU;CRITICAL;HARD;1;(Service Check Timed Out) [1266300516] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;1;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at 'https://ip_address/sdk/vimService.wsdl' [1266301481] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;1;(Service Check Timed Out) [1266301512] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;2;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at 'https://ip_address/sdk/vimService.wsdl' [1266304201] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;1;CHECK_ESX3.PL CRITICAL - Error connecting to server at 'https://ip_address/sdk/webService': Perhaps host is not a Virtual Center or ESX server [1266304506] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;2;(Service Check Timed Out) [1266307211] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;1;(Service Check Timed Out) [1266307411] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;2;CHECK_ESX3.PL CRITICAL - Error connecting to server at 'https://ip_address/sdk/webService': Perhaps host is not a Virtual Center or ESX server [1266308616] SERVICE ALERT: test-server;CPU;OK;HARD;3;CHECK_ESX3.PL OK - "test-server" cpu usage=2.15 % Service Notification of server 2 : [1266301901] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;CRITICAL;send_service_trap_to_onms_preprod;(Service Check Timed Out) [1266302501] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;CRITICAL;send_service_trap_to_onms_preprod;(Service Check Timed Out) [1266303106] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;CRITICAL;send_service_trap_to_onms_preprod;(Service Check Timed Out) [1266304811] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;CRITICAL;send_service_trap_to_onms_preprod;(Service Check Timed Out) [1266305411] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;CRITICAL;send_service_trap_to_onms_preprod;(Service Check Timed Out) [1266306001] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;CRITICAL;send_service_trap_to_onms_preprod;CHECK_ESX3.PL CRITICAL - Error connecting to server at https://ip_address/sdk/webService: Perhaps host is not a Virtual Center or ESX server [1266307611] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;CRITICAL;send_service_trap_to_onms_preprod;CHECK_ESX3.PL CRITICAL - Error connecting to server at https://ip_address/sdk/webService: Perhaps host is not a Virtual Center or ESX server [1266308206] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;CRITICAL;send_service_trap_to_onms_preprod;CHECK_ESX3.PL CRITICAL - Error connecting to server at https://ip_address/sdk/webService: Perhaps host is not a Virtual Center or ESX server [1266308616] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;OK;send_service_trap_to_onms_preprod;CHECK_ESX3.PL OK - test-server cpu usage=2.15 % Thank for your help. Regards, Samuel Mutel. ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ron at tvnz.co.nz Wed Feb 17 00:21:25 2010 From: ron at tvnz.co.nz (Ron Wilson) Date: Wed, 17 Feb 2010 12:21:25 +1300 Subject: Nagios ALerting Message-ID: <7D85034F6BBBD540AF52C59CED3AA1E801C71278@AKVXCH02.tvnzad.tvnz.co.nz> In ngaios 3 how do I do alerting for a small user group of sql admins that require alerting for anything sql related. I can create a contact group for the sql admins but as there are several services on around 30 servers do I have to add the contact group in to each services Alarm Settings Tab? This seems a complicated way to provide notifications to a small group of disparate users. Is there another w ay I can achieve this. It was a lot easier in earlier versions of Nagios ________________________________ -- Ron Wilson Systems Engineer Television New Zealand P.O. Box 3819 Auckland, New Zealand Phone 649-916-7560 A conclusion is the place where you get tired of thinking. (Arthur Bloch) ________________________________ ========================================================== For more information on the Television New Zealand Group, visit us online at tvnz.co.nz ========================================================== CAUTION: This e-mail and any attachment(s) contain information that is intended to be read only by the named recipient(s). This information is not to be used or stored by any other person and/or organisation. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Wed Feb 17 01:01:24 2010 From: patrick.morris at hp.com (patrick.morris at hp.com) Date: Tue, 16 Feb 2010 16:01:24 -0800 Subject: Nagios ALerting In-Reply-To: <7D85034F6BBBD540AF52C59CED3AA1E801C71278@AKVXCH02.tvnzad.tvnz.co.nz> References: <7D85034F6BBBD540AF52C59CED3AA1E801C71278@AKVXCH02.tvnzad.tvnz.co.nz> Message-ID: <20100217000124.GG4621@bakgwai.americas.hpqcorp.net> Hi Ron! On Tue, 16 Feb 2010, Ron Wilson wrote: > In ngaios 3 how do I do alerting for a small user group of sql admins that require alerting for anything sql related. > I can create a contact group for the sql admins but as there are several services on around 30 servers do I have to add the contact group in to each services Alarm Settings Tab? > This seems a complicated way to provide notifications to a small group of disparate users. > Is there another w ay I can achieve this. It was a lot easier in earlier versions of Nagios "Alarm Settings Tab?" I'm not sure Nagios has such a thing. The easiest way to handle it in Nagios 3 would probably be to have a service template set up that adds the SQL admins as a contact, and then use something like "use sql_service" to include that template for any services you want that group notified on (assuming there is a standard set of notification recipients that also receive them. An alternative would be to add that group to each service, as you mentioned. Neither of those has changed substantially since Nagios was first called Nagios, as far as I know. ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ron at tvnz.co.nz Wed Feb 17 01:33:14 2010 From: ron at tvnz.co.nz (Ron Wilson) Date: Wed, 17 Feb 2010 13:33:14 +1300 Subject: Nagios ALerting In-Reply-To: <20100217000124.GG4621@bakgwai.americas.hpqcorp.net> References: <7D85034F6BBBD540AF52C59CED3AA1E801C71278@AKVXCH02.tvnzad.tvnz.co.nz> <20100217000124.GG4621@bakgwai.americas.hpqcorp.net> Message-ID: <7D85034F6BBBD540AF52C59CED3AA1E801C7127C@AKVXCH02.tvnzad.tvnz.co.nz> Sorry I am using Nagiosql and that is where the alarm tab shows I agree if it was command line version it would be easier > -----Original Message----- > From: patrick.morris at hp.com [mailto:patrick.morris at hp.com] > Sent: Wednesday, 17 February 2010 1:01 p.m. > To: Ron Wilson > Cc: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] Nagios ALerting > > Hi Ron! > > On Tue, 16 Feb 2010, Ron Wilson wrote: > > > In ngaios 3 how do I do alerting for a small user group of sql admins > that require alerting for anything sql related. > > I can create a contact group for the sql admins but as there are > several services on around 30 servers do I have to add the contact > group in to each services Alarm Settings Tab? > > This seems a complicated way to provide notifications to a small > group of disparate users. > > Is there another w ay I can achieve this. It was a lot easier in > earlier versions of Nagios > > "Alarm Settings Tab?" I'm not sure Nagios has such a thing. > > The easiest way to handle it in Nagios 3 would probably be to have a > service template set up that adds the SQL admins as a contact, and then > use something like "use sql_service" to include that template for any > services you want that group notified on (assuming there is a standard > set of notification recipients that also receive them. > > An alternative would be to add that group to each service, as you > mentioned. > > Neither of those has changed substantially since Nagios was first > called > Nagios, as far as I know. ========================================================== For more information on the Television New Zealand Group, visit us online at tvnz.co.nz ========================================================== CAUTION: This e-mail and any attachment(s) contain information that is intended to be read only by the named recipient(s). This information is not to be used or stored by any other person and/or organisation. ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From reachtarak at hotmail.com Wed Feb 17 11:03:20 2010 From: reachtarak at hotmail.com (Tarak Mukherjee) Date: Wed, 17 Feb 2010 15:33:20 +0530 Subject: Monitoring Tomcat Message-ID: Hi List, I'm interested tomcat monitoring. Some times i have a problems with java, for example i get java.lang.OutOfMemoryError some idea is monitor catalina.out log for such errors, is there any possibilities to check with Nagios. /\ Ricky _________________________________________________________________ Hotmail: Powerful Free email with security by Microsoft. https://signup.live.com/signup.aspx?id=60969 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pitchfork at ederdrom.de Wed Feb 17 11:28:10 2010 From: pitchfork at ederdrom.de (Joerg Linge) Date: Wed, 17 Feb 2010 11:28:10 +0100 Subject: Monitoring Tomcat In-Reply-To: References: Message-ID: <4B7BC4BA.9020509@ederdrom.de> Am 17.02.10 11:03, schrieb Tarak Mukherjee: > > Hi List, > > I'm interested tomcat monitoring. Some times i have a problems with java, for example i get > > java.lang.OutOfMemoryError > > > > some idea is monitor catalina.out log for such errors, is there any possibilities to check with Nagios. jmx4perl works perfect. http://labs.consol.de/lang/de/jmx4perl/ Joerg ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nigel.leach at uk.bnpparibas.com Wed Feb 17 12:19:53 2010 From: nigel.leach at uk.bnpparibas.com (NIgel Leach) Date: Wed, 17 Feb 2010 11:19:53 +0000 (UTC) Subject: ndomod: Could not open data sink References: <4B7AE280.809@univie.ac.at> Message-ID: Michael Friedrich univie.ac.at> writes: > > > nigel.leach uk.bnpparibas.com wrote: > > Nagios Core 3.2.0 > NDO2DB 1.4b9 > MySQL 5.0.17 > Redhat AS4 U6 > I have the dreaded "ndomod: Could > not open data sink!" problem. Nagios itself is running happily, and > I am trying to integrate Nagvis. These are my Nagios processes, and socket > file > > debug level in ndo2db.cfg to highest, what says ndo2db.debug? Nothing unfortunately,what ever debug options I set, this file is always empty > The ndo2db.cfg and nagios.cfg are below. > > > and ndomod.cfg? here it is, cheers instance_name=default output_type=tcpsocket output=127.0.0.1 tcp_port=5668 use_ssl=0 output_buffer_items=5000 buffer_file=/usr/local/nagios/var/ndomod.tmp file_rotation_interval=14400 file_rotation_timeout=60 reconnect_interval=15 reconnect_warning_interval=15 data_processing_options=-1 config_output_options=2 ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nigel.leach at uk.bnpparibas.com Wed Feb 17 12:40:49 2010 From: nigel.leach at uk.bnpparibas.com (NIgel Leach) Date: Wed, 17 Feb 2010 11:40:49 +0000 (UTC) Subject: ndomod: Could not open data sink References: <26E07E3A-63AB-4AD7-BC24-6244D2717182@ena.com> Message-ID: Marc Powell ena.com> writes: > > > On Feb 16, 2010, at 8:13 AM, nigel.leach uk.bnpparibas.com wrote: > > > > > Nagios Core 3.2.0 > > NDO2DB 1.4b9 > > MySQL 5.0.17 > > Redhat AS4 U6 > > > No Nagios data is going through to the mysql Nagios tables though. They are all created, but none have been > populated. I've gone through all the numerous other posts on this topic, but nothing has worked. > > Hints of what you've tried will keep us from offering up repeats.. > Moved btw TCP socket, and Unix socket ? same results ? no data in d/b Recompiled mysql Checked for extra space in broker_module parameters Changed between v4 and v5 mysql password types Plus lots of checks to confirm my configuration appears correct > > I can write to my mysql d/b using the db_user/db_pass in ndo2db.cfg. > > Into the NDO tables? > Yes > > This is from my Nagios log (coincidently, I can never get anything written to the ndo2db.debug file). > > Check /var/log/messages or wherever LOG_ERR goes to for possible clues. > I get the contents of my nagios.log mirrored in /var/log/messages, but nothing that appears to relate to ndo debug > -- > Marc > > p.s. posting the same question more than once in a 24 hour period could be considered to be rude. "The mailbox is full. The email was rejected." suggests a second post may be necessary > ------------------------------------------------------------------------------ > SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, > Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW > http://p.sf.net/sfu/solaris-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michael.friedrich at univie.ac.at Wed Feb 17 13:21:47 2010 From: michael.friedrich at univie.ac.at (Michael Friedrich) Date: Wed, 17 Feb 2010 13:21:47 +0100 Subject: ndomod: Could not open data sink In-Reply-To: References: <4B7AE280.809@univie.ac.at> Message-ID: <4B7BDF5B.1090101@univie.ac.at> NIgel Leach wrote: > Nothing unfortunately,what ever debug options I set, this file is always empty good. then ndo2db does not fork itsself meaning ndomod cannot connect over the socket. so the current problem is ndomod and/or the socket from which ndo2db should read then. >> and ndomod.cfg? > > here it is, cheers > > instance_name=default > output_type=tcpsocket > output=127.0.0.1 > tcp_port=5668 > use_ssl=0 > output_buffer_items=5000 > buffer_file=/usr/local/nagios/var/ndomod.tmp > file_rotation_interval=14400 > file_rotation_timeout=60 > reconnect_interval=15 > reconnect_warning_interval=15 > data_processing_options=-1 > config_output_options=2 How does that match with this? [ndo2db.cfg] lock_file=/usr/local/nagios/var/ndo2db.lock ndo2db_user=nagios ndo2db_group=nagios socket_type=unix socket_name=/usr/local/nagios/var/ndo.sock Kind regards, Michael -- DI (FH) Michael Friedrich michael.friedrich at univie.ac.at Tel: +43 1 4277 14359 Vienna University Computer Center Universitaetsstrasse 7 A-1010 Vienna, Austria ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jkamens at advent.com Wed Feb 17 13:18:03 2010 From: jkamens at advent.com (Jonathan Kamens) Date: Wed, 17 Feb 2010 07:18:03 -0500 Subject: Monitoring Tomcat In-Reply-To: <4B7BC4BA.9020509@ederdrom.de> References: <4B7BC4BA.9020509@ederdrom.de> Message-ID: <4B7BDE7B.80009@Advent.COM> On 02/17/2010 05:28 AM, Joerg Linge wrote: > Am 17.02.10 11:03, schrieb Tarak Mukherjee: > >> Hi List, >> >> I'm interested tomcat monitoring. Some times i have a problems with java, for example i get >> >> java.lang.OutOfMemoryError >> >> some idea is monitor catalina.out log for such errors, is there any possibilities to check with Nagios. >> > jmx4perl works perfect. > > http://labs.consol.de/lang/de/jmx4perl/ > You might also want to check out monju: http://monju.sourceforge.net/monju.html. jik ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nigel.leach at uk.bnpparibas.com Wed Feb 17 15:31:33 2010 From: nigel.leach at uk.bnpparibas.com (nigel.leach at uk.bnpparibas.com) Date: Wed, 17 Feb 2010 14:31:33 +0000 Subject: ndomod: Could not open data sink In-Reply-To: <4B7BDF5B.1090101@univie.ac.at> References: <4B7BDF5B.1090101@univie.ac.at> Message-ID: sorry Michael & Tom, blame that on all my testing getting a bit out of sync. I've updated my ndomod.cfg as below, restarted mysql/nagios/ndo, and my problems persist instance_name=default output_type=unixsocket output=/usr/local/nagios/var/ndo.sock tcp_port=5668 use_ssl=0 output_buffer_items=5000 buffer_file=/usr/local/nagios/var/ndomod.tmp file_rotation_interval=14400 file_rotation_timeout=60 reconnect_interval=15 reconnect_warning_interval=15 data_processing_options=-1 config_output_options=2 The mysql port is confirmed as 3306 tcp 0 0 0.0.0.0:3306 0.0.0.0:* LISTEN 17380/mysqld unix 2 [ ACC ] STREAM LISTENING 79686764 17380/mysqld /tmp/mysql.sock And the table prefixs are (I think correct as nagios_) mysql> show tables; +----------------------------------------+ | Tables_in_nagios +----------------------------------------+ | nagios_acknowledgements | nagios_commands | nagios_commenthistory | nagios_comments ... mysql> show grants for 'ndouser'@'localhost'; +---------------------------------------------------------------------------------------+ | Grants for ndouser at localhost +---------------------------------------------------------------------------------------+ | GRANT USAGE ON *.* TO 'ndouser'@'localhost' IDENTIFIED BY PASSWORD '4da1ca1e5c4de6f2' | GRANT ALL PRIVILEGES ON `nagios`.* TO 'ndouser'@'localhost' +---------------------------------------------------------------------------------------+ 2 rows in set (0.00 sec) Many thanks for all your help Nigel Leach Grid Engineer | Front Office Risk Systems | BNP Paribas , 5Y021 @ Harewood Avenue ( Internal 711 3634 ! Mobile +44 (0)7789 077 946 ! Direct Dial +44 (0)207 595 3634 * nigel.leach at bnpparibas.com Group Mail: cmitp.fors at bnpparibas.com | Remedy Requests: CMITP FORS | Pickup 711 0350 Internet michael.friedrich at univie.ac.at 17/02/2010 12:21 To Nigel LEACH cc nagios-users at lists.sourceforge.net Subject Re: [Nagios-users] ndomod: Could not open data sink NIgel Leach wrote: > Nothing unfortunately,what ever debug options I set, this file is always empty good. then ndo2db does not fork itsself meaning ndomod cannot connect over the socket. so the current problem is ndomod and/or the socket from which ndo2db should read then. >> and ndomod.cfg? > > here it is, cheers > > instance_name=default > output_type=tcpsocket > output=127.0.0.1 > tcp_port=5668 > use_ssl=0 > output_buffer_items=5000 > buffer_file=/usr/local/nagios/var/ndomod.tmp > file_rotation_interval=14400 > file_rotation_timeout=60 > reconnect_interval=15 > reconnect_warning_interval=15 > data_processing_options=-1 > config_output_options=2 How does that match with this? [ndo2db.cfg] lock_file=/usr/local/nagios/var/ndo2db.lock ndo2db_user=nagios ndo2db_group=nagios socket_type=unix socket_name=/usr/local/nagios/var/ndo.sock Kind regards, Michael -- DI (FH) Michael Friedrich michael.friedrich at univie.ac.at Tel: +43 1 4277 14359 Vienna University Computer Center Universitaetsstrasse 7 A-1010 Vienna, Austria ___________________________________________________________ This communication is confidential, may be privileged and is meant only for the intended recipient. If you are not the intended recipient, please notify the sender by reply and delete the message from your system. Any unauthorised dissemination, distribution or copying hereof is prohibited. BNP Paribas Trust Corporation UK Limited, BNP Paribas UK Limited, BNP Paribas Commodity Futures Limited, BNP Paribas Asset Management UK Limited and Investment Fund Services Limited are authorised and regulated by the Financial Services Authority. BNP Paribas London Branch and BNP Paribas Wealth Management London Branch are authorised by the CECEI and supervised by the Commission Bancaire. BNP Paribas London Branch is authorised and subject to limited regulation by the Financial Services Authority. Details about the extent of our authorisation and regulation by the Financial Services Authority are available from us on request. BNP Paribas is also a member of the London Stock Exchange. BNP Paribas Wealth Management London Branch is subject to limited regulation by the Financial Services Authority. Details about the extent of our authorisation and regulation by the Financial Services Authority are available from us on request. BNP Paribas Securities Services London Branch is authorised by the CECEI and supervised by the AMF, and subject to limited regulation by the Financial Services Authority. Details on the extent of our regulation by the Financial Services Authority are available from us on request. BNP Paribas Securities Services is also a member of the London Stock Exchange. BNP Paribas Trust Corporation UK Limited is registered in England and Wales (registered no. 4042668) at registered office 55 Moorgate, London EC2R 6PA. BNP Paribas UK Limited is registered in England and Wales (registered no. 1488108) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Commodity Futures Limited is registered in England and Wales (registered no. 2391477) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Asset Management UK Limited is registered in England and Wales (registered no. 2474627) at registered office 10 Harewood Avenue, London NW1 6AA. Investment Fund Services Limited is registered in England and Wales (registered no. 6110770) at registered office 55 Moorgate, London EC2R 6PA. BNP Paribas London Branch is registered in England and Wales (registered no. FC13447) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Wealth Management London Branch is registered in England and Wales (registered no. FC023926) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Securities Services London Branch is registered in England and Wales (registered no. BR006393) at registered office 55 Moorgate, London, EC2R 6PA. BNP Paribas Lease Group Plc is registered in England and Wales (registered no. 2341989) at registered office Northern Cross, Basing View, Basingstoke, Hampshire RG21 4HL. ____________________________________________________________ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michael.friedrich at univie.ac.at Wed Feb 17 15:47:58 2010 From: michael.friedrich at univie.ac.at (Michael Friedrich) Date: Wed, 17 Feb 2010 15:47:58 +0100 Subject: ndomod: Could not open data sink In-Reply-To: References: Message-ID: <4B7C019E.80109@univie.ac.at> nigel.leach at uk.bnpparibas.com wrote: > > sorry Michael & Tom, blame that on all my testing getting a bit out of > sync. Is /usr/local/nagios/var/ndo.sock there when nagios with ndomod and ndo2db are running? also show ls -la /usr/local/nagios/var/ Kind regards, Michael > > I've updated my ndomod.cfg as below, restarted mysql/nagios/ndo, and my > problems persist > > instance_name=default > output_type=unixsocket > output=/usr/local/nagios/var/ndo.sock > tcp_port=5668 > use_ssl=0 > output_buffer_items=5000 > buffer_file=/usr/local/nagios/var/ndomod.tmp > file_rotation_interval=14400 > file_rotation_timeout=60 > reconnect_interval=15 > reconnect_warning_interval=15 > data_processing_options=-1 > config_output_options=2 > > > The mysql port is confirmed as 3306 > > tcp 0 0 0.0.0.0:3306 0.0.0.0:* > LISTEN 17380/mysqld > unix 2 [ ACC ] STREAM LISTENING 79686764 17380/mysqld > /tmp/mysql.sock > > And the table prefixs are (I think correct as nagios_) > > mysql> show tables; > +----------------------------------------+ > | Tables_in_nagios > +----------------------------------------+ > | nagios_acknowledgements > | nagios_commands > | nagios_commenthistory > | nagios_comments > ... > > mysql> show grants for 'ndouser'@'localhost'; > +---------------------------------------------------------------------------------------+ > > | Grants for ndouser at localhost > > +---------------------------------------------------------------------------------------+ > > | GRANT USAGE ON *.* TO 'ndouser'@'localhost' IDENTIFIED BY PASSWORD > '4da1ca1e5c4de6f2' > | GRANT ALL PRIVILEGES ON `nagios`.* TO 'ndouser'@'localhost' > > +---------------------------------------------------------------------------------------+ > > 2 rows in set (0.00 sec) > > Many thanks for all your help > > Nigel Leach > Grid Engineer | Front Office Risk Systems | BNP Paribas > > , 5Y021 @ Harewood Avenue > ( Internal 711 3634 ! Mobile +44 (0)7789 077 946 ! Direct Dial +44 > (0)207 595 3634 > * _nigel.leach at bnpparibas.com_ > > Group Mail: cmitp.fors at bnpparibas.com | Remedy Requests: CMITP FORS | > Pickup 711 0350 > > > > > *Internet * > *michael.friedrich at univie.ac.at* > > 17/02/2010 12:21 > > > To > Nigel LEACH > cc > nagios-users at lists.sourceforge.net > Subject > Re: [Nagios-users] ndomod: Could not open data sink > > > > > > > > > NIgel Leach wrote: > > Nothing unfortunately,what ever debug options I set, this file is > always empty > > good. then ndo2db does not fork itsself meaning ndomod cannot connect > over the socket. so the current problem is ndomod and/or the socket from > which ndo2db should read then. > > >> and ndomod.cfg? > > > > here it is, cheers > > > > instance_name=default > > output_type=tcpsocket > > output=127.0.0.1 > > tcp_port=5668 > > use_ssl=0 > > output_buffer_items=5000 > > buffer_file=/usr/local/nagios/var/ndomod.tmp > > file_rotation_interval=14400 > > file_rotation_timeout=60 > > reconnect_interval=15 > > reconnect_warning_interval=15 > > data_processing_options=-1 > > config_output_options=2 > > How does that match with this? > > [ndo2db.cfg] > lock_file=/usr/local/nagios/var/ndo2db.lock > ndo2db_user=nagios > ndo2db_group=nagios > socket_type=unix > socket_name=/usr/local/nagios/var/ndo.sock > > Kind regards, > Michael > > -- > DI (FH) Michael Friedrich > michael.friedrich at univie.ac.at > Tel: +43 1 4277 14359 > > Vienna University Computer Center > Universitaetsstrasse 7 A-1010 Vienna, Austria > > ___________________________________________________________ > This communication is confidential, may be privileged and is meant only for the intended recipient. If you are not the intended recipient, please notify the sender by reply and delete the message from your system. Any unauthorised dissemination, distribution or copying hereof is prohibited. > > BNP Paribas Trust Corporation UK Limited, BNP Paribas UK Limited, BNP Paribas Commodity Futures Limited, BNP Paribas Asset Management UK Limited and Investment Fund Services Limited are authorised and regulated by the Financial Services Authority. > > BNP Paribas London Branch and BNP Paribas Wealth Management London Branch are authorised by the CECEI and supervised by the Commission Bancaire. > > BNP Paribas London Branch is authorised and subject to limited regulation by the Financial Services Authority. Details about the extent of our authorisation and regulation by the Financial Services Authority are available from us on request. BNP Paribas is also a member of the London Stock Exchange. > > BNP Paribas Wealth Management London Branch is subject to limited regulation by the Financial Services Authority. Details about the extent of our authorisation and regulation by the Financial Services Authority are available from us on request. > > BNP Paribas Securities Services London Branch is authorised by the CECEI and supervised by the AMF, and subject to limited regulation by the Financial Services Authority. Details on the extent of our regulation by the Financial Services Authority are available from us on request. BNP Paribas Securities Services is also a member of the London Stock Exchange. > > BNP Paribas Trust Corporation UK Limited is registered in England and Wales (registered no. 4042668) at registered office 55 Moorgate, London EC2R 6PA. > > BNP Paribas UK Limited is registered in England and Wales (registered no. 1488108) at registered office 10 Harewood Avenue, London NW1 6AA. > > BNP Paribas Commodity Futures Limited is registered in England and Wales (registered no. 2391477) at registered office 10 Harewood Avenue, London NW1 6AA. > > BNP Paribas Asset Management UK Limited is registered in England and Wales (registered no. 2474627) at registered office 10 Harewood Avenue, London NW1 6AA. > > Investment Fund Services Limited is registered in England and Wales (registered no. 6110770) at registered office 55 Moorgate, London EC2R 6PA. > > BNP Paribas London Branch is registered in England and Wales (registered no. FC13447) at registered office 10 Harewood Avenue, London NW1 6AA. > > BNP Paribas Wealth Management London Branch is registered in England and Wales (registered no. FC023926) at registered office 10 Harewood Avenue, London NW1 6AA. > > BNP Paribas Securities Services London Branch is registered in England and Wales (registered no. BR006393) at registered office 55 Moorgate, London, EC2R 6PA. > > BNP Paribas Lease Group Plc is registered in England and Wales (registered no. 2341989) at registered office Northern Cross, Basing View, Basingstoke, Hampshire RG21 4HL. > ____________________________________________________________ > > -- DI (FH) Michael Friedrich michael.friedrich at univie.ac.at Tel: +43 1 4277 14359 Vienna University Computer Center Universitaetsstrasse 7 A-1010 Vienna, Austria ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sam.bancal at gmail.com Wed Feb 17 16:42:55 2010 From: sam.bancal at gmail.com (Samuel Bancal) Date: Wed, 17 Feb 2010 16:42:55 +0100 Subject: service notification when host is down Message-ID: <4fd99abf1002170742m2a7d6bb3y55abbfec20940e20@mail.gmail.com> Nagios Core 3.2.0 nagios-plugins-1.4.14 Ubuntu server 8.04.3 LTS Hi, I'm encountering problems to configure the notifications in case a server is no more responding to PING (ICMP). I don't understand why Nagios is jumping over steps when it's doing service-check "icmp". Here is the config : define host{ use generic-server host_name server1 alias server1 address the.ip.the.ip hostgroups prod-servers contact_groups group1 check_command check-host-alive check_period 24x7 check_interval 5 retry_interval 1 max_check_attempts 4 notification_period 24x7 notification_interval 60 notification_options d,u,r } define service{ use generic-service host_name server1 service_description ICMP check_command check_icmp!100.0,20%!500.0,60% max_check_attempts 4 normal_check_interval 5 retry_check_interval 1 notification_options w,u,c,r notification_interval 60 notification_period 24x7 } [...] define command{ command_name check-host-alive command_line $USER1$/check_ping -H $HOSTADDRESS$ -w 3000.0,80% -c 5000.0,100% -p 5 } define command{ command_name check_icmp command_line $USER1$/check_icmp -H $HOSTADDRESS$ -w $ARG1$ -c $ARG2$ -p 5 } [...] Here is an example of history that I get : [image: Service Critical][2010-02-16 11:33:13] SERVICE ALERT: server1;ICMP;CRITICAL;SOFT;1;CRITICAL - the.ip.the.ip: rta nan, lost 100% [image: Host Down][2010-02-16 11:33:43] HOST ALERT: server1;DOWN;SOFT;1;(Host Check Timed Out) [image: Service Critical][2010-02-16 11:34:13] SERVICE ALERT: server1;ICMP;CRITICAL;HARD;1;CRITICAL - the.ip.the.ip: rta nan, lost 100% [image: Host Down][2010-02-16 11:34:43] HOST ALERT: server1;DOWN;SOFT;2;(Host Check Timed Out) [image: Host Down][2010-02-16 11:35:23] HOST ALERT: server1;DOWN;SOFT;3;(Host Check Timed Out) [image: Host Down][2010-02-16 11:36:33] HOST ALERT: server1;DOWN;HARD;4;(Host Check Timed Out) [image: Host Up][2010-02-16 11:37:43] HOST ALERT: server1;UP;HARD;1;PING OK - Packet loss = 0%, RTA = 0.67 ms [image: Service Ok][2010-02-16 11:39:13] SERVICE ALERT: server1;ICMP;OK;HARD;1;OK - the.ip.the.ip: rta 0.943ms, lost 0% Or later : [image: Host Down][2010-02-16 11:42:03] HOST ALERT: server1;DOWN;SOFT;1;(Host Check Timed Out) [image: Host Down][2010-02-16 11:43:13] HOST ALERT: server1;DOWN;SOFT;2;(Host Check Timed Out) [image: Service Critical][2010-02-16 11:44:13] SERVICE ALERT: server1;ICMP;CRITICAL;HARD;1;CRITICAL - the.ip.the.ip: rta nan, lost 100% [image: Host Down][2010-02-16 11:44:43] HOST ALERT: server1;DOWN;SOFT;3;(Host Check Timed Out) [image: Host Up][2010-02-16 11:45:53] HOST ALERT: server1;UP;SOFT;4;PING OK - Packet loss = 0%, RTA = 0.64 ms [image: Service Ok][2010-02-16 11:49:13] SERVICE ALERT: server1;ICMP;OK;HARD;1;OK - the.ip.the.ip: rta 0.948ms, lost 0% Someone any clue? Regards, Samuel Bancal -- Samuel Bancal - CH -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From paul.willis at kent.pnn.police.uk Wed Feb 17 17:30:45 2010 From: paul.willis at kent.pnn.police.uk (Paul WILLIS PSE 55499) Date: Wed, 17 Feb 2010 16:30:45 +0000 Subject: CHECK_HTTP odd behaviour Message-ID: <4B7C19B5020000C4000154E7@kent.pnn.police.uk> I'm trying to run a check on a website we have that a) has a stupidly long generated path and b) insists it is available only to certain browers. If I run /usr/local/nagios/check_http -p 8000 -H some.host.co.uk -u "/sap/bc/webdynpro/sap/hrrcf_a_unreg_job_search?sap-wd-configId=ZUNREG_JOB_SEARCH&sap-ep-themeroot=/sap/public/bc/ur/customerthemes/sap_kp" -A "Mozilla/5.0 (Windows; U; Windows NT 5.2; en-US; rv:1.8.0.9) Gecko/20061206 Firefox/1.5.0.9" -R fs_QE2_00 it comes back fine and tells me that it can read the string fs_QE2_00 on the page OK. If I put exactly the same command into my command definition file the accompanying service simply goes red with a status definition of null. If I then drop the -A string it will now work, although it now gives the expected 500 error of browser not supported. I have tried simplifying the -A string but the web site requires pretty much all of it else it 500 errors. Has anyone any idea why the Agent string works when check_http is run directly but not through Nagios? Thanks Paul -- This email and any accompanying document(s) contain information from Kent Police, which is confidential or privileged. The information is intended to be for the exclusive use of the individual(s) or bodies to whom it is addressed. If you are not the intended recipient, be aware that any disclosure, copying, distribution or use of the contents of this information is prohibited. If you have received this email in error, please notify us immediately by contacting the sender or telephoning 01622 690690. The copyright in the contents of this email and any enclosure is the property of Kent Police and any unauthorised reproduction or disclosure is contrary to the provisions of the Copyright Designs and Patents Act 1998. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From paul.willis at kent.pnn.police.uk Wed Feb 17 17:45:47 2010 From: paul.willis at kent.pnn.police.uk (Paul WILLIS PSE 55499) Date: Wed, 17 Feb 2010 16:45:47 +0000 Subject: CHECK_HTTP odd behaviour Message-ID: <4B7C1D3B020000C4000154ED@kent.pnn.police.uk> Ah, forgot to add Nagios 3.2 Check_http v2053 (nagios-plugins 1.4.13) Running under Suse Linux 10.2 Paul -- This email and any accompanying document(s) contain information from Kent Police, which is confidential or privileged. The information is intended to be for the exclusive use of the individual(s) or bodies to whom it is addressed. If you are not the intended recipient, be aware that any disclosure, copying, distribution or use of the contents of this information is prohibited. If you have received this email in error, please notify us immediately by contacting the sender or telephoning 01622 690690. The copyright in the contents of this email and any enclosure is the property of Kent Police and any unauthorised reproduction or disclosure is contrary to the provisions of the Copyright Designs and Patents Act 1998. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jaiho8 at gmail.com Wed Feb 17 17:47:27 2010 From: jaiho8 at gmail.com (Jai Ram) Date: Wed, 17 Feb 2010 11:47:27 -0500 Subject: Nagios client plugins for RHEL5 Message-ID: <9cfff3f01002170847g2d2eadf7jd71e41ae85134b98@mail.gmail.com> Hello All: Can anyone pls help me to find the RHEL5 nagios client plugin bundle....we have been monitoring AIX/Sun boxes for which we have the tar balls that we extract to each client under nagio's home dir and we invoke the check.sh from the Nagios master server using ssh....appreciate if anyone can suggest similar pre-packaged scripts for RHEL. *Here is the listing of AIX client scripts...* /home/nagios/AIX_3_powerpc ? # ls -l 4.0K -rwxr-x--- 1 ops staff 2.3K Sep 28 2007 check.sh* 4.0K -rwxr-x--- 1 ops staff 2.3K Sep 28 2007 check.sh.AIX* 4.0K -rwxr-x--- 1 ops staff 2.2K Sep 26 2007 check_AIX.sh* 496K -rwxr-x--- 1 ops staff 494K Sep 26 2007 check_disk* 4.0K -rwxr-x--- 1 ops staff 1.6K Sep 28 2007 check_errpt.sh* 4.0K -rwxr-x--- 1 ops staff 3.2K Sep 26 2007 check_file_age.pl* 312K -rwxr-x--- 1 ops staff 312K Sep 26 2007 check_load* 4.0K -rwxr-x--- 1 ops staff 4.0K Sep 28 2007 check_lvm.pl* 4.0K -rwxr-x--- 1 ops staff 4.0K Sep 28 2007 check_lvm.pl.AIX* 20K -rwxr-x--- 1 ops staff 20K Sep 26 2007 check_mailq.pl* 8.0K -rwxr-x--- 1 ops staff 4.2K Sep 26 2007 check_mem.pl* 8.0K -rwxr-x--- 1 ops staff 4.2K Sep 26 2007 check_mem.pl.AIX* 348K -rwxr-x--- 1 ops staff 346K Sep 26 2007 check_procs* 4.0K -rwxr-x--- 1 ops staff 559 Sep 26 2007 check_procs.sh* 4.0K -rwxr-x--- 1 ops staff 2.6K Sep 27 2007 check_psrinfo.sh* 4.0K -rwxr-x--- 1 ops staff 2.6K Sep 27 2007 check_psrinfo.sh.AIX* 488K -rwxr-x--- 1 ops staff 485K Sep 26 2007 check_swap* 4.0K -rwxr-x--- 1 ops staff 2.1K Sep 26 2007 utils.pm* 4.0K -rwxr-x--- 1 ops staff 2.1K Sep 26 2007 utils.pm.AIX* 4.0K -rwxr-x--- 1 ops staff 859 Sep 26 2007 utils.sh* 4.0K -rwxr-x--- 1 ops staff 859 Sep 26 2007 utils.sh.AIX* -- Thanks Jai Ram. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Wed Feb 17 17:50:35 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Wed, 17 Feb 2010 08:50:35 -0800 Subject: CHECK_HTTP odd behaviour In-Reply-To: <4B7C19B5020000C4000154E7@kent.pnn.police.uk> References: <4B7C19B5020000C4000154E7@kent.pnn.police.uk> Message-ID: <4B7C1E5B.1060902@hp.com> Paul WILLIS PSE 55499 wrote: > I'm trying to run a check on a website we have that a) has a stupidly > long generated path and b) insists it is available only to certain > browers. > If I run /usr/local/nagios/check_http -p 8000 -H some.host.co.uk -u > "/sap/bc/webdynpro/sap/hrrcf_a_unreg_job_search?sap-wd-configId=ZUNREG_JOB_SEARCH&sap-ep-themeroot=/sap/public/bc/ur/customerthemes/sap_kp" > -A "Mozilla/5.0 (Windows; U; Windows NT 5.2; en-US; rv:1.8.0.9) > Gecko/20061206 Firefox/1.5.0.9" -R fs_QE2_00 > it comes back fine and tells me that it can read the string fs_QE2_00 > on the page OK. If I put exactly the same command into my command > definition file the accompanying service simply goes red with a status > definition of null. If I then drop the -A string it will now work, > although it now gives the expected 500 error of browser not supported. > I have tried simplifying the -A string but the web site requires > pretty much all of it else it 500 errors. Has anyone any idea why the > Agent string works when check_http is run directly but not through > Nagios? My guess is that you've got a quoting and/or special character issue in your check or service commands; there shouldn't really be any difference between running it from the command line and running it through Nagios in this respect. One relatively easy way to check that would be to run the check through a wrapper script that first echoes the command it's using to a log file, and then runs the command. My guess is that you'll find the command Nagios is running and the one you're running from the command line aren't the same. ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Wed Feb 17 17:52:53 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Wed, 17 Feb 2010 08:52:53 -0800 Subject: service notification when host is down In-Reply-To: <4fd99abf1002170742m2a7d6bb3y55abbfec20940e20@mail.gmail.com> References: <4fd99abf1002170742m2a7d6bb3y55abbfec20940e20@mail.gmail.com> Message-ID: <4B7C1EE5.1010007@hp.com> Samuel Bancal wrote: > Nagios Core 3.2.0 > nagios-plugins-1.4.14 > Ubuntu server 8.04.3 LTS > > Hi, > > I'm encountering problems to configure the notifications in case a > server is no more responding to PING (ICMP). > I don't understand why Nagios is jumping over steps when it's doing > service-check "icmp". > Here is the config : > > define host{ > use generic-server > host_name server1 > alias server1 > address the.ip.the.ip > hostgroups prod-servers > contact_groups group1 > check_command check-host-alive > check_period 24x7 > check_interval 5 > retry_interval 1 > max_check_attempts 4 > notification_period 24x7 > notification_interval 60 > notification_options d,u,r > } > > define service{ > use generic-service > host_name server1 > service_description ICMP > check_command check_icmp!100.0,20%!500.0,60% > max_check_attempts 4 > normal_check_interval 5 > retry_check_interval 1 > notification_options w,u,c,r > notification_interval 60 > notification_period 24x7 > } > [...] > define command{ > command_name check-host-alive > command_line $USER1$/check_ping -H $HOSTADDRESS$ -w 3000.0,80% -c > 5000.0,100% -p 5 > } > define command{ > command_name check_icmp > command_line $USER1$/check_icmp -H $HOSTADDRESS$ -w $ARG1$ -c > $ARG2$ -p 5 > } > [...] > > Here is an example of history that I get : > Service Critical[2010-02-16 11:33:13] SERVICE ALERT: > server1;ICMP;CRITICAL;SOFT;1;CRITICAL - the.ip.the.ip: rta nan, lost 100% > Host Down[2010-02-16 11:33:43] HOST ALERT: server1;DOWN;SOFT;1;(Host > Check Timed Out) > Service Critical[2010-02-16 11:34:13] SERVICE ALERT: > server1;ICMP;CRITICAL;HARD;1;CRITICAL - the.ip.the.ip: rta nan, lost 100% > Host Down[2010-02-16 11:34:43] HOST ALERT: server1;DOWN;SOFT;2;(Host > Check Timed Out) > Host Down[2010-02-16 11:35:23] HOST ALERT: server1;DOWN;SOFT;3;(Host > Check Timed Out) > Host Down[2010-02-16 11:36:33] HOST ALERT: server1;DOWN;HARD;4;(Host > Check Timed Out) > Host Up[2010-02-16 11:37:43] HOST ALERT: server1;UP;HARD;1;PING OK - > Packet loss = 0%, RTA = 0.67 ms > Service Ok[2010-02-16 11:39:13] SERVICE ALERT: > server1;ICMP;OK;HARD;1;OK - the.ip.the.ip: rta 0.943ms, lost 0% > > Or later : > Host Down[2010-02-16 11:42:03] HOST ALERT: server1;DOWN;SOFT;1;(Host > Check Timed Out) > Host Down[2010-02-16 11:43:13] HOST ALERT: server1;DOWN;SOFT;2;(Host > Check Timed Out) > Service Critical[2010-02-16 11:44:13] SERVICE ALERT: > server1;ICMP;CRITICAL;HARD;1;CRITICAL - the.ip.the.ip: rta nan, lost 100% > Host Down[2010-02-16 11:44:43] HOST ALERT: server1;DOWN;SOFT;3;(Host > Check Timed Out) > Host Up[2010-02-16 11:45:53] HOST ALERT: server1;UP;SOFT;4;PING OK - > Packet loss = 0%, RTA = 0.64 ms > Service Ok[2010-02-16 11:49:13] SERVICE ALERT: > server1;ICMP;OK;HARD;1;OK - the.ip.the.ip: rta 0.948ms, lost 0% If you're asking why Nagios runs a host check when it sees the service fail a check, that's normal behavior. When a service check fails, the first thing Nagios will do is look to see if the service failed because the host is down. ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nigel.leach at uk.bnpparibas.com Wed Feb 17 17:56:50 2010 From: nigel.leach at uk.bnpparibas.com (nigel.leach at uk.bnpparibas.com) Date: Wed, 17 Feb 2010 16:56:50 +0000 Subject: ndomod: Could not open data sink Message-ID: all working now, worryingly I resorted to the Microsoft solution, and rebooted the server. Everything came up sweet, and mysql is now being populated. No idea why really, but hey ho, hopefully it will remain stable. Now need to get Nagvis working. Many thanks again for all your help. Regards, Nigel Nigel Leach Grid Engineer | Front Office Risk Systems | BNP Paribas ___________________________________________________________ This communication is confidential, may be privileged and is meant only for the intended recipient. If you are not the intended recipient, please notify the sender by reply and delete the message from your system. Any unauthorised dissemination, distribution or copying hereof is prohibited. BNP Paribas Trust Corporation UK Limited, BNP Paribas UK Limited, BNP Paribas Commodity Futures Limited, BNP Paribas Asset Management UK Limited and Investment Fund Services Limited are authorised and regulated by the Financial Services Authority. BNP Paribas London Branch and BNP Paribas Wealth Management London Branch are authorised by the CECEI and supervised by the Commission Bancaire. BNP Paribas London Branch is authorised and subject to limited regulation by the Financial Services Authority. Details about the extent of our authorisation and regulation by the Financial Services Authority are available from us on request. BNP Paribas is also a member of the London Stock Exchange. BNP Paribas Wealth Management London Branch is subject to limited regulation by the Financial Services Authority. Details about the extent of our authorisation and regulation by the Financial Services Authority are available from us on request. BNP Paribas Securities Services London Branch is authorised by the CECEI and supervised by the AMF, and subject to limited regulation by the Financial Services Authority. Details on the extent of our regulation by the Financial Services Authority are available from us on request. BNP Paribas Securities Services is also a member of the London Stock Exchange. BNP Paribas Trust Corporation UK Limited is registered in England and Wales (registered no. 4042668) at registered office 55 Moorgate, London EC2R 6PA. BNP Paribas UK Limited is registered in England and Wales (registered no. 1488108) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Commodity Futures Limited is registered in England and Wales (registered no. 2391477) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Asset Management UK Limited is registered in England and Wales (registered no. 2474627) at registered office 10 Harewood Avenue, London NW1 6AA. Investment Fund Services Limited is registered in England and Wales (registered no. 6110770) at registered office 55 Moorgate, London EC2R 6PA. BNP Paribas London Branch is registered in England and Wales (registered no. FC13447) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Wealth Management London Branch is registered in England and Wales (registered no. FC023926) at registered office 10 Harewood Avenue, London NW1 6AA. BNP Paribas Securities Services London Branch is registered in England and Wales (registered no. BR006393) at registered office 55 Moorgate, London, EC2R 6PA. BNP Paribas Lease Group Plc is registered in England and Wales (registered no. 2341989) at registered office Northern Cross, Basing View, Basingstoke, Hampshire RG21 4HL. ____________________________________________________________ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From reachtarak at hotmail.com Wed Feb 17 18:36:41 2010 From: reachtarak at hotmail.com (Digital Edge) Date: Wed, 17 Feb 2010 23:06:41 +0530 Subject: Help required Message-ID: Hi List, It will be really helpful if i can get any response on my below mentioned query..... I have an URL , say.. http://www.example.com/sigin.jsf , After login in, it'll redirect to "https://www.example1.com/ddo/get_sec_pwd.php" ; here another authentication will happen, then it'll come to an URL "https://www.example1.com/home/home.jsf. Inside that page I have several other Tabs.. -------------------- Home|Home1|Home2 -------------------- all the tabs can be navigate & viewable after successful login of 2nd time . And can be accessible within that session only. Can we monitor those URLS response time without loosing the session , one by one in Nagios...... Please let me know , if I not clear with the scenario. /\ Ricky _________________________________________________________________ Hotmail: Trusted email with powerful SPAM protection. https://signup.live.com/signup.aspx?id=60969 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jrangi at automotive.com Wed Feb 17 18:24:57 2010 From: jrangi at automotive.com (Rangi, Jai) Date: Wed, 17 Feb 2010 09:24:57 -0800 Subject: Nagios client plugins for RHEL5 In-Reply-To: <9cfff3f01002170847g2d2eadf7jd71e41ae85134b98@mail.gmail.com> References: <9cfff3f01002170847g2d2eadf7jd71e41ae85134b98@mail.gmail.com> Message-ID: You can compile and build from source for RHEL*. -Jai From: Jai Ram [mailto:jaiho8 at gmail.com] Sent: Wednesday, February 17, 2010 8:47 AM To: Nagios Users Mailinglist Subject: [Nagios-users] Nagios client plugins for RHEL5 Hello All: Can anyone pls help me to find the RHEL5 nagios client plugin bundle....we have been monitoring AIX/Sun boxes for which we have the tar balls that we extract to each client under nagio's home dir and we invoke the check.sh from the Nagios master server using ssh....appreciate if anyone can suggest similar pre-packaged scripts for RHEL. Here is the listing of AIX client scripts... /home/nagios/AIX_3_powerpc * # ls -l 4.0K -rwxr-x--- 1 ops staff 2.3K Sep 28 2007 check.sh* 4.0K -rwxr-x--- 1 ops staff 2.3K Sep 28 2007 check.sh.AIX* 4.0K -rwxr-x--- 1 ops staff 2.2K Sep 26 2007 check_AIX.sh* 496K -rwxr-x--- 1 ops staff 494K Sep 26 2007 check_disk* 4.0K -rwxr-x--- 1 ops staff 1.6K Sep 28 2007 check_errpt.sh* 4.0K -rwxr-x--- 1 ops staff 3.2K Sep 26 2007 check_file_age.pl* 312K -rwxr-x--- 1 ops staff 312K Sep 26 2007 check_load* 4.0K -rwxr-x--- 1 ops staff 4.0K Sep 28 2007 check_lvm.pl* 4.0K -rwxr-x--- 1 ops staff 4.0K Sep 28 2007 check_lvm.pl.AIX* 20K -rwxr-x--- 1 ops staff 20K Sep 26 2007 check_mailq.pl* 8.0K -rwxr-x--- 1 ops staff 4.2K Sep 26 2007 check_mem.pl* 8.0K -rwxr-x--- 1 ops staff 4.2K Sep 26 2007 check_mem.pl.AIX* 348K -rwxr-x--- 1 ops staff 346K Sep 26 2007 check_procs* 4.0K -rwxr-x--- 1 ops staff 559 Sep 26 2007 check_procs.sh* 4.0K -rwxr-x--- 1 ops staff 2.6K Sep 27 2007 check_psrinfo.sh* 4.0K -rwxr-x--- 1 ops staff 2.6K Sep 27 2007 check_psrinfo.sh.AIX* 488K -rwxr-x--- 1 ops staff 485K Sep 26 2007 check_swap* 4.0K -rwxr-x--- 1 ops staff 2.1K Sep 26 2007 utils.pm* 4.0K -rwxr-x--- 1 ops staff 2.1K Sep 26 2007 utils.pm.AIX* 4.0K -rwxr-x--- 1 ops staff 859 Sep 26 2007 utils.sh* 4.0K -rwxr-x--- 1 ops staff 859 Sep 26 2007 utils.sh.AIX* -- Thanks Jai Ram. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pepper at reppep.com Wed Feb 17 18:34:53 2010 From: pepper at reppep.com (Chris Pepper) Date: Wed, 17 Feb 2010 12:34:53 -0500 Subject: Escalation in Host / Service Definitions? Message-ID: <4B7C28BD.6040405@reppep.com> To keep things simple, we define contacts in our host templates. This means that adding a simple host is as simple as copying & pasting a short host definition and changing the name. I'd like to do the same for escalations -- sending to the same people's cellphones via email, at least for now. We definitely don't want to add a hostname to hostnameescalation & serviceescalation definitions every time we add a host (in another file). Is there any particular reason hosts can define contacts but not escalations? This applies equally to services, of course. For now, I will define hostgroups inside serviceescalation, but I'm not sure how that will affect additional contacts (added with +user in individual host & hostgroup definitions). Alternatively, if there's a simple way to start with email notifications and then automatically start pager notifications (for those contacts with pager: defined) after an interval, that would work as well and be even simpler. I don't see anything built in for deferring pages until a few emails have gone unanswered, though. Thanks! Chris Pepper > define host { > name linux-server > use generic-host > check_period 24x7 > check_interval 5 > retry_interval 1 > max_check_attempts 10 > check_command check-host-alive > notification_period 24x7 > notification_interval 30 > notification_options d,u,r > contact_groups admins > register 0 > hostgroups linux-servers, ssh-disks-monitored, ssh-load-monitored > } -- Chris Pepper: ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rms at sibs.pt Wed Feb 17 19:01:12 2010 From: rms at sibs.pt (Rui Miguel Silva Seabra) Date: Wed, 17 Feb 2010 18:01:12 +0000 Subject: Nagios client plugins for RHEL5 In-Reply-To: <9cfff3f01002170847g2d2eadf7jd71e41ae85134b98@mail.gmail.com> References: <9cfff3f01002170847g2d2eadf7jd71e41ae85134b98@mail.gmail.com> Message-ID: <1266429672.2423.23.camel@localhost> Qua, 2010-02-17 ?s 11:47 -0500, Jai Ram escreveu: > > Hello All: > > Can anyone pls help me to find the RHEL5 nagios client plugin > bundle....we have been monitoring AIX/Sun boxes for which we have the http://download.fedora.redhat.com/pub/epel/ have fun. Rui ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dd-b at dd-b.net Wed Feb 17 20:45:17 2010 From: dd-b at dd-b.net (David Dyer-Bennet) Date: Wed, 17 Feb 2010 13:45:17 -0600 Subject: check_snmp disk space monitoring Message-ID: I'm playing with using check_snmp to look at disk space, with commands (working from the command line so far) like: /usr/lib/nagios/plugins/check_snmp -C public -P 2c -H localhost -o dskErrorMsg.2 -r '^\s*$' ([[:space:]] doesn't work any better than \s either) I'm trying to monitor the error message rather than the simple flag so that the data returned will include the error when one is found. I'm trying to use the regex capability to to match an empty error message; so that anything non-empty will be reported as an error. And I'm not getting anywhere. I'm mostly pretty good with regexps, but despite claiming in --help to support "extended regular expressions", it doesn't seem to. In particular the "^" for beginning of text and "$" for end of text don't seem to be working. Clues! Clues for the poor! Is this a basically stupid approach, by the way? Oh, and how does -r work with multiply OIDs in -o? What's the syntax for providing multiple -r values, and what happens if you only provide one? -- David Dyer-Bennet, dd-b at dd-b.net; http://dd-b.net/ Snapshots: http://dd-b.net/dd-b/SnapshotAlbum/data/ Photos: http://dd-b.net/photography/gallery/ Dragaera: http://dragaera.info ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From Kenny.Xu at soros.com Wed Feb 17 20:44:41 2010 From: Kenny.Xu at soros.com (Xu, Kenny) Date: Wed, 17 Feb 2010 14:44:41 -0500 Subject: NRPE: command not defined Message-ID: <8284A6582758FC4FAFEC8BD5E9C11C06106FF886@NYC-SOR-EXCH-06.SorosFunds.com> Hi, All, I ran into a problem on defining a specific check. When configured to run from nagios monitoring server (Linux) it returns: NRPE: Command 'check_halog' not defined But it is defined clearly in /usr/local/nagios/etc/nrpe.cfg file as: command[check_halog]=/usr/local/nagios/libexec/check_log -F /var/VRTSvcs/log/engine_A.log -O /usr/local/nagios/var/engine_A.log -q "FAULTED|CRITICAL" I can run it manual as nagios from the server fine: [nagios at nagiossvr ~]$ /usr/local/nagios/libexec/check_nrpe -H -c check_halog Log check ok - 0 pattern matches found The client is running x86 Solaris 10. Any idea? Appreciate your help. Kenny -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From alexdehaini at gmail.com Wed Feb 17 21:05:19 2010 From: alexdehaini at gmail.com (Alex Dehaini) Date: Wed, 17 Feb 2010 12:05:19 -0800 Subject: NRPE on macs Message-ID: <4b008f7d1002171205o251dd79s1b4fcf1a1d1cc0b1@mail.gmail.com> Hi Guys, Quick question - is NRPE bi-directional or uni-directional. By that, I mean when NRPE is installed on a client - does it push it output to the nagios server or does nagios pull the results from the client? Cheers, Alex -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From alexus at gmail.com Wed Feb 17 21:25:48 2010 From: alexus at gmail.com (alexus) Date: Wed, 17 Feb 2010 15:25:48 -0500 Subject: NRPE on macs In-Reply-To: <4b008f7d1002171205o251dd79s1b4fcf1a1d1cc0b1@mail.gmail.com> References: <4b008f7d1002171205o251dd79s1b4fcf1a1d1cc0b1@mail.gmail.com> Message-ID: <6ae50c2d1002171225w379f225y10bf093e7d891526@mail.gmail.com> NRPE works only one direction. you make a request through check_nrpe (remotely) to NRPE daemon that runs wherever (in your case on mac server) if you looking sort of passing checking look into Nagios NSCA On Wed, Feb 17, 2010 at 3:05 PM, Alex Dehaini wrote: > Hi Guys, > > Quick question - is NRPE bi-directional or uni-directional. By that, I mean > when NRPE is installed on a client - does it push it output to the nagios > server or does nagios pull the results from the client? > > Cheers, > Alex > > > ------------------------------------------------------------------------------ > SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, > Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW > http://p.sf.net/sfu/solaris-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting > any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- http://alexus.org/ ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Wed Feb 17 21:27:34 2010 From: jim at jimavery.me.uk (Jim Avery) Date: Wed, 17 Feb 2010 20:27:34 +0000 Subject: Help required In-Reply-To: References: Message-ID: <765d77c81002171227h526f8203m4b50103925335178@mail.gmail.com> On 17 February 2010 17:36, Digital Edge wrote: > Hi List, > > It will be really helpful if i can get any response on my below mentioned > query..... > > I have an URL , say.. > > http://www.example.com/sigin.jsf , After login in, it'll redirect to > "https://www.example1.com/ddo/get_sec_pwd.php" ; here another authentication > will happen, then it'll come to an URL > "https://www.example1.com/home/home.jsf. Inside that page I have several > other Tabs.. > -------------------- > Home|Home1|Home2 > -------------------- > all the tabs can be navigate & viewable after successful login of 2nd time . > And can be accessible within that session only. > > Can we monitor those URLS response time without loosing the session , one by > one in Nagios...... Not in Nagios itself, no, but I expect you could use WebInject http://www.webinject.org/ to do the web querying and timing and feed the results back to Nagios. hth, Jim ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Feb 17 21:33:18 2010 From: marc at ena.com (Marc Powell) Date: Wed, 17 Feb 2010 14:33:18 -0600 Subject: NRPE: command not defined In-Reply-To: <8284A6582758FC4FAFEC8BD5E9C11C06106FF886@NYC-SOR-EXCH-06.SorosFunds.com> References: <8284A6582758FC4FAFEC8BD5E9C11C06106FF886@NYC-SOR-EXCH-06.SorosFunds.com> Message-ID: On Feb 17, 2010, at 1:44 PM, Xu, Kenny wrote: > Hi, All, > > I ran into a problem on defining a specific check. When configured to run from nagios monitoring server (Linux) it returns: > > NRPE: Command 'check_halog' not defined > > But it is defined clearly in /usr/local/nagios/etc/nrpe.cfg file as: > > command[check_halog]=/usr/local/nagios/libexec/check_log -F /var/VRTSvcs/log/engine_A.log -O /usr/local/nagios/var/engine_A.log -q "FAULTED|CRITICAL" > > I can run it manual as nagios from the server fine: > > [nagios at nagiossvr ~]$ /usr/local/nagios/libexec/check_nrpe -H -c check_halog > Log check ok - 0 pattern matches found > > The client is running x86 Solaris 10. Any idea? You need to post the service{} and command{} definition that's not working. It's seems clear that something is different there than what you're testing from the command line. Are you sure you're checking the same client? -- Marc ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Wed Feb 17 21:40:11 2010 From: jim at jimavery.me.uk (Jim Avery) Date: Wed, 17 Feb 2010 20:40:11 +0000 Subject: check_snmp disk space monitoring In-Reply-To: References: Message-ID: <765d77c81002171240s17a71abw8c85c46790c36278@mail.gmail.com> On 17 February 2010 19:45, David Dyer-Bennet wrote: > > I'm playing with using check_snmp to look at disk space, with commands > (working from the command line so far) like: > > /usr/lib/nagios/plugins/check_snmp -C public -P 2c -H localhost -o > dskErrorMsg.2 -r '^\s*$' > > ([[:space:]] doesn't work any better than \s either) > > I'm trying to monitor the error message rather than the simple flag so > that the data returned will include the error when one is found. ?I'm > trying to use the regex capability to to match an empty error message; so > that anything non-empty will be reported as an error. > > And I'm not getting anywhere. ?I'm mostly pretty good with regexps, but > despite claiming in --help to support "extended regular expressions", it > doesn't seem to. ?In particular the "^" for beginning of text and "$" for > end of text don't seem to be working. > > Clues! ?Clues for the poor! > > Is this a basically stupid approach, by the way? > > Oh, and how does -r work with multiply OIDs in -o? ?What's the syntax for > providing multiple -r values, and what happens if you only provide one? I think you're maybe a bit ambitious using check_snmp for disk space. I recommend you use check_snmp_storage.pl which you will find at http://nagios.manubulon.com/snmp_storage.html hth, Jim ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From alexdehaini at gmail.com Wed Feb 17 21:44:56 2010 From: alexdehaini at gmail.com (Alex Dehaini) Date: Wed, 17 Feb 2010 12:44:56 -0800 Subject: NRPE on macs In-Reply-To: <6ae50c2d1002171225w379f225y10bf093e7d891526@mail.gmail.com> References: <4b008f7d1002171205o251dd79s1b4fcf1a1d1cc0b1@mail.gmail.com> <6ae50c2d1002171225w379f225y10bf093e7d891526@mail.gmail.com> Message-ID: <4b008f7d1002171244k3192a8d6id7c73f47a8b5f1ee@mail.gmail.com> So, The request is from the nagios server to the client running nrpe i.e pulling the information from the nrpe client? Thanks Alex On Wed, Feb 17, 2010 at 12:25 PM, alexus wrote: > NRPE works only one direction. > > you make a request through check_nrpe (remotely) to NRPE daemon that > runs wherever (in your case on mac server) > > if you looking sort of passing checking look into Nagios NSCA > > > On Wed, Feb 17, 2010 at 3:05 PM, Alex Dehaini > wrote: > > Hi Guys, > > > > Quick question - is NRPE bi-directional or uni-directional. By that, I > mean > > when NRPE is installed on a client - does it push it output to the nagios > > server or does nagios pull the results from the client? > > > > Cheers, > > Alex > > > > > > > ------------------------------------------------------------------------------ > > SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, > > Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW > > http://p.sf.net/sfu/solaris-dev2dev > > _______________________________________________ > > Nagios-users mailing list > > Nagios-users at lists.sourceforge.net > > https://lists.sourceforge.net/lists/listinfo/nagios-users > > ::: Please include Nagios version, plugin version (-v) and OS when > reporting > > any issue. > > ::: Messages without supporting info will risk being sent to /dev/null > > > > > > -- > http://alexus.org/ > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From peter at berghold.net Wed Feb 17 21:35:04 2010 From: peter at berghold.net (Peter L. Berghold) Date: Wed, 17 Feb 2010 15:35:04 -0500 Subject: Proactive disk checks? Message-ID: <1266438904.18048.2.camel@localhost.localdomain> Hi folks, Has anybody out there written a check for Nagios that detects a failing disk? A number of years ago I wrote a shell script that ran from a crontab that just checked latency on JBOD disks as a rough way of determining a drive that was about to fail. Platform I want to run these checks on is Fedora Core 12 and the iostat command does not behave the same way it did on a Sun under Solaris 8 OS. -- Peter L. Berghold, Australian Cattle Dog Owner, Agility Fan, Foodie, Salty Old Dog and Old School Unix Hacker. Skype: cowdawg "Those who fail to learn from history are condemned to repeat it" ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Feb 17 21:58:42 2010 From: marc at ena.com (Marc Powell) Date: Wed, 17 Feb 2010 14:58:42 -0600 Subject: NRPE on macs In-Reply-To: <4b008f7d1002171244k3192a8d6id7c73f47a8b5f1ee@mail.gmail.com> References: <4b008f7d1002171205o251dd79s1b4fcf1a1d1cc0b1@mail.gmail.com> <6ae50c2d1002171225w379f225y10bf093e7d891526@mail.gmail.com> <4b008f7d1002171244k3192a8d6id7c73f47a8b5f1ee@mail.gmail.com> Message-ID: <700682A1-2BDC-400B-AC08-DAAFD316C7B2@ena.com> On Feb 17, 2010, at 2:44 PM, Alex Dehaini wrote: > So, > > The request is from the nagios server to the client running nrpe i.e pulling the information from the nrpe client? http://nagios.sourceforge.net/docs/3_0/addons.html#nrpe and http://nagios.sourceforge.net/docs/nrpe/NRPE.pdf -- Marc ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From alexdehaini at gmail.com Wed Feb 17 22:04:17 2010 From: alexdehaini at gmail.com (Alex Dehaini) Date: Wed, 17 Feb 2010 13:04:17 -0800 Subject: NRPE on macs In-Reply-To: <700682A1-2BDC-400B-AC08-DAAFD316C7B2@ena.com> References: <4b008f7d1002171205o251dd79s1b4fcf1a1d1cc0b1@mail.gmail.com> <6ae50c2d1002171225w379f225y10bf093e7d891526@mail.gmail.com> <4b008f7d1002171244k3192a8d6id7c73f47a8b5f1ee@mail.gmail.com> <700682A1-2BDC-400B-AC08-DAAFD316C7B2@ena.com> Message-ID: <4b008f7d1002171304h509a3b65l26be02881551dbe@mail.gmail.com> thanks all On Wed, Feb 17, 2010 at 12:58 PM, Marc Powell wrote: > > On Feb 17, 2010, at 2:44 PM, Alex Dehaini wrote: > > > So, > > > > The request is from the nagios server to the client running nrpe i.e > pulling the information from the nrpe client? > > http://nagios.sourceforge.net/docs/3_0/addons.html#nrpe > > and > > http://nagios.sourceforge.net/docs/nrpe/NRPE.pdf > > -- > Marc > > ------------------------------------------------------------------------------ > SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, > Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW > http://p.sf.net/sfu/solaris-dev2dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Feb 17 22:11:19 2010 From: marc at ena.com (Marc Powell) Date: Wed, 17 Feb 2010 15:11:19 -0600 Subject: Proactive disk checks? In-Reply-To: <1266438904.18048.2.camel@localhost.localdomain> References: <1266438904.18048.2.camel@localhost.localdomain> Message-ID: <7ADCAC25-EBE3-4533-BA68-1BCA57F72734@ena.com> On Feb 17, 2010, at 2:35 PM, Peter L. Berghold wrote: > Hi folks, > > Has anybody out there written a check for Nagios that detects a failing > disk? If the disks support SMART and it is enabled, you could use check_smart.pl included in the contrib of the standard plugins. -- Marc ------------------------------------------------------------------------------ SOLARIS 10 is the OS for Data Centers - provides features such as DTrace, Predictive Self Healing and Award Winning ZFS. Get Solaris 10 NOW http://p.sf.net/sfu/solaris-dev2dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ron at tvnz.co.nz Thu Feb 18 00:14:51 2010 From: ron at tvnz.co.nz (Ron Wilson) Date: Thu, 18 Feb 2010 12:14:51 +1300 Subject: Editing the nagios Side bar Message-ID: <7D85034F6BBBD540AF52C59CED3AA1E801C71288@AKVXCH02.tvnzad.tvnz.co.nz> I have 6 groups set up holding servers being patched on each day. I would like an entry in the Nagios sidebar that says patching which would then give a web page view of the six patching groups on one page. This makes it easier for admins to disable notifications for a large number of servers with one click. Because we have so many groups it would be easier to have the Patching days on one page. However while I can create a url for one days Patching in the new page I cannot get all six. This is my php code
  • /status.cgi?hostgroup=Patch_Day1&style=overvi ew" target="">Patch Day1
  • This works fine but how can I get the other 5 Patch Groups in that line. I need something like Patch_Day* but such a command does not work with php. Anyone got some ideas? ________________________________ -- Ron Wilson Systems Engineer Television New Zealand P.O. Box 3819 Auckland, New Zealand Phone 649-916-7560 'Tis an ill wind that blows no minds. ________________________________ ========================================================== For more information on the Television New Zealand Group, visit us online at tvnz.co.nz ========================================================== CAUTION: This e-mail and any attachment(s) contain information that is intended to be read only by the named recipient(s). This information is not to be used or stored by any other person and/or organisation. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From BChan at Shawcor.com Thu Feb 18 00:49:18 2010 From: BChan at Shawcor.com (Brian Chan) Date: Wed, 17 Feb 2010 18:49:18 -0500 Subject: AUTO: Chan, Brian is out of the office. (returning 02/18/2010) Message-ID: I am out of the office until 02/18/2010. I will respond to your message when I return. Note: This is an automated response to your message "Nagios-users Digest, Vol 45, Issue 13" sent on 2/17/10 15:25:58. This is the only notification you will receive while this person is away. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From malik_khurram at hotmail.com Thu Feb 18 02:39:43 2010 From: malik_khurram at hotmail.com (Khurram Malik) Date: Thu, 18 Feb 2010 01:39:43 +0000 Subject: MACRO PROBLEM Message-ID: Hi I am using Nagios 3.0.6 and in an integration project i want Nagios to send alerts to Netcool. I am using Host/Service Global Event Handlers. I am able to get the maximum information via the following macros $SERVICESTATE$ $SERVICESTATETYPE$ $SERVICEATTEMPT$ $SERVICEDESC$ But i also want some other info via macros and i am using the following link to see if macro is enabled or disabled http://nagios.sourceforge.net/docs/3_0/macrolist.html#hostoutput I want to get CONTACTEMAIL and CONTACTPAGER contents but these macros are disabled with Global Host/Service handler, what is the easiest way to get info for the conact macros with Global Event Handlers. I can see $CONTACTGROUPMEMBERS$ is enabled with Global Event Handlers but I am unable to get any value, seems like a bug. Regards Khurram Malik -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lylexryan at yahoo.com Thu Feb 18 03:29:43 2010 From: lylexryan at yahoo.com (Lylex Ryan) Date: Wed, 17 Feb 2010 18:29:43 -0800 (PST) Subject: Can I run both Nagios V2 and V3 in parallel while I migrate? Message-ID: <788699.27345.qm@web63002.mail.re1.yahoo.com> In upgrading from nagios (v2) to nagios3, I'd like to do a fresh install of nagios3 and start with a clean sheet of? (config) "paper".? But can I do this while V2 is running production? Since the packages have different names, I thought it might work.? But they probably would both have /etc/nagios and other default directories in common.? Maybe if I installed from the tar-ball, I could specify new directories for V3, but I'm also trying to avoid that learning-process and use a pre-packaged rpm. Maybe installing V3 on a different server all-together, then moving it to the production machine would be a way. Any advice is appreciated.? Thanks...Lyle -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From reachtarak at hotmail.com Thu Feb 18 08:06:17 2010 From: reachtarak at hotmail.com (Digital Edge) Date: Thu, 18 Feb 2010 12:36:17 +0530 Subject: Help required In-Reply-To: <765d77c81002171227h526f8203m4b50103925335178@mail.gmail.com> References: , <765d77c81002171227h526f8203m4b50103925335178@mail.gmail.com> Message-ID: > Date: Wed, 17 Feb 2010 20:27:34 +0000 > Subject: Re: [Nagios-users] Help required > From: jim at jimavery.me.uk > To: reachtarak at hotmail.com > CC: nagios-users at lists.sourceforge.net > > On 17 February 2010 17:36, Digital Edge wrote: > > Hi List, > > > > It will be really helpful if i can get any response on my below mentioned > > query..... > > > > I have an URL , say.. > > > > http://www.example.com/sigin.jsf , After login in, it'll redirect to > > "https://www.example1.com/ddo/get_sec_pwd.php" ; here another authentication > > will happen, then it'll come to an URL > > "https://www.example1.com/home/home.jsf. Inside that page I have several > > other Tabs.. > > -------------------- > > Home|Home1|Home2 > > -------------------- > > all the tabs can be navigate & viewable after successful login of 2nd time . > > And can be accessible within that session only. > > > > Can we monitor those URLS response time without loosing the session , one by > > one in Nagios...... > > Not in Nagios itself, no, but I expect you could use WebInject > http://www.webinject.org/ to do the web querying and timing and feed > the results back to Nagios. > > hth, > > Jim Hi , Yes; even I have tried also. The issue what i'm facing is after successful authentication checking , I'm unable to navigate through those links. All the tests are passing except case4. I am not able to understand why it's happening. can anyone help me on this ..... /\ Ricky _________________________________________________________________ Hotmail: Powerful Free email with security by Microsoft. https://signup.live.com/signup.aspx?id=60969 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Thu Feb 18 08:24:59 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Wed, 17 Feb 2010 23:24:59 -0800 Subject: Can I run both Nagios V2 and V3 in parallel while I migrate? In-Reply-To: <788699.27345.qm@web63002.mail.re1.yahoo.com> References: <788699.27345.qm@web63002.mail.re1.yahoo.com> Message-ID: <4B7CEB4B.1000600@hp.com> Lylex Ryan wrote: > In upgrading from nagios (v2) to nagios3, I'd like to do a fresh > install of nagios3 and start with a clean sheet of (config) "paper". > But can I do this while V2 is running production? > > Since the packages have different names, I thought it might work. But > they probably would both have /etc/nagios and other default > directories in common. Maybe if I installed from the tar-ball, I > could specify new directories for V3, but I'm also trying to avoid > that learning-process and use a pre-packaged rpm. > > Maybe installing V3 on a different server all-together, then moving it > to the production machine would be a way. > > Any advice is appreciated. Thanks...Lyle > > > > Might work, might not. Really depends on your config, and how the packages are laid out. You've given us details about neither, so "maybe" is about the best answer I can give you. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Thu Feb 18 08:38:12 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Wed, 17 Feb 2010 23:38:12 -0800 Subject: MACRO PROBLEM In-Reply-To: References: Message-ID: <4B7CEE63.8090007@hp.com> Khurram Malik wrote: > Hi > > I am using Nagios 3.0.6 and in an integration project i want Nagios to > send alerts to Netcool. I am using Host/Service Global Event Handlers. > I am able to get the maximum information via the following macros > > $SERVICESTATE$ $SERVICESTATETYPE$ $SERVICEATTEMPT$ $SERVICEDESC$ > > But i also want some other info via macros and i am using the > following link to see if macro is enabled or disabled > http://nagios.sourceforge.net/docs/3_0/macrolist.html#hostoutput > > I want to get CONTACTEMAIL and CONTACTPAGER contents but these macros > are disabled with Global Host/Service handler, what is the easiest way > to get info for the conact macros with Global Event Handlers. I can > see $_CONTACTGROUPMEMBERS$_ is enabled with Global Event Handlers but > I am unable to get any value, seems like a bug. This is not a bug. These macros are not available with event handlers, since eventhandlers do not have contacts associated with them. If you look at the matrix on tha page you linked, you'll see that CONTACTEMAIL and CONTACTPAGER work only with host and service notifications. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Thu Feb 18 08:48:08 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Wed, 17 Feb 2010 23:48:08 -0800 Subject: MACRO PROBLEM In-Reply-To: <4B7CEE63.8090007@hp.com> References: <4B7CEE63.8090007@hp.com> Message-ID: <4B7CF0B8.8070805@hp.com> Morris, Patrick wrote: > Khurram Malik wrote: > >> Hi >> >> I am using Nagios 3.0.6 and in an integration project i want Nagios to >> send alerts to Netcool. I am using Host/Service Global Event Handlers. >> I am able to get the maximum information via the following macros >> >> $SERVICESTATE$ $SERVICESTATETYPE$ $SERVICEATTEMPT$ $SERVICEDESC$ >> >> But i also want some other info via macros and i am using the >> following link to see if macro is enabled or disabled >> http://nagios.sourceforge.net/docs/3_0/macrolist.html#hostoutput >> >> I want to get CONTACTEMAIL and CONTACTPAGER contents but these macros >> are disabled with Global Host/Service handler, what is the easiest way >> to get info for the conact macros with Global Event Handlers. I can >> see $_CONTACTGROUPMEMBERS$_ is enabled with Global Event Handlers but >> I am unable to get any value, seems like a bug. >> > > This is not a bug. These macros are not available with event handlers, > since eventhandlers do not have contacts associated with them. If you > look at the matrix on tha page you linked, you'll see that CONTACTEMAIL > and CONTACTPAGER work only with host and service notifications. > After re-reading your original question, I may have misunderstood, and you're wondering why $CONTACTGROUPMEMBERS$ doesn't work. See notes 5 and 7 on the page you linked. These macros work as on-demand-macros in event handlers, since event handler have no contacts associated with them. To obtain a list of conatct group members in that contacts, you would also need to provide the name of the group. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From samuel.mutel at free.fr Thu Feb 18 09:21:41 2010 From: samuel.mutel at free.fr (samuel.mutel at free.fr) Date: Thu, 18 Feb 2010 09:21:41 +0100 (CET) Subject: Fwd: Problem with recovery notification In-Reply-To: <4B7AFAA2.7000908@free.fr> References: <4B7AFAA2.7000908@free.fr> Message-ID: <1036495780.3662231266481301097.JavaMail.root@spooler6-g27.priv.proxad.net> Hello, Anybody ? I don't understand the Hard and soft logic in Service Alert of server 1 : CRITICAL;SOFT;1 CRITICAL;SOFT;2 CRITICAL;HARD;3 CRITICAL;SOFT;1 => Why I don't have after CRITICAL;HARD;3 and before CRITICAL;SOFT;1 : OK;HARD;3. Questions : 1) The flapping mode can explain this behaviour ? 2) If the node is down the service state (hard or soft) is set to soft ? Regards, Thanks. ----- Mail transf?r? ----- De: "Samuel Mutel" ?: nagios-users at lists.sourceforge.net Envoy?: Mardi 16 F?vrier 2010 21:05:54 GMT +01:00 Amsterdam / Berlin / Berne / Rome / Stockholm / Vienne Objet: Problem with recovery notification Hello, I have two Nagios servers that monitor the same equipement. This two nagios send the result of check by notification to another monitoring system (OpenNMS). I use Nagios 3.2. I received the recovery notification from server 2 but I did not received recovery notification from server 1. Why ? I think that SOFT and HARD states are the problem but I am not sur. In the second server 2 the status of service is HARD - OK so the notification is sent but on server 1, the service is SOFT - OK !!! Here is the log of Nagios : Service Alert of server 1 : [1266299592] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;1;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at 'https://ip_address/sdk/vimService.wsdl' [1266299885] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;2;CHECK_ESX3.PL CRITICAL - Error connecting to server at 'https://ip_address/sdk/webService': Perhaps host is not a Virtual Center or ESX server [1266299925] SERVICE ALERT: test-server;CPU;CRITICAL;HARD;3;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at 'https://ip_address/sdk/vimService.wsdl' [1266303080] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;1;CHECK_ESX3.PL CRITICAL - Error connecting to server at 'https://ip_address/sdk/webService': Perhaps host is not a Virtual Center or ESX server [1266303380] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;2;CHECK_ESX3.PL CRITICAL - Error connecting to server at 'https://ip_address/sdk/webService': Perhaps host is not a Virtual Center or ESX server [1266308485] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;1;(Service Check Timed Out) [1266308500] SERVICE ALERT: test-server;CPU;OK;SOFT;2;CHECK_ESX3.PL OK - "test-server" cpu usage=2.29 % Service Notification of server 1 : [1266299925] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at https://ip_address/sdk/vimService.wsdl [1266300645] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at https://ip_address/sdk/vimService.wsdl [1266301385] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at https://ip_address/sdk/vimService.wsdl [1266301720] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at https://ip_address/sdk/vimService.wsdl [1266303575] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error connecting to server at https://ip_address/sdk/webService: Perhaps host is not a Virtual Center or ESX server [1266304175] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error connecting to server at https://ip_address/sdk/webService: Perhaps host is not a Virtual Center or ESX server [1266304810] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at https://ip_address/sdk/vimService.wsdl [1266305270] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at https://ip_address/sdk/vimService.wsdl [1266305975] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error connecting to server at https://ip_address/sdk/webService: Perhaps host is not a Virtual Center or ESX server Service Alert of server 2 : [1266299856] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;1;(Service Check Timed Out) [1266300161] SERVICE ALERT: test-server;CPU;CRITICAL;HARD;1;(Service Check Timed Out) [1266300516] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;1;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at 'https://ip_address/sdk/vimService.wsdl' [1266301481] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;1;(Service Check Timed Out) [1266301512] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;2;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at 'https://ip_address/sdk/vimService.wsdl' [1266304201] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;1;CHECK_ESX3.PL CRITICAL - Error connecting to server at 'https://ip_address/sdk/webService': Perhaps host is not a Virtual Center or ESX server [1266304506] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;2;(Service Check Timed Out) [1266307211] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;1;(Service Check Timed Out) [1266307411] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;2;CHECK_ESX3.PL CRITICAL - Error connecting to server at 'https://ip_address/sdk/webService': Perhaps host is not a Virtual Center or ESX server [1266308616] SERVICE ALERT: test-server;CPU;OK;HARD;3;CHECK_ESX3.PL OK - "test-server" cpu usage=2.15 % Service Notification of server 2 : [1266301901] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;CRITICAL;send_service_trap_to_onms_preprod;(Service Check Timed Out) [1266302501] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;CRITICAL;send_service_trap_to_onms_preprod;(Service Check Timed Out) [1266303106] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;CRITICAL;send_service_trap_to_onms_preprod;(Service Check Timed Out) [1266304811] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;CRITICAL;send_service_trap_to_onms_preprod;(Service Check Timed Out) [1266305411] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;CRITICAL;send_service_trap_to_onms_preprod;(Service Check Timed Out) [1266306001] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;CRITICAL;send_service_trap_to_onms_preprod;CHECK_ESX3.PL CRITICAL - Error connecting to server at https://ip_address/sdk/webService: Perhaps host is not a Virtual Center or ESX server [1266307611] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;CRITICAL;send_service_trap_to_onms_preprod;CHECK_ESX3.PL CRITICAL - Error connecting to server at https://ip_address/sdk/webService: Perhaps host is not a Virtual Center or ESX server [1266308206] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;CRITICAL;send_service_trap_to_onms_preprod;CHECK_ESX3.PL CRITICAL - Error connecting to server at https://ip_address/sdk/webService: Perhaps host is not a Virtual Center or ESX server [1266308616] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;OK;send_service_trap_to_onms_preprod;CHECK_ESX3.PL OK - test-server cpu usage=2.15 % Thank for your help. Regards, Samuel Mutel. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Thu Feb 18 09:31:13 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Thu, 18 Feb 2010 00:31:13 -0800 Subject: Fwd: Problem with recovery notification In-Reply-To: <1036495780.3662231266481301097.JavaMail.root@spooler6-g27.priv.proxad.net> References: <1036495780.3662231266481301097.JavaMail.root@spooler6-g27.priv.proxad.net> Message-ID: <4B7CFAD1.3040008@hp.com> samuel.mutel at free.fr wrote: > Hello, > > Anybody ? > I don't understand the Hard and soft logic in Service Alert of server 1 : > > CRITICAL;SOFT;1 > CRITICAL;SOFT;2 > CRITICAL;HARD;3 > CRITICAL;SOFT;1 > > => Why I don't have after CRITICAL;HARD;3 and before CRITICAL;SOFT;1 : OK;HARD;3. > > Questions : > 1) The flapping mode can explain this behaviour ? > 2) If the node is down the service state (hard or soft) is set to soft ? > Flap detection only inhibits notifications. It would not effect hard/soft states. Several things could cause this, but it appears you've stripped all context out of the logs. Was Nagios restarted between the " CRITICAL;HARD;3" and the "CRITICAL;SOFT;1," maybe? Im not 100% sure, but the service state count may also be reset (I'd be a bit surprised if it isn't) if the host is determined to be down. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sam.bancal at gmail.com Thu Feb 18 10:47:52 2010 From: sam.bancal at gmail.com (Samuel Bancal) Date: Thu, 18 Feb 2010 10:47:52 +0100 Subject: service notification when host is down In-Reply-To: <4B7C1EE5.1010007@hp.com> References: <4fd99abf1002170742m2a7d6bb3y55abbfec20940e20@mail.gmail.com> <4B7C1EE5.1010007@hp.com> Message-ID: <4fd99abf1002180147j55be849vb14dd49a86816e31@mail.gmail.com> Thanks for your answer, In fact it is normal behavior to me also. Thing that is not "normal behavior" to me is that between two checks, Nagios jumps from "SOFT 1" to "HARD 1" without doing the steps "SOFT 1" > "SOFT 2" > "SOFT 3" and finally "HARD 4". Regards, Samuel Bancal 2010/2/17 Morris, Patrick > Samuel Bancal wrote: > >> Nagios Core 3.2.0 >> nagios-plugins-1.4.14 >> Ubuntu server 8.04.3 LTS >> >> Hi, >> >> I'm encountering problems to configure the notifications in case a server >> is no more responding to PING (ICMP). >> I don't understand why Nagios is jumping over steps when it's doing >> service-check "icmp". >> Here is the config : >> >> define host{ >> use generic-server >> host_name server1 >> alias server1 >> address the.ip.the.ip >> hostgroups prod-servers >> contact_groups group1 >> check_command check-host-alive >> check_period 24x7 >> check_interval 5 >> retry_interval 1 >> max_check_attempts 4 >> notification_period 24x7 >> notification_interval 60 >> notification_options d,u,r >> } >> >> define service{ >> use generic-service >> host_name server1 >> service_description ICMP >> check_command check_icmp!100.0,20%!500.0,60% >> max_check_attempts 4 >> normal_check_interval 5 >> retry_check_interval 1 >> notification_options w,u,c,r >> notification_interval 60 >> notification_period 24x7 >> } >> [...] >> define command{ >> command_name check-host-alive >> command_line $USER1$/check_ping -H $HOSTADDRESS$ -w 3000.0,80% -c >> 5000.0,100% -p 5 >> } >> define command{ >> command_name check_icmp >> command_line $USER1$/check_icmp -H $HOSTADDRESS$ -w $ARG1$ -c $ARG2$ >> -p 5 >> } >> [...] >> >> Here is an example of history that I get : >> Service Critical[2010-02-16 11:33:13] SERVICE ALERT: >> server1;ICMP;CRITICAL;SOFT;1;CRITICAL - the.ip.the.ip: rta nan, lost 100% >> Host Down[2010-02-16 11:33:43] HOST ALERT: server1;DOWN;SOFT;1;(Host Check >> Timed Out) >> Service Critical[2010-02-16 11:34:13] SERVICE ALERT: >> server1;ICMP;CRITICAL;HARD;1;CRITICAL - the.ip.the.ip: rta nan, lost 100% >> Host Down[2010-02-16 11:34:43] HOST ALERT: server1;DOWN;SOFT;2;(Host Check >> Timed Out) >> Host Down[2010-02-16 11:35:23] HOST ALERT: server1;DOWN;SOFT;3;(Host Check >> Timed Out) >> Host Down[2010-02-16 11:36:33] HOST ALERT: server1;DOWN;HARD;4;(Host Check >> Timed Out) >> Host Up[2010-02-16 11:37:43] HOST ALERT: server1;UP;HARD;1;PING OK - >> Packet loss = 0%, RTA = 0.67 ms >> Service Ok[2010-02-16 11:39:13] SERVICE ALERT: server1;ICMP;OK;HARD;1;OK - >> the.ip.the.ip: rta 0.943ms, lost 0% >> >> Or later : >> Host Down[2010-02-16 11:42:03] HOST ALERT: server1;DOWN;SOFT;1;(Host Check >> Timed Out) >> Host Down[2010-02-16 11:43:13] HOST ALERT: server1;DOWN;SOFT;2;(Host Check >> Timed Out) >> Service Critical[2010-02-16 11:44:13] SERVICE ALERT: >> server1;ICMP;CRITICAL;HARD;1;CRITICAL - the.ip.the.ip: rta nan, lost 100% >> Host Down[2010-02-16 11:44:43] HOST ALERT: server1;DOWN;SOFT;3;(Host Check >> Timed Out) >> Host Up[2010-02-16 11:45:53] HOST ALERT: server1;UP;SOFT;4;PING OK - >> Packet loss = 0%, RTA = 0.64 ms >> Service Ok[2010-02-16 11:49:13] SERVICE ALERT: server1;ICMP;OK;HARD;1;OK - >> the.ip.the.ip: rta 0.948ms, lost 0% >> > > If you're asking why Nagios runs a host check when it sees the service fail > a check, that's normal behavior. > > When a service check fails, the first thing Nagios will do is look to see > if the service failed because the host is down. > -- Samuel Bancal - CH -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From samuel.mutel at free.fr Thu Feb 18 11:10:11 2010 From: samuel.mutel at free.fr (samuel.mutel at free.fr) Date: Thu, 18 Feb 2010 11:10:11 +0100 (CET) Subject: Fwd: Problem with recovery notification In-Reply-To: <1036495780.3662231266481301097.JavaMail.root@spooler6-g27.priv.proxad.net> References: <1036495780.3662231266481301097.JavaMail.root@spooler6-g27.priv.proxad.net> Message-ID: <1855780397.3694721266487811010.JavaMail.root@spooler6-g27.priv.proxad.net> I found this in source code : /* ADDED IF STATEMENT 01-17-05 EG */ /* 01-17-05: Services in hard problem states before hosts went down would sometimes come back as soft problem states after */ /* the hosts recovered. This caused problems, so hopefully this will fix it */ if(temp_service->state_type==SOFT_STATE) temp_service->current_attempt=1; } "so hopefully this will fix it" => Perhaps this patch does not work ... Samuel Mutel. ----- Mail Original ----- De: "samuel mutel" ?: nagios-users at lists.sourceforge.net Envoy?: Jeudi 18 F?vrier 2010 09:21:41 GMT +01:00 Amsterdam / Berlin / Berne / Rome / Stockholm / Vienne Objet: [Nagios-users] Fwd: Problem with recovery notification Hello, Anybody ? I don't understand the Hard and soft logic in Service Alert of server 1 : CRITICAL;SOFT;1 CRITICAL;SOFT;2 CRITICAL;HARD;3 CRITICAL;SOFT;1 => Why I don't have after CRITICAL;HARD;3 and before CRITICAL;SOFT;1 : OK;HARD;3. Questions : 1) The flapping mode can explain this behaviour ? 2) If the node is down the service state (hard or soft) is set to soft ? Regards, Thanks. ----- Mail transf?r? ----- De: "Samuel Mutel" ?: nagios-users at lists.sourceforge.net Envoy?: Mardi 16 F?vrier 2010 21:05:54 GMT +01:00 Amsterdam / Berlin / Berne / Rome / Stockholm / Vienne Objet: Problem with recovery notification Hello, I have two Nagios servers that monitor the same equipement. This two nagios send the result of check by notification to another monitoring system (OpenNMS). I use Nagios 3.2. I received the recovery notification from server 2 but I did not received recovery notification from server 1. Why ? I think that SOFT and HARD states are the problem but I am not sur. In the second server 2 the status of service is HARD - OK so the notification is sent but on server 1, the service is SOFT - OK !!! Here is the log of Nagios : Service Alert of server 1 : [1266299592] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;1;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at 'https://ip_address/sdk/vimService.wsdl' [1266299885] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;2;CHECK_ESX3.PL CRITICAL - Error connecting to server at 'https://ip_address/sdk/webService': Perhaps host is not a Virtual Center or ESX server [1266299925] SERVICE ALERT: test-server;CPU;CRITICAL;HARD;3;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at 'https://ip_address/sdk/vimService.wsdl' [1266303080] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;1;CHECK_ESX3.PL CRITICAL - Error connecting to server at 'https://ip_address/sdk/webService': Perhaps host is not a Virtual Center or ESX server [1266303380] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;2;CHECK_ESX3.PL CRITICAL - Error connecting to server at 'https://ip_address/sdk/webService': Perhaps host is not a Virtual Center or ESX server [1266308485] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;1;(Service Check Timed Out) [1266308500] SERVICE ALERT: test-server;CPU;OK;SOFT;2;CHECK_ESX3.PL OK - "test-server" cpu usage=2.29 % Service Notification of server 1 : [1266299925] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at https://ip_address/sdk/vimService.wsdl [1266300645] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at https://ip_address/sdk/vimService.wsdl [1266301385] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at https://ip_address/sdk/vimService.wsdl [1266301720] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at https://ip_address/sdk/vimService.wsdl [1266303575] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error connecting to server at https://ip_address/sdk/webService: Perhaps host is not a Virtual Center or ESX server [1266304175] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error connecting to server at https://ip_address/sdk/webService: Perhaps host is not a Virtual Center or ESX server [1266304810] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at https://ip_address/sdk/vimService.wsdl [1266305270] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at https://ip_address/sdk/vimService.wsdl [1266305975] SERVICE NOTIFICATION: onms_prod;test-server;CPU;CRITICAL;send_service_trap_to_onms_prod;CHECK_ESX3.PL CRITICAL - Error connecting to server at https://ip_address/sdk/webService: Perhaps host is not a Virtual Center or ESX server Service Alert of server 2 : [1266299856] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;1;(Service Check Timed Out) [1266300161] SERVICE ALERT: test-server;CPU;CRITICAL;HARD;1;(Service Check Timed Out) [1266300516] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;1;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at 'https://ip_address/sdk/vimService.wsdl' [1266301481] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;1;(Service Check Timed Out) [1266301512] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;2;CHECK_ESX3.PL CRITICAL - Error: Server version unavailable at 'https://ip_address/sdk/vimService.wsdl' [1266304201] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;1;CHECK_ESX3.PL CRITICAL - Error connecting to server at 'https://ip_address/sdk/webService': Perhaps host is not a Virtual Center or ESX server [1266304506] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;2;(Service Check Timed Out) [1266307211] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;1;(Service Check Timed Out) [1266307411] SERVICE ALERT: test-server;CPU;CRITICAL;SOFT;2;CHECK_ESX3.PL CRITICAL - Error connecting to server at 'https://ip_address/sdk/webService': Perhaps host is not a Virtual Center or ESX server [1266308616] SERVICE ALERT: test-server;CPU;OK;HARD;3;CHECK_ESX3.PL OK - "test-server" cpu usage=2.15 % Service Notification of server 2 : [1266301901] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;CRITICAL;send_service_trap_to_onms_preprod;(Service Check Timed Out) [1266302501] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;CRITICAL;send_service_trap_to_onms_preprod;(Service Check Timed Out) [1266303106] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;CRITICAL;send_service_trap_to_onms_preprod;(Service Check Timed Out) [1266304811] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;CRITICAL;send_service_trap_to_onms_preprod;(Service Check Timed Out) [1266305411] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;CRITICAL;send_service_trap_to_onms_preprod;(Service Check Timed Out) [1266306001] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;CRITICAL;send_service_trap_to_onms_preprod;CHECK_ESX3.PL CRITICAL - Error connecting to server at https://ip_address/sdk/webService: Perhaps host is not a Virtual Center or ESX server [1266307611] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;CRITICAL;send_service_trap_to_onms_preprod;CHECK_ESX3.PL CRITICAL - Error connecting to server at https://ip_address/sdk/webService: Perhaps host is not a Virtual Center or ESX server [1266308206] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;CRITICAL;send_service_trap_to_onms_preprod;CHECK_ESX3.PL CRITICAL - Error connecting to server at https://ip_address/sdk/webService: Perhaps host is not a Virtual Center or ESX server [1266308616] SERVICE NOTIFICATION: onms_preprod;test-server;CPU;OK;send_service_trap_to_onms_preprod;CHECK_ESX3.PL OK - test-server cpu usage=2.15 % Thank for your help. Regards, Samuel Mutel. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From paul.willis at kent.pnn.police.uk Thu Feb 18 12:04:28 2010 From: paul.willis at kent.pnn.police.uk (Paul WILLIS PSE 55499) Date: Thu, 18 Feb 2010 11:04:28 +0000 Subject: CHECK_HTTP odd behaviour Message-ID: <4B7D1EBC020000C40001551F@kent.pnn.police.uk> Well I tried writing a wrapper script to see what check_http was actually receiving. The answer would appear to be absolutely nothing, in fact check_http is never even getting called. Something in the parameters would appear to be causing nagios to throw an exception when trying to make the call that is caught and treated as a critical error with a null reply. When I went through the -A parameter and escaped every non-standard character everything burst into life, the wrapper reported the correct string and check_http reported the site as up. Clearly that whereas bash only needs $ and ` escaping within inverted commas nagios must have a larger list, including I would guess either the ; or the : Thanks for the help Paul Willis -- This email and any accompanying document(s) contain information from Kent Police, which is confidential or privileged. The information is intended to be for the exclusive use of the individual(s) or bodies to whom it is addressed. If you are not the intended recipient, be aware that any disclosure, copying, distribution or use of the contents of this information is prohibited. If you have received this email in error, please notify us immediately by contacting the sender or telephoning 01622 690690. The copyright in the contents of this email and any enclosure is the property of Kent Police and any unauthorised reproduction or disclosure is contrary to the provisions of the Copyright Designs and Patents Act 1998. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gtj at addicks.org Thu Feb 18 12:17:17 2010 From: gtj at addicks.org (Glynne Jones) Date: Thu, 18 Feb 2010 11:17:17 +0000 Subject: Processing External Commands Message-ID: <20100218111717.GB3916@valley.addicks.org> Hi, I'm running a distributed setup with two servers carrying out monitoring and sending their results back to a central server via NSCA. Most of the time this works well, but from time to time I get a substantial delay both between NSCA receiving the check result and the EXTERNAL COMMAND being logged, and between the EXTERNAL COMMAND being logged and the PASSIVE SERVICE CHECK result being logged. This delay can be several minutes and has sometimes been over 10 minutes. I am running ndoutils as well, and some of the tables are quite big. Could this affect things? Any help appreciated. Thanks, Glynne ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Feb 18 14:13:56 2010 From: marc at ena.com (Marc Powell) Date: Thu, 18 Feb 2010 07:13:56 -0600 Subject: CHECK_HTTP odd behaviour In-Reply-To: <4B7D1EBC020000C40001551F@kent.pnn.police.uk> References: <4B7D1EBC020000C40001551F@kent.pnn.police.uk> Message-ID: <211D2A11-170D-4E67-9CEF-88EE0410BE67@ena.com> On Feb 18, 2010, at 5:04 AM, Paul WILLIS PSE 55499 wrote: > Clearly that whereas bash only needs $ and ` escaping within inverted commas nagios must have a larger list, including I would guess either the ; or the : Nope, not really. \, ! and $ are the only characters that may need escaping, depending on where they are used. With the exception of $MACRO$ substitutions, nagios just takes your raw command_line and passes it to the shell for execution. You never posted your command definition but I'd guess that you didn't have proper quoting or something like that. -- Marc ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Feb 18 14:19:05 2010 From: marc at ena.com (Marc Powell) Date: Thu, 18 Feb 2010 07:19:05 -0600 Subject: Processing External Commands In-Reply-To: <20100218111717.GB3916@valley.addicks.org> References: <20100218111717.GB3916@valley.addicks.org> Message-ID: <5DE387F3-EC15-4A89-8B4E-6D1B6170D9C3@ena.com> On Feb 18, 2010, at 5:17 AM, Glynne Jones wrote: > This delay can be several minutes and has sometimes been over 10 minutes. > > I am running ndoutils as well, and some of the tables are quite big. Could this affect things? Yes, certainly. If the database is busy, either through action of your own or through one of the regular table maintenance tasks then processing of check results may be delayed waiting on the database. Should be pretty easy to see through top if mysql is busy during those times. -- Marc ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Feb 18 13:59:26 2010 From: marc at ena.com (Marc Powell) Date: Thu, 18 Feb 2010 06:59:26 -0600 Subject: Can I run both Nagios V2 and V3 in parallel while I migrate? In-Reply-To: <788699.27345.qm@web63002.mail.re1.yahoo.com> References: <788699.27345.qm@web63002.mail.re1.yahoo.com> Message-ID: On Feb 17, 2010, at 8:29 PM, Lylex Ryan wrote: > In upgrading from nagios (v2) to nagios3, I'd like to do a fresh install of nagios3 and start with a clean sheet of (config) "paper". But can I do this while V2 is running production? Yes, I've run instances of all three versions on a single box at once. > Since the packages have different names, I thought it might work. But they probably would both have /etc/nagios and other default directories in common. Clearly if the packages install components to common directories then that isn't going to work. Compiling and installing from tarball is not difficult at all and you have control over where things get put (by default everything is under /usr/local/nagios). You'll need to set up a second http vhost with a different name for the second instance and either modify the nagios init script to start the second instance or add the startup to rc.local. Once you're confident in the success of your transition, you could uninstall the v2 package, install the v3 package and copy over your etc and var directories from your transition install... -- Marc ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Thu Feb 18 14:26:47 2010 From: jim at jimavery.me.uk (Jim Avery) Date: Thu, 18 Feb 2010 13:26:47 +0000 Subject: Can I run both Nagios V2 and V3 in parallel while I migrate? In-Reply-To: <788699.27345.qm@web63002.mail.re1.yahoo.com> References: <788699.27345.qm@web63002.mail.re1.yahoo.com> Message-ID: <765d77c81002180526u3e4e4accn5debfbb28690e0f7@mail.gmail.com> On 18 February 2010 02:29, Lylex Ryan wrote: > > In upgrading from nagios (v2) to nagios3, I'd like to do a fresh install of nagios3 and start with a clean sheet of? (config) "paper".? But can I do this while V2 is running production? > > Since the packages have different names, I thought it might work.? But they probably would both have /etc/nagios and other default directories in common.? Maybe if I installed from the tar-ball, I could specify new directories for V3, but I'm also trying to avoid that learning-process and use a pre-packaged rpm. > > Maybe installing V3 on a different server all-together, then moving it to the production machine would be a way. I think the standard advice is no you can't run more than one instance on a single operating-system (of course you probably can if you put enough effort in to it). I would recommend against installing your new Nagios 3 install with non-standard install paths - it could make installing add-ons in the future (for example PNP graphing, NagVis dashboards etc,) difficult if everything is in the wrong place. Personally, when I upgraded from 2 to 3, I put the 3 install on a new server and 'migrated' hosts and services across from old to new gradually over a period of a couple of months. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Feb 18 14:05:05 2010 From: marc at ena.com (Marc Powell) Date: Thu, 18 Feb 2010 07:05:05 -0600 Subject: service notification when host is down In-Reply-To: <4fd99abf1002180147j55be849vb14dd49a86816e31@mail.gmail.com> References: <4fd99abf1002170742m2a7d6bb3y55abbfec20940e20@mail.gmail.com> <4B7C1EE5.1010007@hp.com> <4fd99abf1002180147j55be849vb14dd49a86816e31@mail.gmail.com> Message-ID: <33FEF5BF-5162-4366-B84F-DDEF1758A36F@ena.com> On Feb 18, 2010, at 3:47 AM, Samuel Bancal wrote: > Thanks for your answer, > > In fact it is normal behavior to me also. > Thing that is not "normal behavior" to me is that between two checks, Nagios jumps from "SOFT 1" to "HARD 1" without doing the steps "SOFT 1" > "SOFT 2" > "SOFT 3" and finally "HARD 4". If the host is down, why should nagios go through all that? There's no possibility for the service to be up when the host is not. -- Marc ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Feb 18 13:52:51 2010 From: marc at ena.com (Marc Powell) Date: Thu, 18 Feb 2010 06:52:51 -0600 Subject: Editing the nagios Side bar In-Reply-To: <7D85034F6BBBD540AF52C59CED3AA1E801C71288@AKVXCH02.tvnzad.tvnz.co.nz> References: <7D85034F6BBBD540AF52C59CED3AA1E801C71288@AKVXCH02.tvnzad.tvnz.co.nz> Message-ID: On Feb 17, 2010, at 5:14 PM, Ron Wilson wrote: > I have 6 groups set up holding servers being patched on each day. I would like an entry in the Nagios sidebar that says patching which would then give a web page view of the six patching groups on one page. This makes it easier for admins to disable notifications for a large number of servers with one click. > Because we have so many groups it would be easier to have the Patching days on one page. > However while I can create a url for one days Patching in the new page I cannot get all six. > This is my php code >
  • /status.cgi?hostgroup=Patch_Day1&style=overview" target="">Patch Day1
  • > > This works fine but how can I get the other 5 Patch Groups in that line. I need something like Patch_Day* but such a command does not work with php. > Anyone got some ideas? It's not a PHP thing... Nagios does not have functionality to limit (or expand, depending on how you look at it), the display of multiple hostgroups that are a subset of all hostgroups. The only exception to this is limitation through authentication, which wouldn't appear to fit your goals. -- Marc ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gtj at addicks.org Thu Feb 18 14:44:30 2010 From: gtj at addicks.org (Glynne Jones) Date: Thu, 18 Feb 2010 13:44:30 +0000 Subject: Processing External Commands In-Reply-To: <5DE387F3-EC15-4A89-8B4E-6D1B6170D9C3@ena.com> References: <20100218111717.GB3916@valley.addicks.org> <5DE387F3-EC15-4A89-8B4E-6D1B6170D9C3@ena.com> Message-ID: <20100218134430.GE3916@valley.addicks.org> On Thu, Feb 18, 2010 at 07:19:05AM -0600, Marc Powell wrote: > > On Feb 18, 2010, at 5:17 AM, Glynne Jones wrote: > > > This delay can be several minutes and has sometimes been over 10 minutes. > > > > I am running ndoutils as well, and some of the tables are quite big. Could this affect things? > > Yes, certainly. If the database is busy, either through action of your own or through one of the regular table maintenance tasks then processing of check results may be delayed waiting on the database. Should be pretty easy to see through top if mysql is busy during those times. > Thought that might be the case. mysql is always busy (I've got 3370 checks over 362 hosts). You mention regular table maintenance tasks - is this something that comes out of the box or something separate? Thanks, Glynne ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From reachtarak at hotmail.com Thu Feb 18 15:35:29 2010 From: reachtarak at hotmail.com (Digital Edge) Date: Thu, 18 Feb 2010 20:05:29 +0530 Subject: Help required In-Reply-To: References: , , <765d77c81002171227h526f8203m4b50103925335178@mail.gmail.com>, Message-ID: From: reachtarak at hotmail.com To: jim at jimavery.me.uk Date: Thu, 18 Feb 2010 12:36:17 +0530 CC: nagios-users at lists.sourceforge.net Subject: Re: [Nagios-users] Help required > Date: Wed, 17 Feb 2010 20:27:34 +0000 > Subject: Re: [Nagios-users] Help required > From: jim at jimavery.me.uk > To: reachtarak at hotmail.com > CC: nagios-users at lists.sourceforge.net > > On 17 February 2010 17:36, Digital Edge wrote: > > Hi List, > > > > It will be really helpful if i can get any response on my below mentioned > > query..... > > > > I have an URL , say.. > > > > http://www.example.com/sigin.jsf , After login in, it'll redirect to > > "https://www.example1.com/ddo/get_sec_pwd.php" ; here another authentication > > will happen, then it'll come to an URL > > "https://www.example1.com/home/home.jsf. Inside that page I have several > > other Tabs.. > > -------------------- > > Home|Home1|Home2 > > -------------------- > > all the tabs can be navigate & viewable after successful login of 2nd time . > > And can be accessible within that session only. > > > > Can we monitor those URLS response time without loosing the session , one by > > one in Nagios...... > > Not in Nagios itself, no, but I expect you could use WebInject > http://www.webinject.org/ to do the web querying and timing and feed > the results back to Nagios. > > hth, > > Jim Hi , Yes; even I have tried also. The issue what i'm facing is after successful authentication checking , I'm unable to navigate through those links. All the tests are passing except case4. I am not able to understand why it's happening. can anyone help me on this ..... /\ Ricky Dear List, can anyone help me on this.... sorry for the double post. Hi , Yes; even I have tried also. The issue what i'm facing is after successful authentication checking , I'm unable to navigate through those links. All the tests are passing except case4. I am not able to understand why it's happening. can anyone help me on this ..... /\ Ricky _________________________________________________________________ Your E-mail and More On-the-Go. Get Windows Live Hotmail Free. https://signup.live.com/signup.aspx?id=60969 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From paul.willis at kent.pnn.police.uk Thu Feb 18 15:40:21 2010 From: paul.willis at kent.pnn.police.uk (Paul WILLIS PSE 55499) Date: Thu, 18 Feb 2010 14:40:21 +0000 Subject: CHECK_HTTP odd behaviour Message-ID: <4B7D5155020000C400015556@kent.pnn.police.uk> Marc It wasn't the command definition file. That was the same as the command I was using for running directly, ie # 'check_eRhttp' command definition define command{ command_name check_eRhttp command_line $USER1$/check_http -p 8000 -H some.host.co.uk -u "/sap/bc/webdynpro/sap/hrrcf_a_unreg_job_search?sap-wd-configId=ZUNREG_JOB_SEARCH&sap-ep-themeroot=/sap/public/bc/ur/customerthemes/sap_kp" -A "Mozilla/5.0 (Windows; U; Windows NT 5.2; en-US; rv:1.8.0.9) Gecko/20061206 Firefox/1.5.0.9" -R fs_QE2_00I did away with passing parameters from the service as I originally thought that was the problem. I have since had a quick play and can confirm it is indeed the semi colons. Leave them in and it goes critical / null status without calling the plugin. Escape them and it behaves Paul Willis -- This email and any accompanying document(s) contain information from Kent Police, which is confidential or privileged. The information is intended to be for the exclusive use of the individual(s) or bodies to whom it is addressed. If you are not the intended recipient, be aware that any disclosure, copying, distribution or use of the contents of this information is prohibited. If you have received this email in error, please notify us immediately by contacting the sender or telephoning 01622 690690. The copyright in the contents of this email and any enclosure is the property of Kent Police and any unauthorised reproduction or disclosure is contrary to the provisions of the Copyright Designs and Patents Act 1998. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From malik_khurram at hotmail.com Thu Feb 18 16:08:18 2010 From: malik_khurram at hotmail.com (Khurram Malik) Date: Thu, 18 Feb 2010 15:08:18 +0000 Subject: MACRO PROBLEM In-Reply-To: <4B7CEE63.8090007@hp.com> References: , <4B7CEE63.8090007@hp.com> Message-ID: Hi PAtrick Thanks for your reply, i know according to the matrix CONTACTEMAIL and CONTACTPAGER are disabled for Host/Service Event Handlers but I am talking about CONTACTGROUPMEMBERS. Or if you can help me any other way getting CONTACTEMAIL and CONTACTPAGER ? I hope you understand the problem that i need all this information to send to Netcool along with other Host or Service related information. Regards Khurram Malik > Date: Wed, 17 Feb 2010 23:38:12 -0800 > From: patrick.morris at hp.com > To: malik_khurram at hotmail.com > CC: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] MACRO PROBLEM > > Khurram Malik wrote: > > Hi > > > > I am using Nagios 3.0.6 and in an integration project i want Nagios to > > send alerts to Netcool. I am using Host/Service Global Event Handlers. > > I am able to get the maximum information via the following macros > > > > $SERVICESTATE$ $SERVICESTATETYPE$ $SERVICEATTEMPT$ $SERVICEDESC$ > > > > But i also want some other info via macros and i am using the > > following link to see if macro is enabled or disabled > > http://nagios.sourceforge.net/docs/3_0/macrolist.html#hostoutput > > > > I want to get CONTACTEMAIL and CONTACTPAGER contents but these macros > > are disabled with Global Host/Service handler, what is the easiest way > > to get info for the conact macros with Global Event Handlers. I can > > see $_CONTACTGROUPMEMBERS$_ is enabled with Global Event Handlers but > > I am unable to get any value, seems like a bug. > > This is not a bug. These macros are not available with event handlers, > since eventhandlers do not have contacts associated with them. If you > look at the matrix on tha page you linked, you'll see that CONTACTEMAIL > and CONTACTPAGER work only with host and service notifications. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From malik_khurram at hotmail.com Thu Feb 18 16:17:59 2010 From: malik_khurram at hotmail.com (Khurram Malik) Date: Thu, 18 Feb 2010 15:17:59 +0000 Subject: MACRO PROBLEM In-Reply-To: <4B7CF0B8.8070805@hp.com> References: <4B7CEE63.8090007@hp.com>,<4B7CF0B8.8070805@hp.com> Message-ID: Thanks Patrick But how can i provide the name of contact group when it depends upon the service or host which is triggering the event. Is there a way that i can provide contact group dynamically to CONTACTGROUPMEMBERS ? e.g $CONTACTEMAIL:CONTACTGROUPMEMBERS:," which can give me comma separated list of emails associated with that perticular service or host? or if there is any ready made script for Nagios and Netcool integration? Regards Khurram Malik > Date: Wed, 17 Feb 2010 23:48:08 -0800 > From: patrick.morris at hp.com > To: malik_khurram at hotmail.com > CC: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] MACRO PROBLEM > > Morris, Patrick wrote: > > Khurram Malik wrote: > > > >> Hi > >> > >> I am using Nagios 3.0.6 and in an integration project i want Nagios to > >> send alerts to Netcool. I am using Host/Service Global Event Handlers. > >> I am able to get the maximum information via the following macros > >> > >> $SERVICESTATE$ $SERVICESTATETYPE$ $SERVICEATTEMPT$ $SERVICEDESC$ > >> > >> But i also want some other info via macros and i am using the > >> following link to see if macro is enabled or disabled > >> http://nagios.sourceforge.net/docs/3_0/macrolist.html#hostoutput > >> > >> I want to get CONTACTEMAIL and CONTACTPAGER contents but these macros > >> are disabled with Global Host/Service handler, what is the easiest way > >> to get info for the conact macros with Global Event Handlers. I can > >> see $_CONTACTGROUPMEMBERS$_ is enabled with Global Event Handlers but > >> I am unable to get any value, seems like a bug. > >> > > > > This is not a bug. These macros are not available with event handlers, > > since eventhandlers do not have contacts associated with them. If you > > look at the matrix on tha page you linked, you'll see that CONTACTEMAIL > > and CONTACTPAGER work only with host and service notifications. > > > > After re-reading your original question, I may have misunderstood, and > you're wondering why > > $CONTACTGROUPMEMBERS$ doesn't work. > > See notes 5 and 7 on the page you linked. These macros work as on-demand-macros in event handlers, since event handler have no contacts associated with them. To obtain a list of conatct group members in that contacts, you would also need to provide the name of the group. > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Feb 18 17:02:51 2010 From: marc at ena.com (Marc Powell) Date: Thu, 18 Feb 2010 10:02:51 -0600 Subject: Processing External Commands In-Reply-To: <20100218134430.GE3916@valley.addicks.org> References: <20100218111717.GB3916@valley.addicks.org> <5DE387F3-EC15-4A89-8B4E-6D1B6170D9C3@ena.com> <20100218134430.GE3916@valley.addicks.org> Message-ID: On Feb 18, 2010, at 7:44 AM, Glynne Jones wrote: > Thought that might be the case. mysql is always busy (I've got 3370 checks over 362 hosts). > > You mention regular table maintenance tasks - is this something that comes out of the box or something separate? ndo2db.cfg -- ## TABLE TRIMMING OPTIONS # Several database tables containing Nagios event data can become quite large # over time. Most admins will want to trim these tables and keep only a # certain amount of data in them. The options below are used to specify the # age (in MINUTES) that data should be allowd to remain in various tables # before it is deleted. Using a value of zero (0) for any value means that # that particular table should NOT be automatically trimmed. # Keep timed events for 24 hours max_timedevents_age=1440 # Keep system commands for 1 week max_systemcommands_age=10080 # Keep service checks for 1 week max_servicechecks_age=10080 # Keep host checks for 1 week max_hostchecks_age=10080 # Keep event handlers for 31 days max_eventhandlers_age=44640 I've set all of these to 1 hour for my install based on my needs. If you have database backup scripts, those could be causing delays as well. -- Marc ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gtj at addicks.org Thu Feb 18 17:29:13 2010 From: gtj at addicks.org (Glynne Jones) Date: Thu, 18 Feb 2010 16:29:13 +0000 Subject: Processing External Commands In-Reply-To: References: <20100218111717.GB3916@valley.addicks.org> <5DE387F3-EC15-4A89-8B4E-6D1B6170D9C3@ena.com> <20100218134430.GE3916@valley.addicks.org> Message-ID: <20100218162913.GF3916@valley.addicks.org> On Thu, Feb 18, 2010 at 10:02:51AM -0600, Marc Powell wrote: > > On Feb 18, 2010, at 7:44 AM, Glynne Jones wrote: > > > Thought that might be the case. mysql is always busy (I've got 3370 checks over 362 hosts). > > > > You mention regular table maintenance tasks - is this something that comes out of the box or something separate? > > ndo2db.cfg -- > > ## TABLE TRIMMING OPTIONS [snip] > I've set all of these to 1 hour for my install based on my needs. If you have database backup scripts, those could be causing delays as well. > Doh!, cheers Marc. Forgot those were there. Don't think those are getting the way. Think it's more likely to be some of the other tables getting large. Have you changed the table engine from MyISAM to InnoDB? I'll have a play with the debug settings to see if I can find where the delay is coming in. Thanks, Glynne ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From pepper at reppep.com Thu Feb 18 17:28:48 2010 From: pepper at reppep.com (Chris Pepper) Date: Thu, 18 Feb 2010 11:28:48 -0500 Subject: Delayed Notification for Primary & Secondary Nagios Servers Message-ID: <4B7D6AC0.3020907@reppep.com> We have 2 Nagios servers, with the same /usr/local/nagios/etc (kept current with rsync). I'd like to have the second server skip the initial notifications, since under normal circumstances we'll receive the primary server's notification email and react to that. Does Nagios provide a way to do this? I don't see anything. Currently, I'm considering going with email notifications from primary every 30min, and using a $USER$ macro to effectively set notifications_enabled=0 on secondary. Then I should be abel to escalate to pages with first_notification=3 from secondary, and first_notification=5 from primary. This way, if primary goes down, we should get pages at 90min; if secondary goes down, we should get email about it at 30min and pages at 150min. I can also explicitly enable notifications regarding primary on the secondary server. Is there a better way to do this? Thanks, Chris Pepper -- Chris Pepper: ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Feb 18 19:17:31 2010 From: marc at ena.com (Marc Powell) Date: Thu, 18 Feb 2010 12:17:31 -0600 Subject: Processing External Commands In-Reply-To: <20100218162913.GF3916@valley.addicks.org> References: <20100218111717.GB3916@valley.addicks.org> <5DE387F3-EC15-4A89-8B4E-6D1B6170D9C3@ena.com> <20100218134430.GE3916@valley.addicks.org> <20100218162913.GF3916@valley.addicks.org> Message-ID: <46506D5C-8A7C-4F74-A5A4-FD97C959F233@ena.com> On Feb 18, 2010, at 10:29 AM, Glynne Jones wrote: > Doh!, cheers Marc. Forgot those were there. Don't think those are getting the way. Think it's more likely to be some of the other tables getting large. Have you changed the table engine from MyISAM to InnoDB? Still all myISAM. -- Marc ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michael at medin.name Thu Feb 18 19:25:33 2010 From: michael at medin.name (Michael Medin) Date: Thu, 18 Feb 2010 19:25:33 +0100 Subject: NRPE/NSCA replacement thoughts? Message-ID: <4B7D861D.1090906@medin.name> Hello Since I am pondering a replacement for the NSCA and NRPE protocol I thought I would get some thoughts from the community? So this is pretty much an "open floor" kind of thing to get some sense of what people actually need and would want (if anything at all). But to get some general idea I'll give you a few questions to start it off: Is a new protocol a good idea? Should a new protocol be "flat text based" or structured? Would webservices be the best way? Should the protocol be extensible? What features would a new protocol need to support? - message, performance data, configuration, multiple queries, control logic transfer, inventory, etc. What plattforms would it need to support? Whats polling scheme(s): active, passive, active/passive, proxy, etc? Master/slave scenarios? In both NRPE and NSCA "nagios" is the master should the client be allowed to act as master? What kind of security mechanisms do you need (host, password, encryption, certificates, etc)? Client side "checks" or client side data gathering with server side checks? (ie. check_nrpe get "ok" back, another option would be to get the "value" and let the server decide if it is good or bad.) Multiple streams? ie send to both Nagios and potentially other collectors (like rrd) <> // Michael Medin ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Feb 18 20:28:58 2010 From: marc at ena.com (Marc Powell) Date: Thu, 18 Feb 2010 13:28:58 -0600 Subject: unsubscribe In-Reply-To: <8324365F3DDCFE4EA4CBB01F275CDD6504B576FD@DENMSGV1.QUANTUM.COM> References: <8324365F3DDCFE4EA4CBB01F275CDD6504B576FD@DENMSGV1.QUANTUM.COM> Message-ID: <10069A68-62AF-47F9-B8D8-27CD79E3CBA9@ena.com> you mean to send this to nagios-users-request at lists.sourceforge.net >From this e-mail's headers -- List-Id: Nagios Users List List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , -- Marc On Feb 18, 2010, at 12:42 PM, Rick Garland wrote: > unsubscribe > > The information contained in this transmission may be confidential. Any disclosure, copying, or further distribution of confidential information is not permitted unless such privilege is explicitly granted in writing by Quantum. Quantum reserves the right to have electronic communications, including email and attachments, sent across its networks filtered through anti virus and spam software programs and retain such messages in order to comply with applicable data security and retention requirements. Quantum is not responsible for the proper and complete transmission of the substance of this communication or for any delay in its receipt. > ------------------------------------------------------------------------------ > Download Intel® Parallel Studio Eval > Try the new software tools for yourself. Speed compiling, find bugs > proactively, and fine-tune applications for parallel performance. > See why Intel Parallel Studio got high marks during beta. > http://p.sf.net/sfu/intel-sw-dev_______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From tktucker at gmail.com Thu Feb 18 22:31:45 2010 From: tktucker at gmail.com (tktucker at gmail.com) Date: Thu, 18 Feb 2010 21:31:45 +0000 Subject: Set Host Status from Distributed Monitoring Server Message-ID: <0015175748da9147e2047fe6b13d@google.com> Hello! Pardon me if this is already covered somewhere in the documentation. I can't seem to find what I'm looking for. I have a working Nagios 3.2.0 environment in a distributed configuration. One of my environments lives behind a firewall that is blocking icmp traffic from the central server. In that same environment I have a remote Nagios node that is successfully sending service checks backs to the central server. All of the nodes behind this firewall are reporting "DOWN" on the central server nagios page, but their service checks are reporting "OK". Is their a configuration setting available so that the central server will report these nodes as UP if it receives a successful check from the remote monitoring node? The "translate_passive_host_checks" options sounds like it should work, but it doesn't. I understand I can remove the check_command from the host.cfg, but the host status will be a Pending status. Any suggestions? Thank you in advance for your time and assistance. CS Configs ---------------- log_file=/usr/local/nagios/var/nagios.log cfg_file=/usr/local/nagios/etc/objects/commands.cfg cfg_file=/usr/local/nagios/etc/objects/contacts.cfg cfg_file=/usr/local/nagios/etc/objects/timeperiods.cfg cfg_file=/usr/local/nagios/etc/objects/templates.cfg cfg_file=/usr/local/nagios/etc/objects/hostgroups.cfg cfg_dir=/usr/local/nagios/etc/hosts object_cache_file=/usr/local/nagios/var/objects.cache precached_object_file=/usr/local/nagios/var/objects.precache resource_file=/usr/local/nagios/etc/resource.cfg status_file=/usr/local/nagios/var/status.dat status_update_interval=10 nagios_user=nagios nagios_group=nagios check_external_commands=1 command_check_interval=-1 command_file=/usr/local/nagios/var/rw/nagios.cmd external_command_buffer_slots=4096 lock_file=/usr/local/nagios/var/nagios.lock temp_file=/usr/local/nagios/var/nagios.tmp temp_path=/tmp event_broker_options=-1 log_rotation_method=d log_archive_path=/usr/local/nagios/var/archives use_syslog=1 log_notifications=1 log_service_retries=1 log_host_retries=1 log_event_handlers=1 log_initial_states=0 log_external_commands=1 log_passive_checks=1 service_inter_check_delay_method=s max_service_check_spread=30 service_interleave_factor=s host_inter_check_delay_method=s max_host_check_spread=30 max_concurrent_checks=0 check_result_reaper_frequency=10 max_check_result_reaper_time=30 check_result_path=/usr/local/nagios/var/spool/checkresults max_check_result_file_age=3600 cached_host_check_horizon=15 cached_service_check_horizon=15 enable_predictive_host_dependency_checks=1 enable_predictive_service_dependency_checks=1 soft_state_dependencies=0 auto_reschedule_checks=0 auto_rescheduling_interval=30 auto_rescheduling_window=180 sleep_time=0.25 service_check_timeout=60 host_check_timeout=30 event_handler_timeout=30 notification_timeout=30 ocsp_timeout=5 perfdata_timeout=5 retain_state_information=1 state_retention_file=/usr/local/nagios/var/retention.dat retention_update_interval=60 use_retained_program_state=1 use_retained_scheduling_info=1 retained_host_attribute_mask=0 retained_service_attribute_mask=0 retained_process_host_attribute_mask=0 retained_process_service_attribute_mask=0 retained_contact_host_attribute_mask=0 retained_contact_service_attribute_mask=0 interval_length=60 check_for_updates=1 bare_update_check=0 use_aggressive_host_checking=0 execute_service_checks=0 accept_passive_service_checks=1 execute_host_checks=1 accept_passive_host_checks=1 enable_notifications=0 enable_event_handlers=1 process_performance_data=0 obsess_over_services=0 obsess_over_hosts=0 translate_passive_host_checks=1 passive_host_checks_are_soft=0 check_for_orphaned_services=1 check_for_orphaned_hosts=1 check_service_freshness=1 service_freshness_check_interval=60 check_host_freshness=1 host_freshness_check_interval=60 additional_freshness_latency=15 enable_flap_detection=1 low_service_flap_threshold=5.0 high_service_flap_threshold=20.0 low_host_flap_threshold=5.0 high_host_flap_threshold=20.0 date_format=us p1_file=/usr/local/nagios/bin/p1.pl enable_embedded_perl=1 use_embedded_perl_implicitly=1 illegal_object_name_chars=`~!$%^&*|'"<>?,()= illegal_macro_output_chars=`~$&|'"<> use_regexp_matching=0 use_true_regexp_matching=0 admin_email=nagios at localhost admin_pager=pagenagios at localhost daemon_dumps_core=0 use_large_installation_tweaks=0 enable_environment_macros=1 debug_level=16 debug_verbosity=2 debug_file=/usr/local/nagios/var/nagios.debug max_debug_file_size=1000000 Remote Monitoring Node ------------------------------------ log_file=/usr/local/nagios/var/nagios.log cfg_file=/usr/local/nagios/etc/objects/commands.cfg cfg_file=/usr/local/nagios/etc/objects/contacts.cfg cfg_file=/usr/local/nagios/etc/objects/timeperiods.cfg cfg_file=/usr/local/nagios/etc/objects/templates.cfg cfg_file=/usr/local/nagios/etc/objects/hostgroups.cfg cfg_dir=/usr/local/nagios/etc/hosts object_cache_file=/usr/local/nagios/var/objects.cache precached_object_file=/usr/local/nagios/var/objects.precache resource_file=/usr/local/nagios/etc/resource.cfg status_file=/usr/local/nagios/var/status.dat status_update_interval=10 nagios_user=nagios nagios_group=nagios check_external_commands=1 command_check_interval=-1 command_file=/usr/local/nagios/var/rw/nagios.cmd external_command_buffer_slots=4096 lock_file=/usr/local/nagios/var/nagios.lock temp_file=/usr/local/nagios/var/nagios.tmp temp_path=/tmp event_broker_options=-1 log_rotation_method=d log_archive_path=/usr/local/nagios/var/archives use_syslog=1 log_notifications=1 log_service_retries=1 log_host_retries=1 log_event_handlers=1 log_initial_states=0 log_external_commands=1 log_passive_checks=1 service_inter_check_delay_method=s max_service_check_spread=30 service_interleave_factor=s host_inter_check_delay_method=s max_host_check_spread=30 max_concurrent_checks=0 check_result_reaper_frequency=10 max_check_result_reaper_time=30 check_result_path=/usr/local/nagios/var/spool/checkresults max_check_result_file_age=3600 cached_host_check_horizon=15 cached_service_check_horizon=15 enable_predictive_host_dependency_checks=1 enable_predictive_service_dependency_checks=1 soft_state_dependencies=0 auto_reschedule_checks=0 auto_rescheduling_interval=30 auto_rescheduling_window=180 sleep_time=0.25 service_check_timeout=60 host_check_timeout=30 event_handler_timeout=30 notification_timeout=30 ocsp_timeout=5 perfdata_timeout=5 retain_state_information=1 state_retention_file=/usr/local/nagios/var/retention.dat retention_update_interval=60 use_retained_program_state=1 use_retained_scheduling_info=1 retained_host_attribute_mask=0 retained_service_attribute_mask=0 retained_process_host_attribute_mask=0 retained_process_service_attribute_mask=0 retained_contact_host_attribute_mask=0 retained_contact_service_attribute_mask=0 interval_length=60 check_for_updates=1 bare_update_check=0 use_aggressive_host_checking=0 execute_service_checks=1 accept_passive_service_checks=1 execute_host_checks=1 accept_passive_host_checks=1 enable_notifications=0 enable_event_handlers=1 process_performance_data=0 obsess_over_services=1 ocsp_command=submit_check_result obsess_over_hosts=0 ochp_command=obsessive_host_handler translate_passive_host_checks=0 passive_host_checks_are_soft=0 check_for_orphaned_services=1 check_for_orphaned_hosts=1 check_service_freshness=1 service_freshness_check_interval=60 check_host_freshness=0 host_freshness_check_interval=60 additional_freshness_latency=15 enable_flap_detection=1 low_service_flap_threshold=5.0 high_service_flap_threshold=20.0 low_host_flap_threshold=5.0 high_host_flap_threshold=20.0 date_format=us p1_file=/usr/local/nagios/bin/p1.pl enable_embedded_perl=1 use_embedded_perl_implicitly=1 illegal_object_name_chars=`~!$%^&*|'"<>?,()= illegal_macro_output_chars=`~$&|'"<> use_regexp_matching=0 use_true_regexp_matching=0 admin_email=nagios at localhost admin_pager=pagenagios at localhost daemon_dumps_core=0 use_large_installation_tweaks=0 enable_environment_macros=1 debug_level=0 debug_verbosity=1 debug_file=/usr/local/nagios/var/nagios.debug max_debug_file_size=1000000 -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ryan at u13.net Fri Feb 19 03:02:19 2010 From: ryan at u13.net (Ryan Rawdon) Date: Thu, 18 Feb 2010 21:02:19 -0500 Subject: E-mailing separate group for subset of hosts(and their services) Message-ID: <101e2773e0b7e632152214b277de0fa4@192.168.152.50> Hi, I've had a smallish deployment of Nagios for a while now, but now I need to add some more functionality to it. I need to have Nagios notify certain people when there is an issue with a host or any service on it. I see that adding their contactgroup to the host definition only notifies them when the host itself is down or up, however adding their contactgroup to the service definition would notify them whenever said service has an issue on any host - not just the ones I want them to be notified about. Where is the happy medium here? Do I need to create a duplicate copy of all services on these hosts just so that I can list them as the contacts? Thanks, Ryan ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Fri Feb 19 05:20:41 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Thu, 18 Feb 2010 20:20:41 -0800 Subject: Set Host Status from Distributed Monitoring Server In-Reply-To: <0015175748da9147e2047fe6b13d@google.com> References: <0015175748da9147e2047fe6b13d@google.com> Message-ID: <4B7E1199.1020806@hp.com> tktucker at gmail.com wrote: > Hello! Pardon me if this is already covered somewhere in the > documentation. I can't seem to find what I'm looking for. > > I have a working Nagios 3.2.0 environment in a distributed configuration. > One of my environments lives behind a firewall that is blocking icmp > traffic from the central server. In that > same environment I have a remote Nagios node that is successfully > sending service checks backs to the central server. > All of the nodes behind this firewall are reporting "DOWN" on the > central server nagios page, but their service checks are reporting > "OK". Is their a configuration > setting available so that the central server will report these nodes > as UP if it receives a successful check from the remote monitoring node? > The "translate_passive_host_checks" options sounds like it should > work, but it doesn't. I understand I can remove the check_command from > the host.cfg, but > the host status will be a Pending status. > > Any suggestions? Thank you in advance for your time and assistance. translate_passive_host_checks only works if you send them. Are you? I suspect whatever method you're using to send service check results upstream is only being used for service checks, and you may need to modify it to also send service check results. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Fri Feb 19 05:22:40 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Thu, 18 Feb 2010 20:22:40 -0800 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <4B7D861D.1090906@medin.name> References: <4B7D861D.1090906@medin.name> Message-ID: <4B7E1210.7000400@hp.com> Michael Medin wrote: > Hello > > Since I am pondering a replacement for the NSCA and NRPE protocol I > thought I would get some thoughts from the community? > So this is pretty much an "open floor" kind of thing to get some sense > of what people actually need and would want (if anything at all). > But to get some general idea I'll give you a few questions to start it off: > > Is a new protocol a good idea? > > Should a new protocol be "flat text based" or structured? > > Would webservices be the best way? > > Should the protocol be extensible? > > What features would a new protocol need to support? > - message, performance data, configuration, multiple queries, control > logic transfer, inventory, etc. > > What plattforms would it need to support? > > Whats polling scheme(s): active, passive, active/passive, proxy, etc? > > Master/slave scenarios? > In both NRPE and NSCA "nagios" is the master should the client be > allowed to act as master? > > What kind of security mechanisms do you need (host, password, > encryption, certificates, etc)? > > Client side "checks" or client side data gathering with server side checks? > (ie. check_nrpe get "ok" back, another option would be to get the > "value" and let the server decide if it is good or bad.) > > Multiple streams? > ie send to both Nagios and potentially other collectors (like rrd) > > For what it's worth, I'm pretty happy with NSCA and NRPE as-is, though I'd be interested to hear your motivation for replacing them (especially the resons for replacing them outright instead of extending the existing apps). ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michael at medin.name Fri Feb 19 07:04:41 2010 From: michael at medin.name (Michael Medin) Date: Fri, 19 Feb 2010 07:04:41 +0100 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <4B7E1210.7000400@hp.com> References: <4B7D861D.1090906@medin.name> <4B7E1210.7000400@hp.com> Message-ID: <4B7E29F9.5000307@medin.name> On 2010-02-19 05:22, Morris, Patrick wrote: > Michael Medin wrote: >> Hello >> >> >> Multiple streams? >> ie send to both Nagios and potentially other collectors (like rrd) >> > > For what it's worth, I'm pretty happy with NSCA and NRPE as-is, though > I'd be interested to hear your motivation for replacing them > (especially the resons for replacing them outright instead of > extending the existing apps). > Well, the main reason is that they have a number of limitations which I need to resolve and after speaking with Ethan about it I got the impression he would not be updating NRPE/NSCA any more (for instance Ton Voon has some patches to handle payload size which has not been applied). He would (or so I gathered) rather have a new replacement client(s). Also I tend to write programs in C++ and not C which sort of means it is simpler for me to re-write them. // Michael Medin ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From subscription at kkeane.com Fri Feb 19 07:25:12 2010 From: subscription at kkeane.com (Kevin Keane) Date: Thu, 18 Feb 2010 22:25:12 -0800 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <4B7E1210.7000400@hp.com> References: <4B7D861D.1090906@medin.name> <4B7E1210.7000400@hp.com> Message-ID: <724C3B2F1C5EB44D9108E471EC5E9933874C0B1566@akechi-denki.ad.nctechcenter.com> > -----Original Message----- > From: Morris, Patrick [mailto:patrick.morris at hp.com] > Sent: Thursday, February 18, 2010 8:23 PM > To: Michael Medin > Cc: nagios-users > Subject: Re: [Nagios-users] NRPE/NSCA replacement thoughts? > > Michael Medin wrote: > > Hello > > > > Since I am pondering a replacement for the NSCA and NRPE protocol I > > thought I would get some thoughts from the community? > > So this is pretty much an "open floor" kind of thing to get some sense > > of what people actually need and would want (if anything at all). I have actually written a minimal "replacement" to resolve a shortcoming in my own situation. Actually, it is the standard NSCA protocol wrapped in SSL or (optionally) SSH > > But to get some general idea I'll give you a few questions to start > it off: > > > > Is a new protocol a good idea? Maybe the answer to that question should come at the end instead of the beginning of the process? Generally, I believe that extending an existing protocol is usually a better idea than wholesale replacement, but sometimes one does have to clear-cut some junk. > > Should a new protocol be "flat text based" or structured? What is the design goal? I would advocate structured because of the flexibility, but if it means more bandwidth or using more processing power to parse the protocol, it may not be a great idea? > > Would webservices be the best way? Separate the structure of the protocol from the underlying transport mechanism. In my mind, a lack of that separation is actually the greatest weakness of the current protocol. Web services are an excellent choice of transport for many situations, and quite likely will be the the dominant one for the foreseeable future. Another potential transport is SSH, yet another could be RFC 1149/RFC 2549 avian carriers or whatever else somebody comes up with. Some advantages of Web services: Advantages: - built-in firewall compatibility - built-in encryption and authentication (via SSL) Drawbacks: - needs to be integrated with Web servers, potentially adding to complexity and performance issues. > > Should the protocol be extensible? Yes, within reason. One of the beauties of Nagios is in its simplicity, so if you add too much extensibility you might actually lose more than you gain. But on the other hand, some extensibility is important - otherwise, people will come up with their own extensions that don't really fit with the model. For instance, today's performance data is such an enhancement. > > What plattforms would it need to support? ASCII and Unicode. Other than that, is must be platform neutral, or you lose too much. > > Whats polling scheme(s): active, passive, active/passive, proxy, etc? Both have its place. Most people seem to love active polling, but firewalls may sometimes require passive polling. > > Master/slave scenarios? > > In both NRPE and NSCA "nagios" is the master should the client be > > allowed to act as master? Define "master" and "slave" in this context! If you are talking about the current model of multiple Nagios servers, it seems to me that this is more of a redesign of Nagios, rather than a protocol issue. One thing I would definitely like to see in this context is automatically adding services to Nagios when passive check results arrive. Keeping the list of services in sync between master and slave servers is one of the things that makes such a setup complicated. > > What kind of security mechanisms do you need (host, password, > > encryption, certificates, etc)? That should be left to the underlying transport. Why reinvent the wheel and have to keep chasing security holes if there are already plenty of good solutions available? > > Client side "checks" or client side data gathering with server side checks? > > (ie. check_nrpe get "ok" back, another option would be to get the > > "value" and let the server decide if it is good or bad.) Definitely client-side checks. Otherwise, you'd be looking at effectively re-inventing RPC. What if the "value" being checked is some huge binary blob, or the result of multiple interdependent system calls? > > Multiple streams? > > ie send to both Nagios and potentially other collectors (like rrd) No. Keep it simple, not a protocol to solve all the problems in the world. Nagios itself can forward to other collectors. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From subscription at kkeane.com Fri Feb 19 07:35:41 2010 From: subscription at kkeane.com (Kevin Keane) Date: Thu, 18 Feb 2010 22:35:41 -0800 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <4B7E29F9.5000307@medin.name> References: <4B7D861D.1090906@medin.name> <4B7E1210.7000400@hp.com> <4B7E29F9.5000307@medin.name> Message-ID: <724C3B2F1C5EB44D9108E471EC5E9933874C0B1567@akechi-denki.ad.nctechcenter.com> > -----Original Message----- > From: Michael Medin [mailto:michael at medin.name] > Sent: Thursday, February 18, 2010 10:05 PM > To: Morris, Patrick; nagios-users > Subject: Re: [Nagios-users] NRPE/NSCA replacement thoughts? > > On 2010-02-19 05:22, Morris, Patrick wrote: > > > > For what it's worth, I'm pretty happy with NSCA and NRPE as-is, > though > > I'd be interested to hear your motivation for replacing them > > (especially the resons for replacing them outright instead of > > extending the existing apps). > > Don't get me wrong - I like the idea of improvements to NRPE/NSCA, but I see a few issues with the motivation. > Well, the main reason is that they have a number of limitations which I > need to resolve and after speaking with Ethan about it I got the > impression he would not be updating NRPE/NSCA any more (for instance > Ton Voon has some patches to handle payload size which has not been > applied). He would (or so I gathered) rather have a new replacement > client(s). Client? Or protocol? > Also I tend to write programs in C++ and not C which sort of means it > is simpler for me to re-write them. That really isn't a good reason to throw out the investment thousands of people have made in a working NRPE/NSCA infrastructure! When the next developer comes into the project and likes Java, are we going to get yet another protocol? What if somebody wants to write a client for a new platform - does it have to be in C++? Now don't get me wrong: I actually agree that there are good reasons to update or even replace the protocol. But I'm quite concerned about the motivation, and the end result that would come from it. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michael at medin.name Fri Feb 19 08:18:57 2010 From: michael at medin.name (Michael Medin) Date: Fri, 19 Feb 2010 08:18:57 +0100 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <724C3B2F1C5EB44D9108E471EC5E9933874C0B1567@akechi-denki.ad.nctechcenter.com> References: <4B7D861D.1090906@medin.name> <4B7E1210.7000400@hp.com> <4B7E29F9.5000307@medin.name> <724C3B2F1C5EB44D9108E471EC5E9933874C0B1567@akechi-denki.ad.nctechcenter.com> Message-ID: <4B7E3B61.3060007@medin.name> On 2010-02-19 07:35, Kevin Keane wrote: > >> Well, the main reason is that they have a number of limitations which I >> need to resolve and after speaking with Ethan about it I got the >> impression he would not be updating NRPE/NSCA any more (for instance >> Ton Voon has some patches to handle payload size which has not been >> applied). He would (or so I gathered) rather have a new replacement >> client(s). >> > Client? Or protocol? > protocol (NRPE and NSCA has fixed limits on data length, Ton extended the protocol with an additional "packet type" that was "more data". > >> Also I tend to write programs in C++ and not C which sort of means it >> is simpler for me to re-write them. >> > That really isn't a good reason to throw out the investment thousands of people have made in a working NRPE/NSCA infrastructure! When the next developer comes into the project and likes Java, are we going to get yet another protocol? What if somebody wants to write a client for a new platform - does it have to be in C++? > Uhmm... I am talking about a protocol here, so feel free to implement a client in brainf*ck if you like... > Now don't get me wrong: I actually agree that there are good reasons to update or even replace the protocol. But I'm quite concerned about the motivation, and the end result that would come from it. > Well, in this case my "motivation" is pure and simple self interest... I have no noble ideas about "helping the nagios community". I need a new protocol, I will write one... pure and simple... I just figured I would get some insight into what to think about whilst doing it... // Michael Medin > > ------------------------------------------------------------------------------ > Download Intel® Parallel Studio Eval > Try the new software tools for yourself. Speed compiling, find bugs > proactively, and fine-tune applications for parallel performance. > See why Intel Parallel Studio got high marks during beta. > http://p.sf.net/sfu/intel-sw-dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > > ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michael at medin.name Fri Feb 19 08:27:59 2010 From: michael at medin.name (Michael Medin) Date: Fri, 19 Feb 2010 08:27:59 +0100 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <724C3B2F1C5EB44D9108E471EC5E9933874C0B1566@akechi-denki.ad.nctechcenter.com> References: <4B7D861D.1090906@medin.name> <4B7E1210.7000400@hp.com> <724C3B2F1C5EB44D9108E471EC5E9933874C0B1566@akechi-denki.ad.nctechcenter.com> Message-ID: <4B7E3D7F.9030509@medin.name> On 2010-02-19 07:25, Kevin Keane wrote: >> Is a new protocol a good idea? >> > Maybe the answer to that question should come at the end instead of the beginning of the process? > Well, if everyone thinks this is a doomed sinking ship there is no point to venture forth so for me this is the most important question actually :) > Generally, I believe that extending an existing protocol is usually a better idea than wholesale replacement, but sometimes one does have to clear-cut some junk. > The "protocol" part of NRPEand NSCA has far far to many flaws to merit extending them. >>> Should a new protocol be "flat text based" or structured? >>> > What is the design goal? I would advocate structured because of the flexibility, but if it means more bandwidth or using more processing power to parse the protocol, it may not be a great idea? > Thats exactly the question: whats more interesting, speed, simplicity or flexibility? Nagios has survived on its simplicity but lately has tried to "grow" into something more advanced. > >>> Should the protocol be extensible? >>> > Yes, within reason. One of the beauties of Nagios is in its simplicity, so if you add too much extensibility you might actually lose more than you gain. > > But on the other hand, some extensibility is important - otherwise, people will come up with their own extensions that don't really fit with the model. For instance, today's performance data is such an enhancement. > > >>> Master/slave scenarios? >>> In both NRPE and NSCA "nagios" is the master should the client be >>> allowed to act as master? >>> > Define "master" and "slave" in this context! If you are talking about the current model of multiple Nagios servers, it seems to me that this is more of a redesign of Nagios, rather than a protocol issue. > One pretty interesting idea I saw at the Nordic Nagios Meet last spring was a client (I don't recall the name now) that allowed you to define the checks and such on the clienht. This was then uploaded and incorporated into Nagios. This means nagios is no longer the master for configuration data instead the clients have become "masters". >>> What kind of security mechanisms do you need (host, password, >>> encryption, certificates, etc)? >>> > That should be left to the underlying transport. Why reinvent the wheel and have to keep chasing security holes if there are already plenty of good solutions available? > > >>> Client side "checks" or client side data gathering with server side checks? >>> (ie. check_nrpe get "ok" back, another option would be to get the >>> "value" and let the server decide if it is good or bad.) >>> > Definitely client-side checks. Otherwise, you'd be looking at effectively re-inventing RPC. What if the "value" being checked is some huge binary blob, or the result of multiple interdependent system calls? > > >>> Multiple streams? >>> ie send to both Nagios and potentially other collectors (like rrd) >>> > No. Keep it simple, not a protocol to solve all the problems in the world. Nagios itself can forward to other collectors. > > From what I have gathered this is pretty time and CPU consuming so another option would be to split off the data "outside" Nagios. // Michael Medin ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From misch at multinet.de Fri Feb 19 09:07:44 2010 From: misch at multinet.de (Michael Schwartzkopff) Date: Fri, 19 Feb 2010 09:07:44 +0100 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <4B7D861D.1090906@medin.name> References: <4B7D861D.1090906@medin.name> Message-ID: <201002190907.44650.misch@multinet.de> Am Donnerstag, 18. Februar 2010 19:25:33 schrieb Michael Medin: > Hello > > Since I am pondering a replacement for the NSCA and NRPE protocol I > thought I would get some thoughts from the community? > So this is pretty much an "open floor" kind of thing to get some sense > of what people actually need and would want (if anything at all). > But to get some general idea I'll give you a few questions to start it off: > > Is a new protocol a good idea? No. People thought about network managment protocol for more than 20 years now. I doubt that you can reinvent anything better. > Should a new protocol be "flat text based" or structured? No need for a new protocol. > Would webservices be the best way? > > Should the protocol be extensible? Yes. > What features would a new protocol need to support? > - message, performance data, configuration, multiple queries, control > logic transfer, inventory, etc. > > What plattforms would it need to support? > > Whats polling scheme(s): active, passive, active/passive, proxy, etc? > > Master/slave scenarios? > In both NRPE and NSCA "nagios" is the master should the client be > allowed to act as master? > > What kind of security mechanisms do you need (host, password, > encryption, certificates, etc)? > > Client side "checks" or client side data gathering with server side checks? > (ie. check_nrpe get "ok" back, another option would be to get the > "value" and let the server decide if it is good or bad.) > > Multiple streams? > ie send to both Nagios and potentially other collectors (like rrd) > > <> > > // Michael Medin Go for SNMP. This is the ONLY standard for vendor independent managment of devices. Nearly all devices support it. And I really doubt that you will be able to implement your agent on a Cisco IOS or a Check Point firewall appliance. Greetings, -- Dr. Michael Schwartzkopff MultiNET Services GmbH Addresse: Bretonischer Ring 7; 85630 Grasbrunn; Germany Tel: +49 - 89 - 45 69 11 0 Fax: +49 - 89 - 45 69 11 21 mob: +49 - 174 - 343 28 75 mail: misch at multinet.de web: www.multinet.de Sitz der Gesellschaft: 85630 Grasbrunn Registergericht: Amtsgericht M?nchen HRB 114375 Gesch?ftsf?hrer: G?nter Jurgeneit, Hubert Martens --- PGP Fingerprint: F919 3919 FF12 ED5A 2801 DEA6 AA77 57A4 EDD8 979B Skype: misch42 ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michael at medin.name Fri Feb 19 10:15:01 2010 From: michael at medin.name (Michael Medin) Date: Fri, 19 Feb 2010 10:15:01 +0100 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <201002190907.44650.misch@multinet.de> References: <4B7D861D.1090906@medin.name> <201002190907.44650.misch@multinet.de> Message-ID: <4B7E5695.5080609@medin.name> On 2010-02-19 09:07, Michael Schwartzkopff wrote: > > Go for SNMP. This is the ONLY standard for vendor independent managment of > devices. Nearly all devices support it. And I really doubt that you will be > able to implement your agent on a Cisco IOS or a Check Point firewall > appliance. > Interesting point. Yet strangely enough most people who uses nagios (or for that matter other monitoring tools) uses various "other protocol" like wbem, ssh, nrpe, nsca, etc etc. So is that because they have "yet to see the light" or because the (SNMP) clients are lacking something? Or is it because of limitations in the SNMP protocol? (I was under the impression that the efforts of WBEM was to fix various issues with SNMP?). // Michael Medin ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From flyinvap at orange.fr Fri Feb 19 11:19:30 2010 From: flyinvap at orange.fr (Flyinvap) Date: Fri, 19 Feb 2010 11:19:30 +0100 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <201002190907.44650.misch@multinet.de> References: <4B7D861D.1090906@medin.name> <201002190907.44650.misch@multinet.de> Message-ID: <20100219111930.2592c970@orange.fr> Hi, Le Fri, 19 Feb 2010 09:07:44 +0100, Michael Schwartzkopff a ?crit : > > Should a new protocol be "flat text based" or structured? > > No need for a new protocol. It's possible to replace NRPE by SNMPv1/2 or v3 or SSH. NCSA could may be replace by SNMP informs ? > > Would webservices be the best way? It could be but what about performance ? NRPE is very fast but not secure [1], SNMP v1 is slower and insecure. SNMP v3 or SSH are secure but take some resources. I made some test with those 4 protocols. To check if a process is running takes by average : - 27 ms with nrpe (with ssl) - 62 ms with snmp v1 - 107 ms with snmp v3 (SHA for authentication and AES for privacy) - 113 ms with SSHv2 (authentication by certifcate) [1] http://tracker.nagios.org/bug_view_advanced_page.php?bug_id=125 -- Flyinvap ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From sam.bancal at gmail.com Fri Feb 19 11:27:10 2010 From: sam.bancal at gmail.com (Samuel Bancal) Date: Fri, 19 Feb 2010 11:27:10 +0100 Subject: service notification when host is down In-Reply-To: <33FEF5BF-5162-4366-B84F-DDEF1758A36F@ena.com> References: <4fd99abf1002170742m2a7d6bb3y55abbfec20940e20@mail.gmail.com> <4B7C1EE5.1010007@hp.com> <4fd99abf1002180147j55be849vb14dd49a86816e31@mail.gmail.com> <33FEF5BF-5162-4366-B84F-DDEF1758A36F@ena.com> Message-ID: <4fd99abf1002190227i5f8950f6tb98d3f0b1c3b3ff9@mail.gmail.com> Yes I agree... But, I do export to a third application the service's perfdata. (using service_perfdata_file* instructions of Nagios). That third application is extracting from it whether services are up or down and doing some actions in consequence. This behavior is a problem to me because of two facts : - The third application is aware of a problem before the timing I expect it to be. (normally should be around "max_check_attempts * retry_check_interval ") - The administrator may not receive e-mail notification if the host is up again before the host gets "CRITICAL HARD". And the information has passed to that third application anyway! Hope you understand a bit more the inconsistency. Samuel Bancal 2010/2/18 Marc Powell > > On Feb 18, 2010, at 3:47 AM, Samuel Bancal wrote: > > > Thanks for your answer, > > > > In fact it is normal behavior to me also. > > Thing that is not "normal behavior" to me is that between two checks, > Nagios jumps from "SOFT 1" to "HARD 1" without doing the steps "SOFT 1" > > "SOFT 2" > "SOFT 3" and finally "HARD 4". > > If the host is down, why should nagios go through all that? There's no > possibility for the service to be up when the host is not. > > -- > Marc > > > > ------------------------------------------------------------------------------ > Download Intel® Parallel Studio Eval > Try the new software tools for yourself. Speed compiling, find bugs > proactively, and fine-tune applications for parallel performance. > See why Intel Parallel Studio got high marks during beta. > http://p.sf.net/sfu/intel-sw-dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Samuel Bancal - CH -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From misch at multinet.de Fri Feb 19 11:28:25 2010 From: misch at multinet.de (Michael Schwartzkopff) Date: Fri, 19 Feb 2010 11:28:25 +0100 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <20100219111930.2592c970@orange.fr> References: <4B7D861D.1090906@medin.name> <201002190907.44650.misch@multinet.de> <20100219111930.2592c970@orange.fr> Message-ID: <201002191128.25788.misch@multinet.de> Am Freitag, 19. Februar 2010 11:19:30 schrieb Flyinvap: > Hi, > > Le Fri, 19 Feb 2010 09:07:44 +0100, > > Michael Schwartzkopff a ?crit : > > > Should a new protocol be "flat text based" or structured? > > > > No need for a new protocol. > > It's possible to replace NRPE by SNMPv1/2 or v3 or SSH. NCSA could may > be replace by SNMP informs ? Yes. > > > Would webservices be the best way? > > It could be but what about performance ? NRPE is very fast but not > secure [1], SNMP v1 is slower and insecure. SNMP v3 or SSH are secure > but take some resources. - SNMPv1 is quite secure if you use ACLs. - SNMPv3 should not be any problem for any recent hardware. > I made some test with those 4 protocols. To check if a process is > running takes by average : > - 27 ms with nrpe (with ssl) > - 62 ms with snmp v1 > - 107 ms with snmp v3 (SHA for authentication and AES for privacy) > - 113 ms with SSHv2 (authentication by certifcate) > > [1] http://tracker.nagios.org/bug_view_advanced_page.php?bug_id=125 -- Dr. Michael Schwartzkopff MultiNET Services GmbH Addresse: Bretonischer Ring 7; 85630 Grasbrunn; Germany Tel: +49 - 89 - 45 69 11 0 Fax: +49 - 89 - 45 69 11 21 mob: +49 - 174 - 343 28 75 mail: misch at multinet.de web: www.multinet.de Sitz der Gesellschaft: 85630 Grasbrunn Registergericht: Amtsgericht M?nchen HRB 114375 Gesch?ftsf?hrer: G?nter Jurgeneit, Hubert Martens --- PGP Fingerprint: F919 3919 FF12 ED5A 2801 DEA6 AA77 57A4 EDD8 979B Skype: misch42 ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From flyinvap at orange.fr Fri Feb 19 11:58:45 2010 From: flyinvap at orange.fr (Flyinvap) Date: Fri, 19 Feb 2010 11:58:45 +0100 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <201002191128.25788.misch@multinet.de> References: <4B7D861D.1090906@medin.name> <201002190907.44650.misch@multinet.de> <20100219111930.2592c970@orange.fr> <201002191128.25788.misch@multinet.de> Message-ID: <20100219115845.16615a6c@orange.fr> Le Fri, 19 Feb 2010 11:28:25 +0100, Michael Schwartzkopff a ?crit : > - SNMPv1 is quite secure if you use ACLs. Quite secure ? With UDP (spoofing) and a community not encrypted ? SNMP : Security Not My Problem ;-) > - SNMPv3 should not be any problem for any recent hardware. Yes, but it uses more resources than NRPE so you can probably check less services. -- Flyinvap ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From misch at multinet.de Fri Feb 19 12:05:00 2010 From: misch at multinet.de (Michael Schwartzkopff) Date: Fri, 19 Feb 2010 12:05:00 +0100 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <20100219115845.16615a6c@orange.fr> References: <4B7D861D.1090906@medin.name> <201002191128.25788.misch@multinet.de> <20100219115845.16615a6c@orange.fr> Message-ID: <201002191205.01126.misch@multinet.de> Am Freitag, 19. Februar 2010 11:58:45 schrieb Flyinvap: > Le Fri, 19 Feb 2010 11:28:25 +0100, > > Michael Schwartzkopff a ?crit : > > - SNMPv1 is quite secure if you use ACLs. > > Quite secure ? With UDP (spoofing) and a community not encrypted ? > SNMP : Security Not My Problem ;-) OK. With ip spoofing you cen send packages. But if you do not the routing back you will never receive the answer. So what. > > - SNMPv3 should not be any problem for any recent hardware. > > Yes, but it uses more resources than NRPE so you can probably check less > services. That's why I use SNMPv1 with ACLs. Greetings, -- Dr. Michael Schwartzkopff MultiNET Services GmbH Addresse: Bretonischer Ring 7; 85630 Grasbrunn; Germany Tel: +49 - 89 - 45 69 11 0 Fax: +49 - 89 - 45 69 11 21 mob: +49 - 174 - 343 28 75 mail: misch at multinet.de web: www.multinet.de Sitz der Gesellschaft: 85630 Grasbrunn Registergericht: Amtsgericht M?nchen HRB 114375 Gesch?ftsf?hrer: G?nter Jurgeneit, Hubert Martens --- PGP Fingerprint: F919 3919 FF12 ED5A 2801 DEA6 AA77 57A4 EDD8 979B Skype: misch42 ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gtj at addicks.org Fri Feb 19 13:40:08 2010 From: gtj at addicks.org (gtj at addicks.org) Date: Fri, 19 Feb 2010 12:40:08 +0000 Subject: Processing External Commands In-Reply-To: <46506D5C-8A7C-4F74-A5A4-FD97C959F233@ena.com> References: <20100218111717.GB3916@valley.addicks.org> <5DE387F3-EC15-4A89-8B4E-6D1B6170D9C3@ena.com> <20100218134430.GE3916@valley.addicks.org> <20100218162913.GF3916@valley.addicks.org> <46506D5C-8A7C-4F74-A5A4-FD97C959F233@ena.com> Message-ID: On Thu, 18 Feb 2010 12:17:31 -0600, Marc Powell wrote: > On Feb 18, 2010, at 10:29 AM, Glynne Jones wrote: > >> Doh!, cheers Marc. Forgot those were there. Don't think those are >> getting the way. Think it's more likely to be some of the other tables >> getting large. Have you changed the table engine from MyISAM to InnoDB? > > Still all myISAM. Ok, thanks. I'll have to look at some db optimisations. My MRTG stats are showing an average of 3000 external commands being processed on the 5 minute average, so its quite busy. Cheers, Glynne ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Feb 19 14:43:44 2010 From: marc at ena.com (Marc Powell) Date: Fri, 19 Feb 2010 07:43:44 -0600 Subject: Processing External Commands In-Reply-To: References: <20100218111717.GB3916@valley.addicks.org> <5DE387F3-EC15-4A89-8B4E-6D1B6170D9C3@ena.com> <20100218134430.GE3916@valley.addicks.org> <20100218162913.GF3916@valley.addicks.org> <46506D5C-8A7C-4F74-A5A4-FD97C959F233@ena.com> Message-ID: On Feb 19, 2010, at 6:40 AM, wrote: > My MRTG stats are showing an average of 3000 external commands being > processed on the 5 minute average, so its quite busy. For perspective I do nearly 10,000/5min with no problem (using ndoutils b8). -- Marc ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Fri Feb 19 13:35:25 2010 From: jim at jimavery.me.uk (Jim Avery) Date: Fri, 19 Feb 2010 12:35:25 +0000 Subject: E-mailing separate group for subset of hosts(and their services) In-Reply-To: <101e2773e0b7e632152214b277de0fa4@192.168.152.50> References: <101e2773e0b7e632152214b277de0fa4@192.168.152.50> Message-ID: <765d77c81002190435g5596ca24j8fea0fb3bcf86325@mail.gmail.com> On 19 February 2010 02:02, Ryan Rawdon wrote: > > Hi, > > I've had a smallish deployment of Nagios for a while now, but now I need > to add some more functionality to it. ?I need to have Nagios notify certain > people when there is an issue with a host or any service on it. ?I see that > adding their contactgroup to the host definition only notifies them when > the host itself is down or up, however adding their contactgroup to the > service definition would notify them whenever said service has an issue on > any host - not just the ones I want them to be notified about. > > Where is the happy medium here? ?Do I need to create a duplicate copy of > all services on these hosts just so that I can list them as the contacts? It depends what version of Nagios you are using (what version are you using?). In Nagios 3, the service will inherit any contactgroups from the host (unless any are explicitly defined for the service). See: http://nagios.sourceforge.net/docs/3_0/objectinheritance.html ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From subscription at kkeane.com Fri Feb 19 11:08:35 2010 From: subscription at kkeane.com (Kevin Keane) Date: Fri, 19 Feb 2010 02:08:35 -0800 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <4B7E3D7F.9030509@medin.name> References: <4B7D861D.1090906@medin.name> <4B7E1210.7000400@hp.com> <724C3B2F1C5EB44D9108E471EC5E9933874C0B1566@akechi-denki.ad.nctechcenter.com> <4B7E3D7F.9030509@medin.name> Message-ID: <724C3B2F1C5EB44D9108E471EC5E9933874C0B1569@akechi-denki.ad.nctechcenter.com> > -----Original Message----- > From: Michael Medin [mailto:michael at medin.name] > > On 2010-02-19 07:25, Kevin Keane wrote: > >> Is a new protocol a good idea? > >> > > Maybe the answer to that question should come at the end instead of > the beginning of the process? > > > Well, if everyone thinks this is a doomed sinking ship there is no > point > to venture forth so for me this is the most important question actually > :) OK, I see your point. Still, I think without understanding what the new protocol would improve, it's very hard to say even whether it's a doomed ship. The main concern, the main thing that could doom the ship, is compatibility. Many great ideas die because of it. You know the old joke how it was possible to create the world in seven days? He didn't have to worry about an installed base... And it's why we still have Fax machines. BTW, I haven't followed icinga lately - maybe these guys have some things cooking on that end, too? > > Generally, I believe that extending an existing protocol is usually a > better idea than wholesale replacement, but sometimes one does have to > clear-cut some junk. > > > The "protocol" part of NRPEand NSCA has far far to many flaws to merit > extending them. I'm not sure about that. It seems to work reasonably well for what it is supposed to do. I am only aware of two issues with it: insufficient separation of protocol and transport, and an awkward mechanism for performance data. A third issue is that it's not documented very well, or at least not easy to find. > >>> Should a new protocol be "flat text based" or structured? > >>> > > What is the design goal? I would advocate structured because of the > flexibility, but if it means more bandwidth or using more processing > power to parse the protocol, it may not be a great idea? > > > Thats exactly the question: whats more interesting, speed, simplicity > or flexibility? Interoperability is yet another one? We don't really want Yet Another Protocol unless there is some very clear value in it. Quite honestly, I don't think that speed and simplicity are mutually exclusive with flexibility. > Nagios has survived on its simplicity but lately has tried to "grow" > into something more advanced. Can you elaborate what you are thinking of, and how the NRPE/NSCA protocol is limiting it in that respect? > >>> Master/slave scenarios? > >>> In both NRPE and NSCA "nagios" is the master should the client be > >>> allowed to act as master? > >>> > > Define "master" and "slave" in this context! If you are talking about > the current model of multiple Nagios servers, it seems to me that this > is more of a redesign of Nagios, rather than a protocol issue. > > > One pretty interesting idea I saw at the Nordic Nagios Meet last spring > was a client (I don't recall the name now) that allowed you to define > the checks and such on the clienht. This was then uploaded and > incorporated into Nagios. This means nagios is no longer the master for > configuration data instead the clients have become "masters". My client does some of the same thing. My guess is that quite a few people have similar homegrown solutions already. You can probably find many ideas at the Monitoringexchange.org site. What that means is a couple things: first of all, look what people are doing in terms of homegrown solutions. That's probably where the most interest is. If your protocol solves the same problem that 1000 people had to spend days scripting out before, you are likely to have a good starting point for a feature set. > >>> Multiple streams? > >>> ie send to both Nagios and potentially other collectors (like rrd) > >>> > > No. Keep it simple, not a protocol to solve all the problems in the > world. Nagios itself can forward to other collectors. > > > > > From what I have gathered this is pretty time and CPU consuming so > another option would be to split off the data "outside" Nagios. I think to do this cleanly, you would have to rearchitect Nagios itself. That might not be a bad idea, but probably a much bigger scope than you had in mind. It's not something that could be solved at the protocol level. The issue I'm thinking of is that the "traditional" Nagios checks would bypass this mechanism, since you would "only" replace NRPE/NSCA. Similarly, NRPE/NSCA wouldn't go away for many years to come due to the large installed base. So the best way to solve this would probably be to completely remove the command logic from the Nagios "kernel" and extract it into a separate component. The kernel would just handle the status processing. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From subscription at kkeane.com Fri Feb 19 13:42:41 2010 From: subscription at kkeane.com (Kevin Keane) Date: Fri, 19 Feb 2010 04:42:41 -0800 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <201002191205.01126.misch@multinet.de> References: <4B7D861D.1090906@medin.name> <201002191128.25788.misch@multinet.de> <20100219115845.16615a6c@orange.fr> <201002191205.01126.misch@multinet.de> Message-ID: <724C3B2F1C5EB44D9108E471EC5E9933874C0B156B@akechi-denki.ad.nctechcenter.com> > -----Original Message----- > From: Michael Schwartzkopff [mailto:misch at multinet.de] > > Am Freitag, 19. Februar 2010 11:58:45 schrieb Flyinvap: > > Le Fri, 19 Feb 2010 11:28:25 +0100, > > > > Michael Schwartzkopff a ?crit : > > > - SNMPv1 is quite secure if you use ACLs. > > > > Quite secure ? With UDP (spoofing) and a community not encrypted ? > > SNMP : Security Not My Problem ;-) > > OK. With ip spoofing you cen send packages. But if you do not the > routing back you will never receive the answer. So what. You could make that argument about all IP spoofing. Yet it is one of the more popular hacker tools. Mostly because many types of hack don't require the answer. Either the answer is well known or easily guessable (such as the prompts in the SMTP protocol, for instance), or the mischief happens by the spoofed packet. With UDP the problem is compounded because there is no need to even try to establish a connection. Just send your spoofed packet complete with payload and all. Right off the bat, I can think of three ways to use that quite nefariously. 1) Denial of service. Simply send so many bogus SNMP packets to a Nagios server that it gets overloaded and stops handling the real check results. 2) Denial of service. Send malformed SNMP packets that crash Nagios. Are you sure that there is no buffer overflow in Nagios? Anywhere? 3) Perform an attack on a monitored server and send spoofed "I am healthy" SNMP messages to Nagios while the server in reality is down. What messages to send is easy to find out by plain network monitoring, since SNMP is not encrypted. And the more powerful the protocol is, the greater the risk. Which goes totally counter to the idea of "NRPE or NSCA is not enough". > > > - SNMPv3 should not be any problem for any recent hardware. > > > > Yes, but it uses more resources than NRPE so you can probably check > less > > services. > > That's why I use SNMPv1 with ACLs. ... which is a proprietary extension. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From subscription at kkeane.com Fri Feb 19 12:17:11 2010 From: subscription at kkeane.com (Kevin Keane) Date: Fri, 19 Feb 2010 03:17:11 -0800 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <201002191128.25788.misch@multinet.de> References: <4B7D861D.1090906@medin.name> <201002190907.44650.misch@multinet.de> <20100219111930.2592c970@orange.fr> <201002191128.25788.misch@multinet.de> Message-ID: <724C3B2F1C5EB44D9108E471EC5E9933874C0B156A@akechi-denki.ad.nctechcenter.com> > -----Original Message----- > From: Michael Schwartzkopff [mailto:misch at multinet.de] > Sent: Friday, February 19, 2010 2:28 AM > To: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] NRPE/NSCA replacement thoughts? > > Am Freitag, 19. Februar 2010 11:19:30 schrieb Flyinvap: > > Hi, > > > > Le Fri, 19 Feb 2010 09:07:44 +0100, > > > > Michael Schwartzkopff a ?crit : > > > > Should a new protocol be "flat text based" or structured? > > > > > > No need for a new protocol. > > > > It's possible to replace NRPE by SNMPv1/2 or v3 or SSH. NCSA could > may > > be replace by SNMP informs ? > > Yes. Once you dig into the details, you'll find the devil.. . I don't think SNMP is a good choice; in fact, I think it would be a step backwards from the current protocol. I actually chose NSCA over SNMP in my own configuration because SNMP had too many issues. SNMP actually is great for what it was designed for, but that is a very different purpose. Among the issues I can think of: - You can't run SNMP over the public Internet. That's a killer in my mind. - SNMP is insecure. CERT recommends disabling SNMP whenever possible because of the many security issues. And the list of vulnerabilities in SNMP is scary. - No security to speak of in SNMPv1 and v2 (other than the plain text "community string" which is almost always "private"). - SNMP is UDP (although you can make it work over TCP). UDP is bad, very bad, for Nagios' purposes. It makes firewall issues extra tough to deal with. You can't send it through SSH tunnels, you can't wrap it in HTTP requests, you pretty much can't do anything with it except route it. - UDP is more prone to IP spoofing and DOS attacks than TCP. - Who is going to assign/manage the OIDs? - No SNMPv3 on Windows (unless you spend considerable amount installing and configuring net-snmp). > > > > Would webservices be the best way? > > > > It could be but what about performance ? NRPE is very fast but not > > secure [1], SNMP v1 is slower and insecure. SNMP v3 or SSH are secure > > but take some resources. > > - SNMPv1 is quite secure if you use ACLs. ACLs are proprietary Cisco extensions. > - SNMPv3 should not be any problem for any recent hardware. SNMPv3 is not universally supported. The biggest issue is that Microsoft doesn't support it (they are pushing their own protocol, MoM instead). You can replace Microsoft's SNMP with net-snmp or others, but that defeats the "supported out of the box" idea. If I have to install something, I'd rather install a traditional NRPE or NSCA client. > > I made some test with those 4 protocols. To check if a process is > > running takes by average : > > - 27 ms with nrpe (with ssl) > > - 62 ms with snmp v1 > > - 107 ms with snmp v3 (SHA for authentication and AES for privacy) > > - 113 ms with SSHv2 (authentication by certifcate) Seems to me that NRPE beats SNMP to begin with? ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From gtj at addicks.org Fri Feb 19 15:47:04 2010 From: gtj at addicks.org (Glynne Jones) Date: Fri, 19 Feb 2010 14:47:04 +0000 Subject: Processing External Commands In-Reply-To: References: <20100218111717.GB3916@valley.addicks.org> <5DE387F3-EC15-4A89-8B4E-6D1B6170D9C3@ena.com> <20100218134430.GE3916@valley.addicks.org> <20100218162913.GF3916@valley.addicks.org> <46506D5C-8A7C-4F74-A5A4-FD97C959F233@ena.com> Message-ID: <20100219144704.GA32413@valley.addicks.org> On Fri, Feb 19, 2010 at 07:43:44AM -0600, Marc Powell wrote: > On Feb 19, 2010, at 6:40 AM, wrote: > > My MRTG stats are showing an average of 3000 external commands being > > processed on the 5 minute average, so its quite busy. > > For perspective I do nearly 10,000/5min with no problem (using ndoutils b8). Good to know. I'm using ndoutils b9 and frequently seeing delays greater than 1 minute between logging the EXTERNAL COMMAND and seeinf the result going through. Going from the NSCA daemon receving the result to it being logged in the external command file takes a few seconds, which is fine, but the delay in actually getting the result is the pain! Glynne ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From flyinvap at orange.fr Fri Feb 19 16:20:43 2010 From: flyinvap at orange.fr (Flyinvap) Date: Fri, 19 Feb 2010 16:20:43 +0100 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <724C3B2F1C5EB44D9108E471EC5E9933874C0B156A@akechi-denki.ad.nctechcenter.com> References: <4B7D861D.1090906@medin.name> <201002190907.44650.misch@multinet.de> <20100219111930.2592c970@orange.fr> <201002191128.25788.misch@multinet.de> <724C3B2F1C5EB44D9108E471EC5E9933874C0B156A@akechi-denki.ad.nctechcenter.com> Message-ID: <20100219162043.091da5d5@orange.fr> Le Fri, 19 Feb 2010 03:17:11 -0800, Kevin Keane a ?crit : > Seems to me that NRPE beats SNMP to begin with? I you need an insecure protocol, choose nrpe. There is no key or password in nrep configuration. DH key is defined at during compilation ... -- Flyinvap ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From mark.frost1 at pepsi.com Fri Feb 19 16:24:25 2010 From: mark.frost1 at pepsi.com (Frost, Mark {PBG}) Date: Fri, 19 Feb 2010 10:24:25 -0500 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <4B7E3D7F.9030509@medin.name> References: <4B7D861D.1090906@medin.name> <4B7E1210.7000400@hp.com> <724C3B2F1C5EB44D9108E471EC5E9933874C0B1566@akechi-denki.ad.nctechcenter.com> <4B7E3D7F.9030509@medin.name> Message-ID: I just wanted to throw in a couple of comments on this thread regarding some items I've seen mentioned. This pertains more to our particular situation here and I'm sure isn't everyone else's situation. On the topic of a setup that would push configuration more to the client side, that would be somewhat of an administrative nightmare for us. We run a lot of hosts/checks that are all very different (i.e. we don't do 10K checks of the exact same type of router or host). In many cases there are custom scripts that apply only to a certain set of hosts. Pushing changes to sets of hosts would be, erm, horrible for us. On SNMP... while I know people who use this for Nagios quite effectively, this is also a problem in our situation. While we "own" the boxes we monitor, we don't have admin control over them. The entities that do maintain/control SNMP and are unlikely to allow us to control or extend them much. Particularly with custom scripts that we require for Nagios. So while SNMP may be a solution for some, it wouldn't be for us. I too must say that I'm reasonably happy at least with NRPE (using NRPE on Unix/Linux and NSClient++ on Windows). I did change the NRPE source to allow a larger buffer for messages which seems to comfortably allow larger messages to come through. NSCA as a protocol is OK, but I've had to write my own client to push check results back from our distributed nodes to the central Nagios servers which has been stable, but a pain to maintain. I'm looking forward to possibly using Merlin when I feel it's more stable and dumping my custom stuff entirely. That's not to say that NRPE or NSCA couldn't use a revamp. I'd just have to say it's not been on the top of my "wish it were improved" list. My $0.02. Thanks Mark ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From michael.friedrich at univie.ac.at Fri Feb 19 16:30:21 2010 From: michael.friedrich at univie.ac.at (Michael Friedrich) Date: Fri, 19 Feb 2010 16:30:21 +0100 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <724C3B2F1C5EB44D9108E471EC5E9933874C0B1569@akechi-denki.ad.nctechcenter.com> References: <4B7D861D.1090906@medin.name> <4B7E1210.7000400@hp.com> <724C3B2F1C5EB44D9108E471EC5E9933874C0B1566@akechi-denki.ad.nctechcenter.com> <4B7E3D7F.9030509@medin.name> <724C3B2F1C5EB44D9108E471EC5E9933874C0B1569@akechi-denki.ad.nctechcenter.com> Message-ID: <4B7EAE8D.900@univie.ac.at> Kevin Keane wrote: > BTW, I haven't followed icinga lately - maybe these guys have some things cooking on that end, too? You should do that anyways ;-) Check upcoming 1.0.1 ... Regarding nsca/nrpe: cooking - no, thinking and putting on todo list - yes. But we are interested in thoughts and suggestions. If you have something in mind, you are welcome to put into the dev tracker at dev.icinga.org - sections for nrpe/nsca already exist below core. Patches are welcome too. Kind regards, Michael > >>> Generally, I believe that extending an existing protocol is usually a >> better idea than wholesale replacement, but sometimes one does have to >> clear-cut some junk. >> The "protocol" part of NRPEand NSCA has far far to many flaws to merit >> extending them. > > I'm not sure about that. It seems to work reasonably well for what it is supposed to do. > > I am only aware of two issues with it: insufficient separation of protocol and transport, and an awkward mechanism for performance data. A third issue is that it's not documented very well, or at least not easy to find. > >>>>> Should a new protocol be "flat text based" or structured? >>>>> >>> What is the design goal? I would advocate structured because of the >> flexibility, but if it means more bandwidth or using more processing >> power to parse the protocol, it may not be a great idea? >> Thats exactly the question: whats more interesting, speed, simplicity >> or flexibility? > > Interoperability is yet another one? We don't really want Yet Another Protocol unless there is some very clear value in it. > > Quite honestly, I don't think that speed and simplicity are mutually exclusive with flexibility. > >> Nagios has survived on its simplicity but lately has tried to "grow" >> into something more advanced. > > Can you elaborate what you are thinking of, and how the NRPE/NSCA protocol is limiting it in that respect? > >>>>> Master/slave scenarios? >>>>> In both NRPE and NSCA "nagios" is the master should the client be >>>>> allowed to act as master? >>>>> >>> Define "master" and "slave" in this context! If you are talking about >> the current model of multiple Nagios servers, it seems to me that this >> is more of a redesign of Nagios, rather than a protocol issue. >> One pretty interesting idea I saw at the Nordic Nagios Meet last spring >> was a client (I don't recall the name now) that allowed you to define >> the checks and such on the clienht. This was then uploaded and >> incorporated into Nagios. This means nagios is no longer the master for >> configuration data instead the clients have become "masters". > > My client does some of the same thing. My guess is that quite a few people have similar homegrown solutions already. You can probably find many ideas at the Monitoringexchange.org site. > > What that means is a couple things: first of all, look what people are doing in terms of homegrown solutions. That's probably where the most interest is. If your protocol solves the same problem that 1000 people had to spend days scripting out before, you are likely to have a good starting point for a feature set. > >>>>> Multiple streams? >>>>> ie send to both Nagios and potentially other collectors (like rrd) >>>>> >>> No. Keep it simple, not a protocol to solve all the problems in the >> world. Nagios itself can forward to other collectors. >>> >> From what I have gathered this is pretty time and CPU consuming so >> another option would be to split off the data "outside" Nagios. > > I think to do this cleanly, you would have to rearchitect Nagios itself. That might not be a bad idea, but probably a much bigger scope than you had in mind. It's not something that could be solved at the protocol level. > > The issue I'm thinking of is that the "traditional" Nagios checks would bypass this mechanism, since you would "only" replace NRPE/NSCA. Similarly, NRPE/NSCA wouldn't go away for many years to come due to the large installed base. > > So the best way to solve this would probably be to completely remove the command logic from the Nagios "kernel" and extract it into a separate component. The kernel would just handle the status processing. > > > ------------------------------------------------------------------------------ > Download Intel® Parallel Studio Eval > Try the new software tools for yourself. Speed compiling, find bugs > proactively, and fine-tune applications for parallel performance. > See why Intel Parallel Studio got high marks during beta. > http://p.sf.net/sfu/intel-sw-dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- DI (FH) Michael Friedrich michael.friedrich at univie.ac.at Tel: +43 1 4277 14359 Vienna University Computer Center Universitaetsstrasse 7 A-1010 Vienna, Austria ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Feb 19 16:46:37 2010 From: marc at ena.com (Marc Powell) Date: Fri, 19 Feb 2010 09:46:37 -0600 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <201002191205.01126.misch@multinet.de> References: <4B7D861D.1090906@medin.name> <201002191128.25788.misch@multinet.de> <20100219115845.16615a6c@orange.fr> <201002191205.01126.misch@multinet.de> Message-ID: <7E45DA9F-5B82-4529-B73B-BFA2E28FD080@ena.com> On Feb 19, 2010, at 5:05 AM, Michael Schwartzkopff wrote: >> Quite secure ? With UDP (spoofing) and a community not encrypted ? >> SNMP : Security Not My Problem ;-) > > OK. With ip spoofing you cen send packages. But if you do not the routing back > you will never receive the answer. So what. If the write community is known, that 'So what' can be quite a big deal. -- Marc ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From subscription at kkeane.com Fri Feb 19 16:47:32 2010 From: subscription at kkeane.com (Kevin Keane) Date: Fri, 19 Feb 2010 07:47:32 -0800 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <20100219162043.091da5d5@orange.fr> References: <4B7D861D.1090906@medin.name> <201002190907.44650.misch@multinet.de> <20100219111930.2592c970@orange.fr> <201002191128.25788.misch@multinet.de> <724C3B2F1C5EB44D9108E471EC5E9933874C0B156A@akechi-denki.ad.nctechcenter.com> <20100219162043.091da5d5@orange.fr> Message-ID: <724C3B2F1C5EB44D9108E471EC5E9933874C0B156C@akechi-denki.ad.nctechcenter.com> > -----Original Message----- > From: Flyinvap [mailto:flyinvap at orange.fr] > Sent: Friday, February 19, 2010 7:21 AM > To: nagios-users at lists.sourceforge.net > Subject: Re: [Nagios-users] NRPE/NSCA replacement thoughts? > > Le Fri, 19 Feb 2010 03:17:11 -0800, > Kevin Keane a ?crit : > > > Seems to me that NRPE beats SNMP to begin with? > > I you need an insecure protocol, choose nrpe. > > There is no key or password in nrep configuration. DH key is defined at > during compilation ... Oh, absolutely. And actually, I believe that's a good thing as long as you get to select the underlying transport mechanism. The authentication and encryption should really be provided by whatever carries the data, not by Nagios at all. With security-related issues, the fewer implementations you have, the better. Rather than reimplement encryption and authentication and then waiting for the security holes to get discovered, why not leverage the work on the SSH and Apache projects? With SNMP, there is no way to do that - you basically can't wrap UDP in any way. With NRPE, you can easily implement all kinds of homegrown solutions already: ssh tunnels, HTTPS. In the end, I agree with another poster: NRPE/NSCA could use a revamp (or outright replacement), but it's really not on the top of the list. And I'm concerned that the replacement may end up being worse than the original. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From subscription at kkeane.com Fri Feb 19 17:05:24 2010 From: subscription at kkeane.com (Kevin Keane) Date: Fri, 19 Feb 2010 08:05:24 -0800 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <7E45DA9F-5B82-4529-B73B-BFA2E28FD080@ena.com> References: <4B7D861D.1090906@medin.name> <201002191128.25788.misch@multinet.de> <20100219115845.16615a6c@orange.fr> <201002191205.01126.misch@multinet.de> <7E45DA9F-5B82-4529-B73B-BFA2E28FD080@ena.com> Message-ID: <724C3B2F1C5EB44D9108E471EC5E9933874C0B156D@akechi-denki.ad.nctechcenter.com> > -----Original Message----- > From: Marc Powell [mailto:marc at ena.com] > > On Feb 19, 2010, at 5:05 AM, Michael Schwartzkopff wrote: > > >> Quite secure ? With UDP (spoofing) and a community not encrypted ? > >> SNMP : Security Not My Problem ;-) > > > > OK. With ip spoofing you cen send packages. But if you do not the > routing back > > you will never receive the answer. So what. > > If the write community is known, that 'So what' can be quite a big > deal. You don't even need the community. CERT has a very long list of vulnerabilities, many of whom apply before the community string is ever evaluated. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jcall at verio.net Fri Feb 19 17:10:14 2010 From: jcall at verio.net (Jonathan Call) Date: Fri, 19 Feb 2010 11:10:14 -0500 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <4B7D861D.1090906@medin.name> References: <4B7D861D.1090906@medin.name> Message-ID: <04F3233F47E2714CB7431AE913E57E7703DAA985@IAD-WPRD-XCHB02.corp.verio.net> Here is my $0.02: I have a distributed Nagios2 system with 24,000+ service checks and 4000+ hosts. I rely heavily on NSCA to get the results from the slaves to the master. My issue seems to be with Nagios since I can't get a Nagios slave to process a mere thousand service checks using the documented method specified for NSCA before is starts overwhelming the server. I've had to resort to using the OCP_daemon method instead. No complaints about what NSCA does just with how poorly it seems to work within Nagios itself. > -----Original Message----- > From: Michael Medin [mailto:michael at medin.name] > Sent: Thursday, February 18, 2010 11:26 AM > To: nagios-users > Subject: [Nagios-users] NRPE/NSCA replacement thoughts? > > Hello > > Since I am pondering a replacement for the NSCA and NRPE protocol I > thought I would get some thoughts from the community? > So this is pretty much an "open floor" kind of thing to get some sense > of what people actually need and would want (if anything at all). This email message is intended for the use of the person to whom it has been sent, and may contain information that is confidential or legally protected. If you are not the intended recipient or have received this message in error, you are not authorized to copy, distribute, or otherwise use this message or its attachments. Please notify the sender immediately by return e-mail and permanently delete this message and any attachments. Verio, Inc. makes no warranty that this email is error or virus free. Thank you. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From flyinvap at orange.fr Fri Feb 19 17:19:21 2010 From: flyinvap at orange.fr (Flyinvap) Date: Fri, 19 Feb 2010 17:19:21 +0100 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <724C3B2F1C5EB44D9108E471EC5E9933874C0B156C@akechi-denki.ad.nctechcenter.com> References: <4B7D861D.1090906@medin.name> <201002190907.44650.misch@multinet.de> <20100219111930.2592c970@orange.fr> <201002191128.25788.misch@multinet.de> <724C3B2F1C5EB44D9108E471EC5E9933874C0B156A@akechi-denki.ad.nctechcenter.com> <20100219162043.091da5d5@orange.fr> <724C3B2F1C5EB44D9108E471EC5E9933874C0B156C@akechi-denki.ad.nctechcenter.com> Message-ID: <20100219171921.65d2cba4@orange.fr> Le Fri, 19 Feb 2010 07:47:32 -0800, Kevin Keane a ?crit : > With SNMP, there is no way to do that - you basically can't wrap UDP > in any way. With NRPE, you can easily implement all kinds of > homegrown solutions already: ssh tunnels, HTTPS. You can use SNMP on TCP. You can even use SNMP on SSH or DTLS [1]. net-snmp [2] can do that without implementing SSL tunnel or other kind of solution. With SNMP v3, you can use authentication and encryption. I'm not saying SNMP is a good solution but provides all security needed. Finally, SSH is cloud be the best way to execute remote plugins. [1] http://en.wikipedia.org/wiki/Datagram_Transport_Layer_Security [2] http://www.net-snmp.org/docs/man/snmpd.html -- Flyinvap ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From perldork at webwizarddesign.com Fri Feb 19 17:52:47 2010 From: perldork at webwizarddesign.com (Max) Date: Fri, 19 Feb 2010 11:52:47 -0500 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <20100219171921.65d2cba4@orange.fr> References: <4B7D861D.1090906@medin.name> <201002190907.44650.misch@multinet.de> <20100219111930.2592c970@orange.fr> <201002191128.25788.misch@multinet.de> <724C3B2F1C5EB44D9108E471EC5E9933874C0B156A@akechi-denki.ad.nctechcenter.com> <20100219162043.091da5d5@orange.fr> <724C3B2F1C5EB44D9108E471EC5E9933874C0B156C@akechi-denki.ad.nctechcenter.com> <20100219171921.65d2cba4@orange.fr> Message-ID: On Fri, Feb 19, 2010 at 11:19 AM, Flyinvap wrote: > Le Fri, 19 Feb 2010 07:47:32 -0800, > Kevin Keane a ?crit : > >> With SNMP, there is no way to do that - you basically can't wrap UDP >> in any way. With NRPE, you can easily implement all kinds of >> homegrown solutions already: ssh tunnels, HTTPS. > > You can use SNMP on TCP. You can even use SNMP on SSH or DTLS [1]. > net-snmp [2] can do that without implementing SSL tunnel or other kind > of solution. With SNMP v3, you can use authentication and encryption. > > I'm not saying SNMP is a good solution but provides all security > needed. We have had very good success with SNMP for polling; we are getting about 2200 hosts and 15000+ active checks per poller, about 80% of our checks are SNMP-based, 95% of our checks (that is a number from our performance graphing) return in < 1 sec. Our SNMP zone is heavily ACL'd and not reachable from outside of our networks, and yes, security is a very big design question that has to be answered early and revisited often when using SNMP. We do not use SNMP v3 for performance reasons. I use SNMP v3 for WAN SNMP polling and for small scale projects it works well and I have never had cracks or breakins through the SNMP agent (as compared to the many many cracks that have happened on web hosts with customer sites through PHP miscoding or other web-based vulns). The tradeoffs of SNMP vs other protocols Pros ===================== * It is a standard, so it does not lock anyone into using Nagios-specific protocols, in some orgs this is seen as a big advantage * It is well supported from a library and code perspective; many ways to integrate SNMP code into applications * MIBs give a common language for teams to use to communicate what checks do. In larger organizations, this langua franca can make conversations about monitoring more productive and more productive; MIBs also provide developer documentation for consumers of SNMP-based metrics / information. * It performs well for us in our environment. Yes, this is subjective, our performance graphs support my claim in this environment but I make no claims that this would hold true for every environment :). Wes Hardaker, of Sparta, did a study of SNMP performance over UDP vs TCP. In a well-behaved network SNMP over TCP did better; as the network quality degraded to more than 40% loss for short messages or ~ 30% for longer SNMP sessions, UDP actually worked better as a transport as TCP would give up altogether. There are many studies that have conflicting results in this area. Wes Hardaker - SNMP performance - TCP vs UDP http://www.ietf.org/proceedings/72/slides/opsarea-2.pdf SNMP over SSH http://www.ietf.org/proceedings/67/slides/isms-3.pdf Cons ======================================= * Security - have to carefully define network and host-based ACLs * Security - have to carefully define agent ACLs * Security - have to keep up with advisories * Performance - longer SNMP sessions (like pulling process tables or partition tables ) can time out even on pretty well-behaved networks. Most agents provde SNMP extensions that move the utilization checking to the agent side so that a check just retuns cooked utilization values * Development time - SNMP-based checks do take longer to create as there is an explicit contract with MIBs and OIDs that should be defined to communicate what a check does. We have an extention framework for Net-SNMP that make implementing a new SNMP check as easy as writing a perl module that returns a list or list of lists. Net-SNMP does have a number of methods that make writing SNMP extensions easier, but certainly none are as simple as NRPE. We use SNMP for our 'send a check from anywhere to Nagios' wrapper script. We have implemented the nSvcEvent and nHostEvent trap definitions in this script and we have filter code for the two traps in SNMPTT; this is a much lighter weight way to send an asynchronous event to Nagios than it would be to initiate a TCP session for every fault message with NSCA. I like NSCA and we use it too, so I am not knocking it. Our user audience is a mix of developers and SAs; we find that most SAs prefer NRPE for simplicity and that our developers are comfortable with either. Both communities like our SNMP trap sender wrapper, which functions as both a CLI script and as a perl module. I make no claim SNMP is better than NRPE nor NSCA, all 3 are excellent choices depending on the politics at an organization, security requirements, and network conditions. I do get annoyed at people who just throw out that SNMP is not a good choice without supporting evidence. it can be a fine choice, it is the only choice with many HW appliances. It can be used effectively and securely and it works fine in conjunction with other protocols. - Max ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From subscription at kkeane.com Fri Feb 19 19:11:52 2010 From: subscription at kkeane.com (Kevin Keane) Date: Fri, 19 Feb 2010 10:11:52 -0800 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <20100219171921.65d2cba4@orange.fr> References: <4B7D861D.1090906@medin.name> <201002190907.44650.misch@multinet.de> <20100219111930.2592c970@orange.fr> <201002191128.25788.misch@multinet.de> <724C3B2F1C5EB44D9108E471EC5E9933874C0B156A@akechi-denki.ad.nctechcenter.com> <20100219162043.091da5d5@orange.fr> <724C3B2F1C5EB44D9108E471EC5E9933874C0B156C@akechi-denki.ad.nctechcenter.com> <20100219171921.65d2cba4@orange.fr> Message-ID: <724C3B2F1C5EB44D9108E471EC5E9933874C0B1570@akechi-denki.ad.nctechcenter.com> > -----Original Message----- > From: Flyinvap [mailto:flyinvap at orange.fr] > > > With SNMP, there is no way to do that - you basically can't wrap UDP > > in any way. With NRPE, you can easily implement all kinds of > > homegrown solutions already: ssh tunnels, HTTPS. > > You can use SNMP on TCP. You can even use SNMP on SSH or DTLS [1]. I am risking the life of an innocent equine by flogging when I respond... I know, but it's really an unusual configuration (I was going to say "nonstandard" but technically speaking it is in the RFC). > net-snmp [2] can do that without implementing SSL tunnel or other kind > of solution. With SNMP v3, you can use authentication and encryption. It seems to me that SNMP v3 is not even in the running due to performance concerns; pretty much everybody in this discussion seems to have agreed on that point (and for me also, because of a lack of support by vendors; Microsoft, in particular). > I'm not saying SNMP is a good solution but provides all security > needed. > > Finally, SSH is cloud be the best way to execute remote plugins. Personally, I'm not sure if there is a single "best way". That is why I keep promoting separating the protocol from the transport. Basically, Nagios protocol data fits into Layer 6 or 7 of the OSI model, while the encryption and authentication really should be handled by layer 4 or 5. SSH really is a good way, but in many scenarios HTTPS may be even better. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From perldork at webwizarddesign.com Fri Feb 19 19:24:07 2010 From: perldork at webwizarddesign.com (Max) Date: Fri, 19 Feb 2010 13:24:07 -0500 Subject: NRPE/NSCA replacement thoughts? In-Reply-To: <724C3B2F1C5EB44D9108E471EC5E9933874C0B1570@akechi-denki.ad.nctechcenter.com> References: <4B7D861D.1090906@medin.name> <201002190907.44650.misch@multinet.de> <20100219111930.2592c970@orange.fr> <201002191128.25788.misch@multinet.de> <724C3B2F1C5EB44D9108E471EC5E9933874C0B156A@akechi-denki.ad.nctechcenter.com> <20100219162043.091da5d5@orange.fr> <724C3B2F1C5EB44D9108E471EC5E9933874C0B156C@akechi-denki.ad.nctechcenter.com> <20100219171921.65d2cba4@orange.fr> <724C3B2F1C5EB44D9108E471EC5E9933874C0B1570@akechi-denki.ad.nctechcenter.com> Message-ID: On Fri, Feb 19, 2010 at 1:11 PM, Kevin Keane wrote: > Personally, I'm not sure if there is a single "best way". That is why I keep promoting separating the protocol from the transport. Basically, Nagios protocol data fits into Layer 6 or 7 of the OSI model, while the encryption and authentication really should be handled by layer 4 or 5. We have written in addtion to our other agents an HTTP-based agent that uses either RESTful or non RESTful style URLs for requests. Creating an HTTP RESTful style interface spec would be a pretty clean way to define a standard without getting into implementation discussions and would provide a common ground for plugins. As with SNMP or SSH HTTP comes with a performance price but more and more with Nagios I am seeing (because it is so flexible) that defining standards for a user group / org that uses Nagios is much harder than performance tuning is. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ryan at u13.net Fri Feb 19 19:47:33 2010 From: ryan at u13.net (Ryan Rawdon) Date: Fri, 19 Feb 2010 13:47:33 -0500 Subject: E-mailing separate group for subset of hosts(and their services) In-Reply-To: <765d77c81002190435g5596ca24j8fea0fb3bcf86325@mail.gmail.com> References: <101e2773e0b7e632152214b277de0fa4@192.168.152.50> <765d77c81002190435g5596ca24j8fea0fb3bcf86325@mail.gmail.com> Message-ID: <4B7EDCC5.8070404@u13.net> Thanks a lot Jim, that explanation and page cleared it up. There was a contact groups definition in the services template which overrides the inheritance from the parent host (I hadn't read any formal definition of how the Nagios inheritance works). I removed that template definition and it works as expected. Thanks, Ryan On 2/19/10 7:35 AM, Jim Avery wrote: > On 19 February 2010 02:02, Ryan Rawdon wrote: > >> Hi, >> >> I've had a smallish deployment of Nagios for a while now, but now I need >> to add some more functionality to it. I need to have Nagios notify certain >> people when there is an issue with a host or any service on it. I see that >> adding their contactgroup to the host definition only notifies them when >> the host itself is down or up, however adding their contactgroup to the >> service definition would notify them whenever said service has an issue on >> any host - not just the ones I want them to be notified about. >> >> Where is the happy medium here? Do I need to create a duplicate copy of >> all services on these hosts just so that I can list them as the contacts? >> > > > It depends what version of Nagios you are using (what version are you > using?). In Nagios 3, the service will inherit any contactgroups from > the host (unless any are explicitly defined for the service). > > See: http://nagios.sourceforge.net/docs/3_0/objectinheritance.html > ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From lecb79 at gmail.com Fri Feb 19 22:58:05 2010 From: lecb79 at gmail.com (Eduardo Barreto) Date: Fri, 19 Feb 2010 19:58:05 -0200 Subject: Notifying through sms and call Message-ID: Hi All, How can I set nagios to send a notification to the same contact but with different way, let me explain, I've got more than 300 hosts to monitored, and they send sms when it goes down or up, but I don't want to receive the call from everyone, just a few of then may have to call my mobile phone, how could I do that? Should I create another contact just for the call notifications? Please, someone give a clue. Cheers Eduardo Barreto -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Fri Feb 19 23:42:39 2010 From: marc at ena.com (Marc Powell) Date: Fri, 19 Feb 2010 16:42:39 -0600 Subject: Notifying through sms and call In-Reply-To: References: Message-ID: On Feb 19, 2010, at 3:58 PM, Eduardo Barreto wrote: > Should I create another contact just for the call notifications? Please, someone give a clue. That's how I'd do it. -- Marc ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jcasale at activenetwerx.com Fri Feb 19 23:50:25 2010 From: jcasale at activenetwerx.com (Joseph L. Casale) Date: Fri, 19 Feb 2010 22:50:25 +0000 Subject: snmp net utilization Message-ID: I have a crappy windows app that provides no way of remotely monitoring its status. In fact, the only way I can be assured it's in a state I depend on is if the net utilization is up. I use was using check_snmp as it allows the warning/critical to be set such that when the utilization falls low, then I trigger the alarm. The problem with this is it provides no perf output for use with pnp4nagios and several of the windows hosts that I monitor regularly return no data for short periods, very cyclically. I want to use manubulon's snmp script (check_snmp_int.pl) however I can't get the warning/critical to work on low values from high, no as expected, when low values get high. Anyone know a workaround, using snmp only of course? Thanks! jlc ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From zijaii6405 at vivax.com.br Mon Feb 22 02:42:46 2010 From: zijaii6405 at vivax.com.br (Pfizer Genuine VIAGRA) Date: Mon, 22 Feb 2010 01:42:46 +0000 (UTC) Subject: Mr. gnnu-nagios-users 80% Off on all products Message-ID: <12128.9214388982$1266802966@news.gmane.org> An HTML attachment was scrubbed... URL: From jcasale at activenetwerx.com Sun Feb 21 04:58:58 2010 From: jcasale at activenetwerx.com (Joseph L. Casale) Date: Sun, 21 Feb 2010 03:58:58 +0000 Subject: snmp net utilization In-Reply-To: <633e02961002201127y3d487223w36eb4fa89301dc49@mail.gmail.com> References: <633e02961002201127y3d487223w36eb4fa89301dc49@mail.gmail.com> Message-ID: >>I have a crappy windows app that provides no way of remotely monitoring its status. >>In fact, the only way I can be assured it's in a state I depend on is if the net >>utilization is up. > >Checking whether the application is running or not would not suffice? Nope, it's a dvr application and it's not very stable (hardware used requires it) so the only way I can tell non interactively if it's not "recording" is the net usage. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jerryapr7 at gmail.com Mon Feb 22 10:06:05 2010 From: jerryapr7 at gmail.com (Jerry Joy) Date: Mon, 22 Feb 2010 14:36:05 +0530 Subject: Local firewall changes on Nagios core server Message-ID: <9f3a8a031002220106g6d85ec63m85445d1ddc781e94@mail.gmail.com> Hi, I have setup a Nagios server on RHEL 5 and it has the local firewall "ON". What exactly are the changes required on the local firewall so that Nagios core server is able to accept reports from NRPE and NSClient++ agents. I don't see a fixed port used by Nagios core. Thanks, Jerry -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Mon Feb 22 12:37:32 2010 From: nagios at flatto.net (Assaf Flatto) Date: Mon, 22 Feb 2010 11:37:32 +0000 Subject: Local firewall changes on Nagios core server In-Reply-To: <9f3a8a031002220106g6d85ec63m85445d1ddc781e94@mail.gmail.com> References: <9f3a8a031002220106g6d85ec63m85445d1ddc781e94@mail.gmail.com> Message-ID: <4B826C7C.3030608@flatto.net> Jerry Joy wrote: > Hi, > > I have setup a Nagios server on RHEL 5 and it has the local firewall > "ON". What exactly are the changes required on the local firewall so > that Nagios core server is able to accept reports from NRPE and > NSClient++ agents. I don't see a fixed port used by Nagios core. > > Thanks, > Jerry Nagios does not need a specific port to send out on , unless you use NSCA , then you will need to open the ports for the NSCA daemon . The NRPE and NSClient++ ports need to be open on the client machines - not on the Core server . If your machine is behind the company firewall - and not accessible from the web , you may want to disable the firewall on the local machine. Assaf ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Mon Feb 22 14:09:41 2010 From: jim at jimavery.me.uk (Jim Avery) Date: Mon, 22 Feb 2010 13:09:41 +0000 Subject: Local firewall changes on Nagios core server In-Reply-To: <9f3a8a031002220106g6d85ec63m85445d1ddc781e94@mail.gmail.com> References: <9f3a8a031002220106g6d85ec63m85445d1ddc781e94@mail.gmail.com> Message-ID: <765d77c81002220509i1fe7edeax830e8c6521207c0b@mail.gmail.com> On 22 February 2010 09:06, Jerry Joy wrote: > Hi, > > I have setup a Nagios server on RHEL 5 and it has the local firewall "ON". > What exactly are the changes required on the local firewall so that Nagios > core server is able to accept reports from NRPE and NSClient++ agents. I > don't see a fixed port used by Nagios core. > > Thanks, > Jerry The Nagios server will pull check results using nrpe so you shouldn't need to open a specific port. Likewise, usually the Nagios server will query the NSClient++ agent either using the same nrpe protocol or using the check_nt protocol in which case you also shouldn't need to open a specific port. However, if you have configure NSClient++ to send check results to Nagios using the NSCA protocol, you will need to configure the nsca daemon on your Nagios server to receive these checks and forward them to Nagios. The port the nsca daemon listens on is configurable - by default it's port 5667. Another problem I have seen in the past with RedHat variants of Linux is that SELinux can be a right pain to configure to get things like nsca working. I used to disable SELinux, but now I tend to use Debian derived versions of Linux rather then RedHat so don't see this problem. hth, Jim ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From flyinvap at orange.fr Mon Feb 22 17:09:09 2010 From: flyinvap at orange.fr (Flyinvap) Date: Mon, 22 Feb 2010 17:09:09 +0100 Subject: unsubscribe In-Reply-To: <004701cab3d0$4c8cb620$e5a62260$@com> References: <004701cab3d0$4c8cb620$e5a62260$@com> Message-ID: <20100222170909.11852271@orange.fr> Le Mon, 22 Feb 2010 10:04:13 -0500, "Rob Bowman" a ?crit : or With a decent mail client [1], you just have to click in the menu ;-) [1] http://www.claws-mail.org -- Flyinvap ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From shadhin71 at gmail.com Mon Feb 22 19:36:44 2010 From: shadhin71 at gmail.com (shadih rahman) Date: Mon, 22 Feb 2010 13:36:44 -0500 Subject: NRPE question Message-ID: <6db4a4201002221036j43ba1b66nced92ceda34ddc2a@mail.gmail.com> I am running nrpe as daemon rather than xinetd. I have one server which showed high load and all nrpe checks where showing UNKNOWN. Later on I logged into the box and I saw there were 382 nrpe process running. Can someone shed some light into this? Thanks -- Cordially, Shadhin Rahman -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From shadhin71 at gmail.com Mon Feb 22 21:05:07 2010 From: shadhin71 at gmail.com (shadih rahman) Date: Mon, 22 Feb 2010 15:05:07 -0500 Subject: NRPE question In-Reply-To: <6db4a4201002221201w3067eab1v90b9ff516e0ce4d0@mail.gmail.com> References: <6db4a4201002221036j43ba1b66nced92ceda34ddc2a@mail.gmail.com> <20100222193429.GA4621@bakgwai.americas.hpqcorp.net> <6db4a4201002221201w3067eab1v90b9ff516e0ce4d0@mail.gmail.com> Message-ID: <6db4a4201002221205r6b715d02qd2704dc9829c7b1f@mail.gmail.com> I have thought about this solution but I have an issue with this. Let us say the machine has high load. Now, I presume, we are checking the number of "nrpe" processes on client host via nrpe. If the load is high, the check for number of nrpe processes will fail, which in turn would fork possibly another nrpe process? Please advise on this. Thanks > > > On Mon, Feb 22, 2010 at 2:34 PM, wrote: > >> On Mon, 22 Feb 2010, shadih rahman wrote: >> >> > I am running nrpe as daemon rather than xinetd. I have one server which >> showed high load and all nrpe checks where showing UNKNOWN. Later on I >> logged into the box and I saw there were 382 nrpe process running. Can >> someone shed some light into this? Thanks >> >> Happens here all the time. I have a Nagios check that looks for >> high numbers of NRPE processes and restarts it if it goes out of >> control. >> >> I'm not sure what causes it; the brute force method we're using here has >> been enough to keep it under control. >> > > > > -- > Cordially, > Shadhin Rahman > -- Cordially, Shadhin Rahman -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From shadhin71 at gmail.com Mon Feb 22 21:06:22 2010 From: shadhin71 at gmail.com (shadih rahman) Date: Mon, 22 Feb 2010 15:06:22 -0500 Subject: NRPE question In-Reply-To: <6db4a4201002221201w3067eab1v90b9ff516e0ce4d0@mail.gmail.com> References: <6db4a4201002221036j43ba1b66nced92ceda34ddc2a@mail.gmail.com> <20100222193429.GA4621@bakgwai.americas.hpqcorp.net> <6db4a4201002221201w3067eab1v90b9ff516e0ce4d0@mail.gmail.com> Message-ID: <6db4a4201002221206q20adc421t8db3be98e110d5aa@mail.gmail.com> I have thought about this solution but I have an issue with this. Let us say the machine has high load. Now, I presume, we are checking the number of "nrpe" processes on client host via nrpe. If the load is high, the check for number of nrpe processes will fail, which in turn would fork possibly another nrpe process? Please advise on this. Thanks On Mon, Feb 22, 2010 at 3:01 PM, shadih rahman wrote: > I have thought about this solution but I have an issue with this. Let us > say the machine has high load. Now, I presume, we are checking the number > of "nrpe" processes on client host via nrpe. If the load is high, the check > for number of nrpe processes will fail, which in turn would fork possibly > another nrpe process? Please advise on this. Thanks > > > On Mon, Feb 22, 2010 at 2:34 PM, wrote: > >> On Mon, 22 Feb 2010, shadih rahman wrote: >> >> > I am running nrpe as daemon rather than xinetd. I have one server which >> showed high load and all nrpe checks where showing UNKNOWN. Later on I >> logged into the box and I saw there were 382 nrpe process running. Can >> someone shed some light into this? Thanks >> >> Happens here all the time. I have a Nagios check that looks for >> high numbers of NRPE processes and restarts it if it goes out of >> control. >> >> I'm not sure what causes it; the brute force method we're using here has >> been enough to keep it under control. >> > > > > -- > Cordially, > Shadhin Rahman > -- Cordially, Shadhin Rahman -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From shadhin71 at gmail.com Mon Feb 22 21:27:30 2010 From: shadhin71 at gmail.com (shadih rahman) Date: Mon, 22 Feb 2010 15:27:30 -0500 Subject: Alternate schedule for service Message-ID: <6db4a4201002221227u99767f1gac13b5f0b5ab9f70@mail.gmail.com> I have to check a service every 30 minutes during normal business hour and check the service every two hours after mid-night. I have a extended time period definition. Can someone please give an example how to do this? Thanks define timeperiod{ timeperiod_name extended alias extended sunday 07:00-22:59,23:00-06:59 monday 07:00-22:59,23:00-06:59 tuesday 07:00-22:59,23:00-06:59 wednesday 07:00-22:59,23:00-06:59 thursday 07:00-22:59,23:00-06:59 friday 07:00-22:59,23:00-06:59 saturday 07:00-22:59,23:00-06:59 } define service{ name extended ; Name of this template use generic-service ; Inherit default values check_period extended normal_check_interval 30 retry_check_interval 5 max_check_attempts 2 notification_period extended notification_interval 30 notification_options c,r,f register 0 ; DONT REGISTER THIS - ITS A TEMPLATE } -- Cordially, Shadhin Rahman -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Mon Feb 22 21:34:33 2010 From: patrick.morris at hp.com (patrick.morris at hp.com) Date: Mon, 22 Feb 2010 12:34:33 -0800 Subject: Alternate schedule for service In-Reply-To: <6db4a4201002221227u99767f1gac13b5f0b5ab9f70@mail.gmail.com> References: <6db4a4201002221227u99767f1gac13b5f0b5ab9f70@mail.gmail.com> Message-ID: <20100222203433.GC4621@bakgwai.americas.hpqcorp.net> Hi shadih! On Mon, 22 Feb 2010, shadih rahman wrote: > I have to check a service every 30 minutes during normal business hour and check the service every two hours after mid-night. I have a extended time period definition. Can someone please give an example how to do this? Thanks > > define timeperiod{ > timeperiod_name extended > alias extended > sunday 07:00-22:59,23:00-06:59 > monday 07:00-22:59,23:00-06:59 > tuesday 07:00-22:59,23:00-06:59 > wednesday 07:00-22:59,23:00-06:59 > thursday 07:00-22:59,23:00-06:59 > friday 07:00-22:59,23:00-06:59 > saturday 07:00-22:59,23:00-06:59 > } > > define service{ > name extended ; Name of this template > use generic-service ; Inherit default values > check_period extended > normal_check_interval 30 > retry_check_interval 5 > max_check_attempts 2 > notification_period extended > notification_interval 30 > notification_options c,r,f > register 0 ; DONT REGISTER THIS - ITS A TEMPLATE > } If you need two different normal_check_intervals, you probably need two services. I can't think of any way to modify the check interval based on a time period (though maybe someone else will have some ideas). ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From shadhin71 at gmail.com Mon Feb 22 21:35:04 2010 From: shadhin71 at gmail.com (shadih rahman) Date: Mon, 22 Feb 2010 15:35:04 -0500 Subject: NRPE question In-Reply-To: <20100222203230.GB4621@bakgwai.americas.hpqcorp.net> References: <6db4a4201002221036j43ba1b66nced92ceda34ddc2a@mail.gmail.com> <20100222193429.GA4621@bakgwai.americas.hpqcorp.net> <6db4a4201002221201w3067eab1v90b9ff516e0ce4d0@mail.gmail.com> <20100222203230.GB4621@bakgwai.americas.hpqcorp.net> Message-ID: <6db4a4201002221235k5a52e7e0u3894475bd6dfa00d@mail.gmail.com> Unfortunately my problem is that its happenning on a remote client machine rather than nagios server. On Mon, Feb 22, 2010 at 3:32 PM, wrote: > Hi shadih! > > On Mon, 22 Feb 2010, shadih rahman wrote: > > > I have thought about this solution but I have an issue with this. Let us > say the machine has high load. Now, I presume, we are checking the number > of "nrpe" processes on client host via nrpe. If the load is high, the check > for number of nrpe processes will fail, which in turn would fork possibly > another nrpe process? Please advise on this. Thanks > > > > On Mon, Feb 22, 2010 at 2:34 PM, patrick.morris at hp.com>> wrote: > > On Mon, 22 Feb 2010, shadih rahman wrote: > > > > > I am running nrpe as daemon rather than xinetd. I have one server > which showed high load and all nrpe checks where showing UNKNOWN. Later on > I logged into the box and I saw there were 382 nrpe process running. Can > someone shed some light into this? Thanks > > > > Happens here all the time. I have a Nagios check that looks for > > high numbers of NRPE processes and restarts it if it goes out of > > control. > > > > I'm not sure what causes it; the brute force method we're using here has > > been enough to keep it under control. > > The check I was referring to runs on the Nagios boxes (since that's > where I see nrpe go out of control). So I'm doing a local check that > does nothing but look for NRPE in the process list; it doesn't use NRPE > to do that. > -- Cordially, Shadhin Rahman -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From akpgeek at gmail.com Mon Feb 22 22:36:33 2010 From: akpgeek at gmail.com (akp geek) Date: Mon, 22 Feb 2010 16:36:33 -0500 Subject: check_disk question Nagios Message-ID: <2024a9fb1002221336l764dd44n236628239e80b387@mail.gmail.com> Hi All - I have installed nagios on *server1*. Without installing NRPE on the *Server2* , would it possible to use the *check_disk* to monitor the space on *server2*. Thanks for the help Regards -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From stuart.browne at ausregistry.com.au Mon Feb 22 22:59:19 2010 From: stuart.browne at ausregistry.com.au (Stuart Browne) Date: Tue, 23 Feb 2010 08:59:19 +1100 Subject: check_disk question Nagios In-Reply-To: <2024a9fb1002221336l764dd44n236628239e80b387@mail.gmail.com> References: <2024a9fb1002221336l764dd44n236628239e80b387@mail.gmail.com> Message-ID: <8CEF048B9EC83748B1517DC64EA130FB3E2CA8B676@off-win2003-01.ausregistrygroup.local> > From: akp geek [mailto:akpgeek at gmail.com] > ? > I have installed nagios on server1. Without installing NRPE on the Server2 , > would it possible to use the check_disk to monitor the space on server2. Thanks > for the help Investigate the 'check_by_ssh' plugin that comes with the nagios-plugins. Stuart ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ennis at nmcourts.gov Mon Feb 22 23:20:01 2010 From: ennis at nmcourts.gov (Ennis Ibarra) Date: Mon, 22 Feb 2010 15:20:01 -0700 (MST) Subject: check_disk question Nagios In-Reply-To: <1394152433.26000.1266876976637.JavaMail.root@mail.nmcourts.gov> References: <1394152433.26000.1266876976637.JavaMail.root@mail.nmcourts.gov> Message-ID: <209537025.26354.1266877201194.JavaMail.root@mail.nmcourts.gov> What are you familiar with? Perl, BASH ? If you want to give it a try, use http://debianclusters.cs.uni.edu/index.php/Creating_Your_Own_Nagios_Plugin it was very handy for me, and the previous suggestion about check_by_ssh is also handy. Go crazy mate and good luck. ----- Original Message ----- From: "Stuart Browne" To: "akp geek" , "Nagios-Users Mailinglist" Sent: Monday, February 22, 2010 2:59:19 PM Subject: Re: [Nagios-users] check_disk question Nagios > From: akp geek [mailto:akpgeek at gmail.com] > > I have installed nagios on server1. Without installing NRPE on the Server2 , > would it possible to use the check_disk to monitor the space on server2. Thanks > for the help Investigate the 'check_by_ssh' plugin that comes with the nagios-plugins. Stuart ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From stuart.browne at ausregistry.com.au Mon Feb 22 23:45:51 2010 From: stuart.browne at ausregistry.com.au (Stuart Browne) Date: Tue, 23 Feb 2010 09:45:51 +1100 Subject: check_disk question Nagios In-Reply-To: <209537025.26354.1266877201194.JavaMail.root@mail.nmcourts.gov> References: <1394152433.26000.1266876976637.JavaMail.root@mail.nmcourts.gov> <209537025.26354.1266877201194.JavaMail.root@mail.nmcourts.gov> Message-ID: <8CEF048B9EC83748B1517DC64EA130FB3E2CA8B6AB@off-win2003-01.ausregistrygroup.local> > From: Ennis Ibarra [mailto:ennis at nmcourts.gov] >> From: "Stuart Browne" >>> From: akp geek [mailto:akpgeek at gmail.com] >>> >>> I have installed nagios on server1. Without installing NRPE on the >>> Server2, would it possible to use the check_disk to monitor the >>> space on server2. Thanks for the help >> >> Investigate the 'check_by_ssh' plugin that comes with the nagios-plugins. >> > What are you familiar with? Perl, BASH ? If you want to give it a try, > use > http://debianclusters.cs.uni.edu/index.php/Creating_Your_Own_Nagios_Plugin > it was very handy for me, and the previous suggestion about check_by_ssh > is also handy. Another option I didn't think of is to use NSCA to submit passive checks. But that involves installing the NSCA client on SERVER2 and the NSCA service on SERVER1. Stuart ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jpratt at norwich.edu Tue Feb 23 01:36:27 2010 From: jpratt at norwich.edu (James Pratt) Date: Mon, 22 Feb 2010 19:36:27 -0500 Subject: check_disk question Nagios In-Reply-To: <8CEF048B9EC83748B1517DC64EA130FB3E2CA8B6AB@off-win2003-01.ausregistrygroup.local> References: <1394152433.26000.1266876976637.JavaMail.root@mail.nmcourts.gov><209537025.26354.1266877201194.JavaMail.root@mail.nmcourts.gov> <8CEF048B9EC83748B1517DC64EA130FB3E2CA8B6AB@off-win2003-01.ausregistrygroup.local> Message-ID: <24B6509E4191AF44B60A24EAA3B4AD49525383@nuexchg.norwich.edu> >> -----Original Message----- >> From: Stuart Browne [mailto:stuart.browne at ausregistry.com.au] >> Sent: Monday, February 22, 2010 5:46 PM >> To: nagios-users at lists.sourceforge.net >> Subject: Re: [Nagios-users] check_disk question Nagios >> >> > From: Ennis Ibarra [mailto:ennis at nmcourts.gov] >> >> From: "Stuart Browne" >> >>> From: akp geek [mailto:akpgeek at gmail.com] >> >>> >> >>> I have installed nagios on server1. Without installing NRPE on the >> >>> Server2, would it possible to use the check_disk to monitor the >> >>> space on server2. Thanks for the help >> >> >> >> Investigate the 'check_by_ssh' plugin that comes with the nagios-plugins. >> >> >> > What are you familiar with? Perl, BASH ? If you want to give it a try, >> > use >> > http://debianclusters.cs.uni.edu/index.php/Creating_Your_Own_Nagios_Plug in >> > it was very handy for me, and the previous suggestion about check_by_ssh >> > is also handy. >> >> Another option I didn't think of is to use NSCA to submit passive checks. But that >> involves installing the NSCA client on SERVER2 and the NSCA service on SERVER1. >> >> Stuart Those would all work, but if you are like me and have too many things to monitor with software agents and such, there is always good old SNMP polling - http://nagios.manubulon.com/ cheers, james ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dgullett at symmetrixtech.com Tue Feb 23 06:59:02 2010 From: dgullett at symmetrixtech.com (David Gullett) Date: Mon, 22 Feb 2010 23:59:02 -0600 Subject: Snort 2.8.5 on Ubuntu 8.04 LTS Installation Guide Message-ID: <1266904742.24400.23.camel@dgullett-desktop> We're pleased to announce that we have just published a step-by-step guide on how to install Snort 2.8.5 and Snort Report 1.3.1 on Ubuntu 8.04 LTS. It's available for direct download here: http://www.symmetrixtech.com/articles/001-snortinstallguide.pdf Regards, David Gullett Symmetrix Technologies http://www.symmetrixtech.com ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From dgullett at symmetrixtech.com Tue Feb 23 07:00:04 2010 From: dgullett at symmetrixtech.com (David Gullett) Date: Tue, 23 Feb 2010 00:00:04 -0600 Subject: Snort 2.8.5 on Ubuntu 8.04 LTS Installation Guide In-Reply-To: <1266904742.24400.23.camel@dgullett-desktop> References: <1266904742.24400.23.camel@dgullett-desktop> Message-ID: <1266904804.24400.24.camel@dgullett-desktop> Ugh, my apologies, wrong list. ----------- We're pleased to announce that we have just published a step-by-step guide on how to install Snort 2.8.5 and Snort Report 1.3.1 on Ubuntu 8.04 LTS. It's available for direct download here: http://www.symmetrixtech.com/articles/001-snortinstallguide.pdf Regards, David Gullett Symmetrix Technologies http://www.symmetrixtech.com -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Tue Feb 23 17:16:27 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Tue, 23 Feb 2010 17:16:27 +0100 Subject: check_openmanage and net-snmp v3 In-Reply-To: <15tpr46r6am.fsf@tux.uio.no> (Trond Hasle Amundsen's message of "Mon, 15 Feb 2010 14:13:05 +0100") References: <701DEDF6795DA64DAA315656332BE29102C07377@webmail.meucci.be> <15tvde3srlb.fsf@tux.uio.no> <701DEDF6795DA64DAA315656332BE29102C074AE@webmail.meucci.be> <15taavesnih.fsf@tux.uio.no> <701DEDF6795DA64DAA315656332BE29102C07658@webmail.meucci.be> <15tpr46r6am.fsf@tux.uio.no> Message-ID: <15thbp8dj1g.fsf@tux.uio.no> Hi all, Just to bring this thread to a conclusion... I have released a new version of check_openmanage that adds a new option '--use-get_table', which is to be used as a workaround for issues with SNMPv3 on Windows using net-snmp. There are a few other minor fixes and feature enhancements as well. Downloads and changelog: http://folk.uio.no/trondham/software/check_openmanage.html#download (Also available on Nagios Exchange and Monitoring Exchange.) Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jerryapr7 at gmail.com Tue Feb 23 20:05:16 2010 From: jerryapr7 at gmail.com (Jerry Joy) Date: Wed, 24 Feb 2010 00:35:16 +0530 Subject: Local firewall changes on Nagios core server In-Reply-To: <765d77c81002220509i1fe7edeax830e8c6521207c0b@mail.gmail.com> References: <9f3a8a031002220106g6d85ec63m85445d1ddc781e94@mail.gmail.com> <765d77c81002220509i1fe7edeax830e8c6521207c0b@mail.gmail.com> Message-ID: <9f3a8a031002231105j1de9a692r19f9debb6a51787d@mail.gmail.com> Thanks Assaf and Jim for the detailed response. Assaf - We need to keep the local firewall ON as per policy Jim - SELinux is disabled on the server so saved from the SElinux trouble. On Mon, Feb 22, 2010 at 6:39 PM, Jim Avery wrote: > On 22 February 2010 09:06, Jerry Joy wrote: > > Hi, > > > > I have setup a Nagios server on RHEL 5 and it has the local firewall > "ON". > > What exactly are the changes required on the local firewall so that > Nagios > > core server is able to accept reports from NRPE and NSClient++ agents. I > > don't see a fixed port used by Nagios core. > > > > Thanks, > > Jerry > > The Nagios server will pull check results using nrpe so you shouldn't > need to open a specific port. > > Likewise, usually the Nagios server will query the NSClient++ agent > either using the same nrpe protocol or using the check_nt protocol in > which case you also shouldn't need to open a specific port. However, > if you have configure NSClient++ to send check results to Nagios using > the NSCA protocol, you will need to configure the nsca daemon on your > Nagios server to receive these checks and forward them to Nagios. The > port the nsca daemon listens on is configurable - by default it's port > 5667. > > Another problem I have seen in the past with RedHat variants of Linux > is that SELinux can be a right pain to configure to get things like > nsca working. I used to disable SELinux, but now I tend to use Debian > derived versions of Linux rather then RedHat so don't see this > problem. > > hth, > > Jim > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jperrin at gmail.com Tue Feb 23 21:55:32 2010 From: jperrin at gmail.com (Jim Perrin) Date: Tue, 23 Feb 2010 15:55:32 -0500 Subject: Snort 2.8.5 on Ubuntu 8.04 LTS Installation Guide In-Reply-To: <1266904804.24400.24.camel@dgullett-desktop> References: <1266904742.24400.23.camel@dgullett-desktop> <1266904804.24400.24.camel@dgullett-desktop> Message-ID: <302ce8b51002231255t5cb3671apebff0f45087e0bde@mail.gmail.com> On Tue, Feb 23, 2010 at 1:00 AM, David Gullett wrote: > Ugh, my apologies, wrong list. But since you're here, can you generate nagios alerts based on snort reports? :-P -- During times of universal deceit, telling the truth becomes a revolutionary act. George Orwell ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From shadhin71 at gmail.com Wed Feb 24 02:07:06 2010 From: shadhin71 at gmail.com (shadih rahman) Date: Tue, 23 Feb 2010 20:07:06 -0500 Subject: ndo unusual behavior Message-ID: <6db4a4201002231707j2e258043i220460a8d0cace03@mail.gmail.com> In my nagios setup all object_id from nagios_object table jumped down to 31**** to 3**** on January 5, 2010 around 11: 06 PM, can someone shed any light to this issue? Thanks -- Cordially, Shadhin Rahman -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From daniel at danielemmanuelfeinsmith.com Wed Feb 24 05:19:31 2010 From: daniel at danielemmanuelfeinsmith.com (daniel at danielemmanuelfeinsmith.com) Date: Tue, 23 Feb 2010 21:19:31 -0700 Subject: Brooklyn iPhone App Push Notification Query In-Reply-To: <4A0E68A7.3000209@edcint.co.nz> References: <494F355B.3080904@vanderkooij.org> <4A0E68A7.3000209@edcint.co.nz> Message-ID: <1BBB5519-8113-4F0D-B54E-036132B936AA@danielemmanuelfeinsmith.com> Greetings Community, I am the developer of the iPhone app, "Brooklyn for Nagios". I am trying to gauge community interest in two features I'm considering adding: 1. Push notification on host and/or service status change. 2. Support for multiple nagios instances. Push notification would require the installation of a push notification proxy that would send outbound packets to an APNS (Apple Push Notification) Server, which would then deliver the notifications to your device. The advantage of push over SMS is (1) it is free, and (2), the push message will lead you directly into the Brooklyn for Nagios app to the host or service with the problem where you can ameliorate the issue quickly without having to find the host or service using the Web UI. Both of these constitute major changes, so I'd be glad to get some community feedback on priority. Thanks much, Daniel. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hsjang at gmail.com Wed Feb 24 08:47:21 2010 From: hsjang at gmail.com (HyunSung Jang) Date: Wed, 24 Feb 2010 16:47:21 +0900 Subject: parent/child relationship in distribute environment Message-ID: hello. i just finished setting up nagios as distributed monitoring and i found a problem. we have a backbone switch called BB1. and we have 2 access switch called A1, A2 which uses uplink to BB1 so this makes both A1,A2 have its parent BB1 and then i prepared 4 dedicated servers for nagios, first server for central server, second server for servers under A1, third server for servers under A2, fourth server for our switches BB1,A1,A2 in this environment, how can i define parent/child relationship in distribute nagios server?? i read nagios manual that says "only those services and hosts which are being monitored directly by the distributed server are defined in the object configuration file" i want keep all configuration file for switches only fourth server which is for dedicated switch monitoring. when i try to start nagios in second and thrid server, it shows error like "there is no BB1 parent define in configuration" thanks for your help. -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Wed Feb 24 11:19:54 2010 From: nagios at flatto.net (Assaf Flatto) Date: Wed, 24 Feb 2010 10:19:54 +0000 Subject: parent/child relationship in distribute environment In-Reply-To: References: Message-ID: <4B84FD4A.5030803@flatto.net> HyunSung Jang wrote: > hello. > > i just finished setting up nagios as distributed monitoring and i > found a problem. > we have a backbone switch called BB1. > and we have 2 access switch called A1, A2 which uses uplink to BB1 > so this makes both A1,A2 have its parent BB1 > > and then i prepared 4 dedicated servers for nagios, > first server for central server, > second server for servers under A1, > third server for servers under A2, > fourth server for our switches BB1,A1,A2 > > in this environment, how can i define parent/child relationship in > distribute nagios server?? > i read nagios manual that says "only those services and hosts which > are being monitored directly by the distributed server are defined in > the object configuration file" > > i want keep all configuration file for switches only fourth server > which is for dedicated switch monitoring. > > when i try to start nagios in second and thrid server, it shows error > like "there is no BB1 parent define in configuration" > > thanks for your help. > Hello I know it is bad to assume things , but i will take it that you have both server A1 and A2 transfer the results to the central nagios . What you must remember and is clearly stated in the docs is that the definitions of the remote server has to also be on the central server - they do not however need to be a duplicate on the remote server , that being said , the Parnet/child relation ship can and should be defined on the central server for the whole network , BB1-->A1 -> hosts . while on each remote host you define the parent/child for only the relevant hosts that the host is checking . so if host A1 id down stream from BB1 - it is not defined on A1 , but on the central server only ( it is also a good idea to have active monitoring of BB1 on the central so it will be able to show real time UP/Unknown statuses ). Hope this helps. Assaf ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ampranti at gmail.com Wed Feb 24 12:27:06 2010 From: ampranti at gmail.com (Brandino Andreas) Date: Wed, 24 Feb 2010 13:27:06 +0200 Subject: check_cluster - Wrong service state Message-ID: <35799310.20100224132706@gmail.com> Hi all, I configured check_cluster plugin to monitor two services that work redundant (1 is up, 1 is down all the time). The plugin that check the services, returns "UNKNOWN" state for the service that are down. However, "check_cluster" return that both services are "OK" (CLUSTER OK: Disk-Cluster: 2 ok, 0 warning, 0 unknown, 0 critical). Which can possible reasons for detecting incorrectly the state of the services? Thank you <> --- - - - --- <> Brandino Andreas ampranti at gmail.com <> --- - - - --- <> ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Wed Feb 24 13:45:50 2010 From: nagios at flatto.net (Assaf Flatto) Date: Wed, 24 Feb 2010 12:45:50 +0000 Subject: parent/child relationship in distribute environment In-Reply-To: References: <4B84FD4A.5030803@flatto.net> Message-ID: <4B851F7E.4080208@flatto.net> HyunSung Jang wrote: first thing to remember is a nagios server has no parent since he is the source of test . but in a distributed nagios setting there is the exception to the rule. > thank you for your comment, > but, I think i already configured like you just said. > actually A1,A2 is not a server but it is our access switch i mean > there are many servers under A1, A2 > > let me describe my environment again, > > BB1->A1->server1, server2, nagios1(center), nagios2(for servers under > A1), nagios3(for servers under A2), nagios4(just for switch monitoring) So all nagios monitoring servers are on the same network connected to the same switch ? > BB1->A2->server3, server4... > > my central server (nagios1) has all objects > definition(BB1,A1,A2,server1,server2,server3....etc) > and then nagios2 has only 2 defined servers which is server1 and server2 > nagios3 has server3, server4 > > if I start nagios daemon on nagios3, it shows me an error like > "there's no parent definition for server3" > because parent for server3 is A2 and parent for A2 is BB1. > and I didn't put config for A2 and BB1 to nagios3. > That it the reason - server3 has a definition of a parent in the file on nagios3 , and since the definition does not match the configuration files , that is the error you get . > I have no problem when i starting nagios in my central server, > cuz central server has all configurations of my servers and switches > > but distributed nagios server which is nagios3 won't start. > > am I doing right or is there a misunderstanding to config nagios properly? > The distributed nagios server should hold parent/child definition only for the hosts they monitor directly, but not for anything "upstream" toward the central server, since they are not being monitored from that machine , the central server should hold for every thing , so it can know the topology and structure , if all the nagios servers are on the same switch - the switch should be the parent for the distributed servers . Assaf ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hsjang at gmail.com Wed Feb 24 13:29:12 2010 From: hsjang at gmail.com (HyunSung Jang) Date: Wed, 24 Feb 2010 21:29:12 +0900 Subject: parent/child relationship in distribute environment In-Reply-To: <4B84FD4A.5030803@flatto.net> References: <4B84FD4A.5030803@flatto.net> Message-ID: thank you for your comment, but, I think i already configured like you just said. actually A1,A2 is not a server but it is our access switch i mean there are many servers under A1, A2 let me describe my environment again, BB1->A1->server1, server2, nagios1(center), nagios2(for servers under A1), nagios3(for servers under A2), nagios4(just for switch monitoring) BB1->A2->server3, server4... my central server (nagios1) has all objects definition(BB1,A1,A2,server1,server2,server3....etc) and then nagios2 has only 2 defined servers which is server1 and server2 nagios3 has server3, server4 if I start nagios daemon on nagios3, it shows me an error like "there's no parent definition for server3" because parent for server3 is A2 and parent for A2 is BB1. and I didn't put config for A2 and BB1 to nagios3. I have no problem when i starting nagios in my central server, cuz central server has all configurations of my servers and switches but distributed nagios server which is nagios3 won't start. am I doing right or is there a misunderstanding to config nagios properly? On Wed, Feb 24, 2010 at 7:19 PM, Assaf Flatto wrote: > HyunSung Jang wrote: > > hello. > > > > i just finished setting up nagios as distributed monitoring and i > > found a problem. > > we have a backbone switch called BB1. > > and we have 2 access switch called A1, A2 which uses uplink to BB1 > > so this makes both A1,A2 have its parent BB1 > > > > and then i prepared 4 dedicated servers for nagios, > > first server for central server, > > second server for servers under A1, > > third server for servers under A2, > > fourth server for our switches BB1,A1,A2 > > > > in this environment, how can i define parent/child relationship in > > distribute nagios server?? > > i read nagios manual that says "only those services and hosts which > > are being monitored directly by the distributed server are defined in > > the object configuration file" > > > > i want keep all configuration file for switches only fourth server > > which is for dedicated switch monitoring. > > > > when i try to start nagios in second and thrid server, it shows error > > like "there is no BB1 parent define in configuration" > > > > thanks for your help. > > > Hello > > I know it is bad to assume things , but i will take it that you have > both server A1 and A2 transfer the results to the central nagios . > > What you must remember and is clearly stated in the docs is that the > definitions of the remote server has to also be on the central server - > they do not however need to be a duplicate on the remote server , that > being said , the Parnet/child relation ship can and should be defined on > the central server for the whole network , BB1-->A1 -> hosts . while on > each remote host you define the parent/child for only the relevant hosts > that the host is checking . > so if host A1 id down stream from BB1 - it is not defined on A1 , but on > the central server only ( it is also a good idea to have active > monitoring of BB1 on the central so it will be able to show real > time UP/Unknown statuses ). > > Hope this helps. > > Assaf > > > ------------------------------------------------------------------------------ > Download Intel® Parallel Studio Eval > Try the new software tools for yourself. Speed compiling, find bugs > proactively, and fine-tune applications for parallel performance. > See why Intel Parallel Studio got high marks during beta. > http://p.sf.net/sfu/intel-sw-dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hsjang at gmail.com Wed Feb 24 14:15:47 2010 From: hsjang at gmail.com (HyunSung Jang) Date: Wed, 24 Feb 2010 22:15:47 +0900 Subject: parent/child relationship in distribute environment In-Reply-To: <4B851F7E.4080208@flatto.net> References: <4B84FD4A.5030803@flatto.net> <4B851F7E.4080208@flatto.net> Message-ID: On Wed, Feb 24, 2010 at 9:45 PM, Assaf Flatto wrote: > HyunSung Jang wrote: > > first thing to remember is a nagios server has no parent since he is the > source of test . > but in a distributed nagios setting there is the exception to the rule. > > yes. > > thank you for your comment, > > but, I think i already configured like you just said. > > actually A1,A2 is not a server but it is our access switch i mean > > there are many servers under A1, A2 > > > > let me describe my environment again, > > > > BB1->A1->server1, server2, nagios1(center), nagios2(for servers under > > A1), nagios3(for servers under A2), nagios4(just for switch monitoring) > > So all nagios monitoring servers are on the same network connected to > the same switch ? > > your are right. all nagios monitoring servers are on the same network and same switch. > > BB1->A2->server3, server4... > > > > my central server (nagios1) has all objects > > definition(BB1,A1,A2,server1,server2,server3....etc) > > and then nagios2 has only 2 defined servers which is server1 and server2 > > nagios3 has server3, server4 > > > > if I start nagios daemon on nagios3, it shows me an error like > > "there's no parent definition for server3" > > because parent for server3 is A2 and parent for A2 is BB1. > > and I didn't put config for A2 and BB1 to nagios3. > > > That it the reason - server3 has a definition of a parent in the file on > nagios3 , and since the definition does not match the configuration > files , that is the error you get . > > I have no problem when i starting nagios in my central server, > > cuz central server has all configurations of my servers and switches > > > > but distributed nagios server which is nagios3 won't start. > > > > am I doing right or is there a misunderstanding to config nagios > properly? > > > The distributed nagios server should hold parent/child definition only > for the hosts they monitor directly, but not for anything "upstream" > toward the central server, since they are not being monitored from that > machine , the central server should hold for every thing , so it can > know the topology and structure , if all the nagios servers are on the > same switch - the switch should be the parent for the distributed servers . oh, I got it. so you mean, "server3" should configured to its parent to nagios2(distribute monitor server) directly even if there are A2 and BB1 switches between them. because nagios2 monitor server3 directly. am I correct? -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ml at nicole-haehnel.de Wed Feb 24 13:25:59 2010 From: ml at nicole-haehnel.de (=?ISO-8859-15?Q?Nicole_H=E4hnel?=) Date: Wed, 24 Feb 2010 13:25:59 +0100 Subject: Problem with check_openmanage 3.5.6 Message-ID: <4B851AD7.1050904@nicole-haehnel.de> Hi I get this message on one pe830 (OM 6.1.0) : CRITICAL: [ xxx] Physical Disk 0:0 [Wdc WD1600JS-55MHB0, 160GB] on ctrl 0 needs attention: ------ SYSTEM: PowerEdge 830, SN: xxx INTERNAL ERROR: Use of uninitialized value in string eq at /usr/lib64/nagios/plugins/grontmij/check_openmanage line 1428. INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/grontmij/check_openmanage line 1441. Is this a problem of check_openmanage or the disk? It's a non dell sata disk. Thanks! Nicole -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Wed Feb 24 15:03:12 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Wed, 24 Feb 2010 15:03:12 +0100 Subject: Problem with check_openmanage 3.5.6 In-Reply-To: <4B851AD7.1050904@nicole-haehnel.de> ("Nicole =?utf-8?Q?H?= =?utf-8?Q?=C3=A4hnel=22's?= message of "Wed, 24 Feb 2010 13:25:59 +0100") References: <4B851AD7.1050904@nicole-haehnel.de> Message-ID: <15tr5oad93z.fsf@tux.uio.no> Nicole H?hnel writes: > Hi > > I get this message on one pe830 (OM 6.1.0) : > > CRITICAL: [ xxx] Physical Disk 0:0 [Wdc WD1600JS-55MHB0, 160GB] on ctrl 0 needs > attention: > ------ SYSTEM: PowerEdge 830, SN: xxx > INTERNAL ERROR: Use of uninitialized value in string eq at /usr/lib64/nagios/ > plugins/grontmij/check_openmanage line 1428. > INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/ > plugins/grontmij/check_openmanage line 1441. > > > Is this a problem of check_openmanage or the disk? > It's a non dell sata disk. Hi Nicole, Can you provide the output of the following command, executed on the monitored host: omreport storage pdisk controller=0 Also, are you using check_openmanage in SNMP or local context? Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From nagios at flatto.net Wed Feb 24 14:31:38 2010 From: nagios at flatto.net (Assaf Flatto) Date: Wed, 24 Feb 2010 13:31:38 +0000 Subject: parent/child relationship in distribute environment In-Reply-To: References: <4B84FD4A.5030803@flatto.net> <4B851F7E.4080208@flatto.net> Message-ID: <4B852A3A.3040901@flatto.net> HyunSung Jang wrote: > On Wed, Feb 24, 2010 at 9:45 PM, Assaf Flatto > wrote: > > HyunSung Jang wrote: > > first thing to remember is a nagios server has no parent since he > is the > source of test . > but in a distributed nagios setting there is the exception to the > rule. > > yes. > > > > thank you for your comment, > > but, I think i already configured like you just said. > > actually A1,A2 is not a server but it is our access switch i mean > > there are many servers under A1, A2 > > > > let me describe my environment again, > > > > BB1->A1->server1, server2, nagios1(center), nagios2(for servers > under > > A1), nagios3(for servers under A2), nagios4(just for switch > monitoring) > > So all nagios monitoring servers are on the same network connected to > the same switch ? > > your are right. all nagios monitoring servers are on the same network > and same switch. > > > > BB1->A2->server3, server4... > > > > my central server (nagios1) has all objects > > definition(BB1,A1,A2,server1,server2,server3....etc) > > and then nagios2 has only 2 defined servers which is server1 and > server2 > > nagios3 has server3, server4 > > > > if I start nagios daemon on nagios3, it shows me an error like > > "there's no parent definition for server3" > > because parent for server3 is A2 and parent for A2 is BB1. > > and I didn't put config for A2 and BB1 to nagios3. > > > That it the reason - server3 has a definition of a parent in the > file on > nagios3 , and since the definition does not match the configuration > files , that is the error you get . > > I have no problem when i starting nagios in my central server, > > cuz central server has all configurations of my servers and switches > > > > but distributed nagios server which is nagios3 won't start. > > > > am I doing right or is there a misunderstanding to config nagios > properly? > > > The distributed nagios server should hold parent/child definition only > for the hosts they monitor directly, but not for anything "upstream" > toward the central server, since they are not being monitored from > that > machine , the central server should hold for every thing , so it can > know the topology and structure , if all the nagios servers are on the > same switch - the switch should be the parent for the distributed > servers . > > > oh, I got it. > so you mean, "server3" should configured to its parent to > nagios2(distribute monitor server) directly even if > there are A2 and BB1 switches between them. because nagios2 monitor > server3 directly. > am I correct? > Nope, it should be as follows (example parent -> child ) nagios2 -> A2 -> BB1 - Server 3 and A2 and BB1 should be hosts defined in nagios2 , to match the parent tree. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Wed Feb 24 18:29:53 2010 From: jim at jimavery.me.uk (Jim Avery) Date: Wed, 24 Feb 2010 17:29:53 +0000 Subject: check_cluster - Wrong service state In-Reply-To: <35799310.20100224132706@gmail.com> References: <35799310.20100224132706@gmail.com> Message-ID: <765d77c81002240929m439420e9tbeffee15ba90ba32@mail.gmail.com> On 24 February 2010 11:27, Brandino Andreas wrote: > Hi all, > > I configured check_cluster plugin to monitor two services that work > redundant (1 is up, 1 is down all the time). > > The plugin that check the services, returns "UNKNOWN" state for the > service that are down. However, "check_cluster" return that both > services are "OK" (CLUSTER OK: Disk-Cluster: 2 ok, 0 warning, 0 > unknown, 0 critical). > > Which can possible reasons for detecting incorrectly ?the state of > the services? The syntax for check_cluster is quite subtle and not obvious. I confess I don't use check_cluster for service checks, but I do for host checks. An example which I use for a pair of hosts is: In the command definition: command_line $USER1$/check_cluster --host -l $ARG1$ -w $ARG2$ -c $ARG3$ -d $ARG4$ And in the host definition: check_command check_host_cluster!"Storage Server"!0!1!$HOSTSTATEID:ds4100y1a$,$HOSTSTATEID:ds4100y1b$ In my example I want to have a warning state if one of the two hosts is down and have a critical state if both are down. The important point to note is that the warning and critical thresholds are set according to the documentation at http://nagiosplug.sourceforge.net/developer-guidelines.html#THRESHOLDFORMAT It can take a while to work out precisely how to set -w and -c to achieve the result you want. hth, Jim ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From hsjang at gmail.com Wed Feb 24 18:47:40 2010 From: hsjang at gmail.com (HyunSung Jang) Date: Thu, 25 Feb 2010 02:47:40 +0900 Subject: parent/child relationship in distribute environment In-Reply-To: <4B852A3A.3040901@flatto.net> References: <4B84FD4A.5030803@flatto.net> <4B851F7E.4080208@flatto.net> <4B852A3A.3040901@flatto.net> Message-ID: > > Nope, it should be as follows (example parent -> child ) > nagios2 -> A2 -> BB1 - Server 3 > I'm getting confusing. sorry for misunderstanding at this time, I read nagios documentation again and in my understanding that nagios2 is connected to A1 not A2. all of my nagios servers are connected to A1. so how can A2 be a parent for nagios2 ? and Server3 also connected to A2 not BB1 directly. > > and A2 and BB1 should be hosts defined in nagios2 , to match the parent > tree. is this means distributed nagios server must have all of hosts(it mean switch like A2, BB1) definition to match the parent tree?? what i really wanted to do was separate the hosts definitions to each distributed nagios servers. all switches goes to nagios1, server group1 goes to nagios2, server group2 goes to nagios3...etc. because i didn't want to duplicate the host definition. please look at my topology here. http://yfrog.com/1etopologyj from my topology, nagios1 checks server1, server2 directly then who should be a parent for server1, server2 from view of nagios1?? i thought that would be "A2" and BB1 is parent of A2 and A1 is parent of BB1 and nagios1 is parent of A1. this is same parent/child relationship for Nagios Central server. that means i have to duplicate object configuration or maintain same config file to central server and distribute server when i started with same cfg file to central and monitor server, it works fine but there are duplicated host check occured to the same host from central and distributed sever. do we need to host check enable in distributed server too?? -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rainhard.schwarzinger at gmx.at Wed Feb 24 20:39:31 2010 From: rainhard.schwarzinger at gmx.at (Rainhard) Date: Wed, 24 Feb 2010 20:39:31 +0100 Subject: Nagios MRTG does not deliver data from switch Message-ID: Hey, I want to get informations about the traffic of the port 1 from my cisco switch which understands SNMP. So I installed Nagios, the nagios plugins, snmpd, snmp and mrtg. I use the check_mrtgtraf command in my switch.cfg to do this. After installing mrtg like described at http://oss.oetiker.ch/mrtg/doc/mrtg-unix-guide.en.html I also used cfgmaker to get my mrtg.cfg file. This file looks like this: # Created by # ./cfgmaker --global "WorkDir: /etc/mrtg" --global "Options[_]: bits,growright" --output /etc/mrtg/cfg/mrtg.cfg 172.17.1.2 ### Global Config Options # for UNIX # WorkDir: /home/http/mrtg # or for NT # WorkDir: c:\mrtgdata ### Global Defaults # to get bits instead of bytes and graphs growing to the right # Options[_]: growright, bits EnableIPv6: no WorkDir: /etc/mrtg Options[_]: bits,growright ###################################################################### # System: # Description: # Contact: # Location: ###################################################################### As the WorkDir says, the log-Files created from mrtg should be in the directory /etc/mrtg but the only files I find there are three pictures from mrtg. My Nagios verion is 3.2.0, my nagios plugins version is 1.4.14 and my mrtg version is 2.16.3. So what did I do wrong?? Thanks & Regards Rainhard Schwarzinger -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Wed Feb 24 20:47:56 2010 From: marc at ena.com (Marc Powell) Date: Wed, 24 Feb 2010 13:47:56 -0600 Subject: Nagios MRTG does not deliver data from switch In-Reply-To: References: Message-ID: <2C1631C2-6784-451C-8712-016B68DD406A@ena.com> On Feb 24, 2010, at 1:39 PM, Rainhard wrote: > Hey, > > I want to get informations about the traffic of the port 1 from my cisco switch which understands SNMP. So I installed Nagios, the nagios plugins, snmpd, snmp and mrtg. I use the check_mrtgtraf command in my switch.cfg to do this. > After installing mrtg like described at http://oss.oetiker.ch/mrtg/doc/mrtg-unix-guide.en.html I also used cfgmaker to get my mrtg.cfg file. > As the WorkDir says, the log-Files created from mrtg should be in the directory /etc/mrtg but the only files I find there are three pictures from mrtg. > > My Nagios verion is 3.2.0, my nagios plugins version is 1.4.14 and my mrtg version is 2.16.3. > > So what did I do wrong?? This is not an MRTG support list. You'll have better luck asking for support there. Did you add the cronjob to actually run MRTG? Is it running successfully? -- Marc ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From akpgeek at gmail.com Wed Feb 24 21:53:07 2010 From: akpgeek at gmail.com (akp geek) Date: Wed, 24 Feb 2010 15:53:07 -0500 Subject: Installing NRPE Message-ID: <2024a9fb1002241253h7b6414d6u284ecc702953e52b@mail.gmail.com> Hi all - I have installed Nagios and working fine so far. I am running into one issue. I would like to install NRPE on several servers. But I might not get root access for that. my question is Can I install NRPE as an user ? Appreciate your help and thoughts Regards -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From akpgeek at gmail.com Wed Feb 24 22:25:24 2010 From: akpgeek at gmail.com (akp geek) Date: Wed, 24 Feb 2010 16:25:24 -0500 Subject: Installing NRPE In-Reply-To: <33C7BE4CE80D504E8907FDC31F8A083047BC3F7FE3@iexchange.I-EVOLVE.com> References: <2024a9fb1002241253h7b6414d6u284ecc702953e52b@mail.gmail.com> <33C7BE4CE80D504E8907FDC31F8A083047BC3F7FE3@iexchange.I-EVOLVE.com> Message-ID: <2024a9fb1002241325v7c700924q7634a5ed3dceea9f@mail.gmail.com> Can you please elaborate more on full sudo access. It is limited root privileges? Appreciate your time Regards On Wed, Feb 24, 2010 at 3:59 PM, Robert Wolfe wrote: > If you can get full sudo access, that should be good enough. > > > > *From:* akp geek [mailto:akpgeek at gmail.com] > *Sent:* Wednesday, February 24, 2010 3:53 PM > *To:* Nagios-Users Mailinglist > *Subject:* [Nagios-users] Installing NRPE > > > > Hi all - > > > > I have installed Nagios and working fine so far. I am running > into one issue. I would like to install NRPE on several servers. But I might > not get root access for that. my question is Can I install NRPE as an user ? > > > > Appreciate your help and thoughts > > > > Regards > -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rwolfe at i-evolve.com Wed Feb 24 21:59:32 2010 From: rwolfe at i-evolve.com (Robert Wolfe) Date: Wed, 24 Feb 2010 15:59:32 -0500 Subject: Installing NRPE In-Reply-To: <2024a9fb1002241253h7b6414d6u284ecc702953e52b@mail.gmail.com> References: <2024a9fb1002241253h7b6414d6u284ecc702953e52b@mail.gmail.com> Message-ID: <33C7BE4CE80D504E8907FDC31F8A083047BC3F7FE3@iexchange.I-EVOLVE.com> If you can get full sudo access, that should be good enough. From: akp geek [mailto:akpgeek at gmail.com] Sent: Wednesday, February 24, 2010 3:53 PM To: Nagios-Users Mailinglist Subject: [Nagios-users] Installing NRPE Hi all - I have installed Nagios and working fine so far. I am running into one issue. I would like to install NRPE on several servers. But I might not get root access for that. my question is Can I install NRPE as an user ? Appreciate your help and thoughts Regards -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Wed Feb 24 22:45:31 2010 From: patrick.morris at hp.com (patrick.morris at hp.com) Date: Wed, 24 Feb 2010 13:45:31 -0800 Subject: Installing NRPE In-Reply-To: <2024a9fb1002241253h7b6414d6u284ecc702953e52b@mail.gmail.com> References: <2024a9fb1002241253h7b6414d6u284ecc702953e52b@mail.gmail.com> Message-ID: <20100224214531.GI4621@bakgwai.americas.hpqcorp.net> On Wed, 24 Feb 2010, akp geek wrote: > Hi all - > > I have installed Nagios and working fine so far. I am running into one issue. I would like to install NRPE on several servers. But I might not get root access for that. my question is Can I install NRPE as an user ? Sure. Just make sure you run it on an unprivileged port (which may be the default, but I'm not somewhere I can confirm that). You don't need any root-level access. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From justin at justinshore.com Wed Feb 24 22:36:40 2010 From: justin at justinshore.com (Justin Shore) Date: Wed, 24 Feb 2010 15:36:40 -0600 Subject: Working version of check_cisco_ping Message-ID: <4B859BE8.2010603@justinshore.com> Does anyone know if there's a working version of check_cisco_ping? http://exchange.nagios.org/directory/Plugins/Hardware/Network-Gear/Cisco/check_cisco_ping/details It has an issue with utils.pm which is being deprecated in favor of Nagios::Plugin. What I need is the ability to SSH into a Cisco router, jump into enable mode, and ping inside of a VRF. I'm carrying a private state education network across my backbone and have no idea if it's having trouble until someone calls. I can't ping from the NMS server because I have to be inside the MPLS/VPN to do that. There are 2 points on my network that I can ping from within that VRF if I can just find a working plugin that can do it for me. Suggestions? Thanks Justin ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From rwolfe at i-evolve.com Wed Feb 24 22:58:32 2010 From: rwolfe at i-evolve.com (Robert Wolfe) Date: Wed, 24 Feb 2010 16:58:32 -0500 Subject: Installing NRPE In-Reply-To: <20100224214531.GI4621@bakgwai.americas.hpqcorp.net> References: <2024a9fb1002241253h7b6414d6u284ecc702953e52b@mail.gmail.com> <20100224214531.GI4621@bakgwai.americas.hpqcorp.net> Message-ID: <33C7BE4CE80D504E8907FDC31F8A083047BC3F7FEA@iexchange.I-EVOLVE.com> I believe the default port of 5666 is an unprivileged port. -----Original Message----- From: patrick.morris at hp.com [mailto:patrick.morris at hp.com] Sent: Wednesday, February 24, 2010 4:46 PM To: akp geek Cc: Nagios-Users Mailinglist Subject: Re: [Nagios-users] Installing NRPE On Wed, 24 Feb 2010, akp geek wrote: > Hi all - > > I have installed Nagios and working fine so far. I am running into one issue. I would like to install NRPE on several servers. But I might not get root access for that. my question is Can I install NRPE as an user ? Sure. Just make sure you run it on an unprivileged port (which may be the default, but I'm not somewhere I can confirm that). You don't need any root-level access. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From work at paul.dubuc.org Wed Feb 24 23:28:56 2010 From: work at paul.dubuc.org (Paul M. Dubuc) Date: Wed, 24 Feb 2010 17:28:56 -0500 Subject: servicegroups directive doesn't seem to work Message-ID: <4B85A828.2070006@paul.dubuc.org> Hello, The documentation at http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html for Service Definition says that you can use a 'servicegroups' directive to assign a service to a servicegroup instead of using the 'members' directive in the service group: > *servicegroups*: This directive is used to identify the /short > name(s)/ of the servicegroup(s) > > that the service belongs to. Multiple servicegroups should be > separated by commas. This directive may be used as an alternative to > using the /members/ directive in servicegroup > > definitions. I would like to do this using a service template that service definitions can use to do the assignment like the configuration below. This would save me from having to add many host,service pairs to the members directive in the service group. But it doesn't seem to work (I'm using Nagios 3.2.0). I get the following configuration error: Error: Servicegroup members must be specified in , pairs (config file ' ... I get the same error when I delete the service template and move the servicegroups directive into the service definitions. What am I doing wrong? Thanks, Paul Dubuc define servicegroup{ servicegroup_name ebusiness alias Business Services # members ; use servcicegroups in service definitions below instead. } # # Nagios service definition template used by services in this config file # define service{ name ebusiness-service register 0 ; this is a template servicegroups ebusiness ; add the service to this service group } define service{ use ebusiness-service host_name host1,host2 service_description service1 check_command ... } # # SciFinder Password Change test service # define service{ use ebusiness-service host_name host1,host2 service_description service2 check_command ... } ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marco.tirado at gmail.com Thu Feb 25 10:34:21 2010 From: marco.tirado at gmail.com (Marco Antonio Tirado Godoy) Date: Thu, 25 Feb 2010 01:34:21 -0800 (PST) Subject: Invitation to connect on LinkedIn Message-ID: <443186244.561885.1267090461570.JavaMail.app@ech3-cdn12.prod> LinkedIn ------------ I'd like to add you to my professional network on LinkedIn. - Marco Antonio Confirm that you know Marco Antonio Tirado Godoy https://www.linkedin.com/e/isd/1102226143/z52XZRMp/EML-invg_56/ ------ (c) 2010, LinkedIn Corporation -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ml at nicole-haehnel.de Thu Feb 25 12:22:14 2010 From: ml at nicole-haehnel.de (=?UTF-8?B?Tmljb2xlIEjDpGhuZWw=?=) Date: Thu, 25 Feb 2010 12:22:14 +0100 Subject: Problem with check_openmanage 3.5.6 In-Reply-To: <15tr5oad93z.fsf@tux.uio.no> References: <4B851AD7.1050904@nicole-haehnel.de> <15tr5oad93z.fsf@tux.uio.no> Message-ID: <4B865D66.80102@nicole-haehnel.de> Am 24.02.2010 15:03, schrieb Trond Hasle Amundsen: > Nicole H?hnel writes: > > >> Hi >> >> I get this message on one pe830 (OM 6.1.0) : >> >> CRITICAL: [ xxx] Physical Disk 0:0 [Wdc WD1600JS-55MHB0, 160GB] on ctrl 0 needs >> attention: >> ------ SYSTEM: PowerEdge 830, SN: xxx >> INTERNAL ERROR: Use of uninitialized value in string eq at /usr/lib64/nagios/ >> plugins/grontmij/check_openmanage line 1428. >> INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/ >> plugins/grontmij/check_openmanage line 1441. >> >> >> Is this a problem of check_openmanage or the disk? >> It's a non dell sata disk. >> > Hi Nicole, > > Can you provide the output of the following command, executed on the > monitored host: > > omreport storage pdisk controller=0 > > Also, are you using check_openmanage in SNMP or local context? > > Cheers, > Hi Trond, it's a windows server. So I'm using check_openmanage with snmp. check_openmanage -s -C $ARG1$ -H $HOSTADDRESS$ -e -i -p --state --check intrusion=1,alertlog=1,esmlog=1 -o 3 --htmlinfo de List of Physical Disks on Controller CERC SATA 1.5/6ch (Slot 4) Controller CERC SATA 1.5/6ch (Slot 4) ID : 0:0 Status : Unknown Name : Physical Disk 0:0 State : Unknown Failure Predicted : No Progress : Not Applicable Bus Protocol : SATA Media : HDD Capacity : 149.05 GB (160040681472 bytes) Used RAID Disk Space : 0.00 GB (0 bytes) Available RAID Disk Space : 0.00 GB (0 bytes) Hot Spare : No Vendor ID : WDC Product ID : WD1600JS-55MHB0 Revision : 02.0 Serial No. : WD-WCANM3083963 Negotiated Speed : Not Available Capable Speed : Not Available Manufacture Day : Not Available Manufacture Week : Not Available Manufacture Year : Not Available SAS Address : Not Available Thanks! Nicole ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Thu Feb 25 15:38:50 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Thu, 25 Feb 2010 15:38:50 +0100 Subject: Problem with check_openmanage 3.5.6 In-Reply-To: <4B865D66.80102@nicole-haehnel.de> ("Nicole =?utf-8?Q?H=C3=A4?= =?utf-8?Q?hnel=22's?= message of "Thu, 25 Feb 2010 12:22:14 +0100") References: <4B851AD7.1050904@nicole-haehnel.de> <15tr5oad93z.fsf@tux.uio.no> <4B865D66.80102@nicole-haehnel.de> Message-ID: <15tiq9lcrd1.fsf@tux.uio.no> Nicole H?hnel writes: > it's a windows server. > So I'm using check_openmanage with snmp. > > check_openmanage -s -C $ARG1$ -H $HOSTADDRESS$ -e -i -p --state --check > intrusion=1,alertlog=1,esmlog=1 -o 3 --htmlinfo de > > List of Physical Disks on Controller CERC SATA 1.5/6ch (Slot 4) > > Controller CERC SATA 1.5/6ch (Slot 4) > ID : 0:0 > Status : Unknown > Name : Physical Disk 0:0 > State : Unknown > Failure Predicted : No > Progress : Not Applicable > Bus Protocol : SATA > Media : HDD > Capacity : 149.05 GB (160040681472 bytes) > Used RAID Disk Space : 0.00 GB (0 bytes) > Available RAID Disk Space : 0.00 GB (0 bytes) > Hot Spare : No > Vendor ID : WDC > Product ID : WD1600JS-55MHB0 > Revision : 02.0 > Serial No. : WD-WCANM3083963 > Negotiated Speed : Not Available > Capable Speed : Not Available > Manufacture Day : Not Available > Manufacture Week : Not Available > Manufacture Year : Not Available > SAS Address : Not Available Ok, so the status and state are both "Unknown". I'm guessing that these values are completely missing in the SNMP output, which is why perl chokes on it. I've added some robustness in the code that should handle this case properly. Please try the beta version (3.5.7-beta1) available here: http://folk.uio.no/trondham/tmp/check_openmanage-3.5.7-beta1 The plugin will give an alert on the drive, which in my opinion is the correct thing to do. You can always blacklist the drive. The cause of the error is obviously that this is a non-Dell drive, which Openmanage doesn't know how to handle. BTW, you can reduce your command definition to this: check_openmanage -s -C $ARG1$ -H $HOSTADDRESS$ -e -i -p -a -o 3 --htmlinfo de The effect will be the same. You probably defined the command a while ago, and there have been some changes to options since then. Cheers, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ml at nicole-haehnel.de Thu Feb 25 16:08:10 2010 From: ml at nicole-haehnel.de (=?UTF-8?B?Tmljb2xlIEjDpGhuZWw=?=) Date: Thu, 25 Feb 2010 16:08:10 +0100 Subject: Problem with check_openmanage 3.5.6 In-Reply-To: <15tiq9lcrd1.fsf@tux.uio.no> References: <4B851AD7.1050904@nicole-haehnel.de> <15tr5oad93z.fsf@tux.uio.no> <4B865D66.80102@nicole-haehnel.de> <15tiq9lcrd1.fsf@tux.uio.no> Message-ID: <4B86925A.7010007@nicole-haehnel.de> Am 25.02.2010 15:38, schrieb Trond Hasle Amundsen: > Nicole H?hnel writes: > > >> it's a windows server. >> So I'm using check_openmanage with snmp. >> >> check_openmanage -s -C $ARG1$ -H $HOSTADDRESS$ -e -i -p --state --check >> intrusion=1,alertlog=1,esmlog=1 -o 3 --htmlinfo de >> >> List of Physical Disks on Controller CERC SATA 1.5/6ch (Slot 4) >> >> Controller CERC SATA 1.5/6ch (Slot 4) >> ID : 0:0 >> Status : Unknown >> Name : Physical Disk 0:0 >> State : Unknown >> Failure Predicted : No >> Progress : Not Applicable >> Bus Protocol : SATA >> Media : HDD >> Capacity : 149.05 GB (160040681472 bytes) >> Used RAID Disk Space : 0.00 GB (0 bytes) >> Available RAID Disk Space : 0.00 GB (0 bytes) >> Hot Spare : No >> Vendor ID : WDC >> Product ID : WD1600JS-55MHB0 >> Revision : 02.0 >> Serial No. : WD-WCANM3083963 >> Negotiated Speed : Not Available >> Capable Speed : Not Available >> Manufacture Day : Not Available >> Manufacture Week : Not Available >> Manufacture Year : Not Available >> SAS Address : Not Available >> > Ok, so the status and state are both "Unknown". I'm guessing that these > values are completely missing in the SNMP output, which is why perl > chokes on it. I've added some robustness in the code that should handle > this case properly. Please try the beta version (3.5.7-beta1) available > here: > > http://folk.uio.no/trondham/tmp/check_openmanage-3.5.7-beta1 > > The plugin will give an alert on the drive, which in my opinion is the > correct thing to do. You can always blacklist the drive. The cause of > the error is obviously that this is a non-Dell drive, which Openmanage > doesn't know how to handle. > > BTW, you can reduce your command definition to this: > > check_openmanage -s -C $ARG1$ -H $HOSTADDRESS$ -e -i -p -a -o 3 --htmlinfo de > > The effect will be the same. You probably defined the command a while > ago, and there have been some changes to options since then. > > Cheers, > Hi, I tested the new version: CRITICAL: [xxx] Physical Disk 0:0 [Wdc WD1600JS-55MHB0, 160GB] on ctrl 0 needs attention: ------ SYSTEM: PowerEdge 830, SN: xxx INTERNAL ERROR: Use of uninitialized value in string eq at /usr/lib64/nagios/plugins/grontmij/check_openmanage line 1432. INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/plugins/grontmij/check_openmanage line 1445. Yes, I defined the command some time ago. Nicole -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From t.h.amundsen at usit.uio.no Thu Feb 25 16:42:19 2010 From: t.h.amundsen at usit.uio.no (Trond Hasle Amundsen) Date: Thu, 25 Feb 2010 16:42:19 +0100 Subject: Problem with check_openmanage 3.5.6 In-Reply-To: <4B86925A.7010007@nicole-haehnel.de> ("Nicole =?utf-8?Q?H?= =?utf-8?Q?=C3=A4hnel=22's?= message of "Thu, 25 Feb 2010 16:08:10 +0100") References: <4B851AD7.1050904@nicole-haehnel.de> <15tr5oad93z.fsf@tux.uio.no> <4B865D66.80102@nicole-haehnel.de> <15tiq9lcrd1.fsf@tux.uio.no> <4B86925A.7010007@nicole-haehnel.de> Message-ID: <15teik9cof8.fsf@tux.uio.no> Nicole H?hnel writes: > I tested the new version: > > CRITICAL: [xxx] Physical Disk 0:0 [Wdc WD1600JS-55MHB0, 160GB] on ctrl 0 needs > attention: > ------ SYSTEM: PowerEdge 830, SN: xxx > INTERNAL ERROR: Use of uninitialized value in string eq at /usr/lib64/nagios/ > plugins/grontmij/check_openmanage line 1432. > INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/ > plugins/grontmij/check_openmanage line 1445. Hmm.. OK, new test: http://folk.uio.no/trondham/tmp/check_openmanage-3.5.7-beta2 Regards, -- Trond H. Amundsen Center for Information Technology Services, University of Oslo ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From work at paul.dubuc.org Thu Feb 25 17:25:06 2010 From: work at paul.dubuc.org (Paul M. Dubuc) Date: Thu, 25 Feb 2010 11:25:06 -0500 Subject: servicegroups directive doesn't seem to work In-Reply-To: <4B85A828.2070006@paul.dubuc.org> References: <4B85A828.2070006@paul.dubuc.org> Message-ID: <4B86A462.1000403@paul.dubuc.org> FYI, The reason this wasn't working was that there was 'use' directive in the service template that was using a template that also has a servicegroups directive for another service group (that line got edited out of my example). Putting a + sign in front of the ebusiness servicegroup name did the trick, adding the new service group instead of using it to replace the old one. Paul M. Dubuc wrote: > Hello, > > The documentation at > http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html for > Service Definition says that you can use a 'servicegroups' directive to > assign a service to a servicegroup instead of using the 'members' > directive in the service group: > >> *servicegroups*: This directive is used to identify the /short >> name(s)/ of the servicegroup(s) >> >> that the service belongs to. Multiple servicegroups should be >> separated by commas. This directive may be used as an alternative to >> using the /members/ directive in servicegroup >> >> definitions. > > I would like to do this using a service template that service > definitions can use to do the assignment like the configuration below. > This would save me from having to add many host,service pairs to the > members directive in the service group. But it doesn't seem to work > (I'm using Nagios 3.2.0). I get the following configuration error: > > Error: Servicegroup members must be specified in > , pairs (config file ' ... > > I get the same error when I delete the service template and move the > servicegroups directive into the service definitions. > What am I doing wrong? > > Thanks, > Paul Dubuc > > > define servicegroup{ > servicegroup_name ebusiness > alias Business Services > # members ; use servcicegroups in service definitions below instead. > } > > # > # Nagios service definition template used by services in this config file > # > define service{ > name ebusiness-service > register 0 ; this is a template > > servicegroups ebusiness ; add the service to this > service group > > } > > > define service{ > use ebusiness-service > host_name host1,host2 > service_description service1 > > check_command ... > > } > > # > # SciFinder Password Change test service > # > define service{ > use ebusiness-service > host_name host1,host2 > service_description service2 > > check_command ... > > } > ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From ml at nicole-haehnel.de Thu Feb 25 19:41:24 2010 From: ml at nicole-haehnel.de (=?UTF-8?B?Tmljb2xlIEjDpGhuZWw=?=) Date: Thu, 25 Feb 2010 19:41:24 +0100 Subject: Problem with check_openmanage 3.5.6 In-Reply-To: <15teik9cof8.fsf@tux.uio.no> References: <4B851AD7.1050904@nicole-haehnel.de> <15tr5oad93z.fsf@tux.uio.no> <4B865D66.80102@nicole-haehnel.de> <15tiq9lcrd1.fsf@tux.uio.no> <4B86925A.7010007@nicole-haehnel.de> <15teik9cof8.fsf@tux.uio.no> Message-ID: <4B86C454.6000305@nicole-haehnel.de> Am 25.02.2010 16:42, schrieb Trond Hasle Amundsen: > Nicole H?hnel writes: > > >> I tested the new version: >> >> CRITICAL: [xxx] Physical Disk 0:0 [Wdc WD1600JS-55MHB0, 160GB] on ctrl 0 needs >> attention: >> ------ SYSTEM: PowerEdge 830, SN: xxx >> INTERNAL ERROR: Use of uninitialized value in string eq at /usr/lib64/nagios/ >> plugins/grontmij/check_openmanage line 1432. >> INTERNAL ERROR: Use of uninitialized value in sprintf at /usr/lib64/nagios/ >> plugins/grontmij/check_openmanage line 1445. >> > Hmm.. OK, new test: > > http://folk.uio.no/trondham/tmp/check_openmanage-3.5.7-beta2 > > Regards, > No, same error: CRITICAL: [xx] Physical Disk 0:0 [Wdc WD1600JS-55MHB0, 160GB] on ctrl 0 needs attention: ------ SYSTEM: PowerEdge 830, SN: xx INTERNAL ERROR: Use of uninitialized value in string eq at ./check_openmanage-3.5.7-beta2 line 1432. INTERNAL ERROR: Use of uninitialized value in sprintf at ./check_openmanage-3.5.7-beta2 line 1445.|'fan_1_bmc_back_fan'=2052RPM;0;0 'temp_0_proc'=33C;120;125 'temp_1_bmc_planar'=36C;53;58 Nicole ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From joao.miguel.c.ferreira at gmail.com Thu Feb 25 20:56:26 2010 From: joao.miguel.c.ferreira at gmail.com (Joao Ferreira gmail) Date: Thu, 25 Feb 2010 19:56:26 +0000 Subject: nrpe plugins fail when pass arguments Message-ID: <1267127786.4792.34.camel@debj5n.critical.pt> Hello, I have a strange issue with nrpe - nrpe compiled with support for arguments - option dont_blame_nrpe = 1 plugins fail if I pass arguments; tested with check_tcp, check_icmp and check_load; none of them works if arguments come from the nagios station; they all work if arguments are statically written in nrpe.cfg I've tested the same command: - 1st with all args typed directly in nrpe.cfg: works fine - 2nd with args passed from the "client": fails. bellow a log with details; the only difference between the 1st and the second is that the 2nd receives args from the "client" nagios station Thank you Joao #nrpe boot Feb 25 15:17:27 ebox nrpe[10773]: Added command[check_ssh_localhost]=/usr/lib/nagios/libexec/check_tcp -H localhost -p 22 Feb 25 15:17:27 ebox nrpe[10773]: Added command[check_tcp_host_port]=/usr/lib/nagios/plugins/check_tcp -H $ARG1$ -p $ARG2$ Feb 25 15:17:27 ebox nrpe[10773]: INFO: SSL/TLS initialized. All network traffic will be encrypted. Feb 25 15:17:27 ebox nrpe[10774]: Starting up daemon Feb 25 15:17:27 ebox nrpe[10774]: Warning: Daemon is configured to accept command arguments from clients! Feb 25 15:17:27 ebox nrpe[10774]: Listening for connections on port 5666 #command check_ssh_localhost Feb 25 15:17:55 ebox nrpe[10779]: Connection from 172.16.151.1 port 23788 Feb 25 15:17:55 ebox nrpe[10779]: Handling the connection... Feb 25 15:17:55 ebox nrpe[10779]: Host is asking for command 'check_ssh_localhost' to be run... Feb 25 15:17:55 ebox nrpe[10779]: Running command: /usr/lib/nagios/libexec/check_tcp -H localhost -p 22 Feb 25 15:17:55 ebox nrpe[10779]: Command completed with return code 0 and output: TCP OK - 0.030 second response time on port 22| time=0.030150s;;;0.000000;10.000000 Feb 25 15:17:55 ebox nrpe[10779]: Return Code: 0, Output: TCP OK - 0.030 second response time on port 22|time=0.030150s;;;0.000000;10.000000 Feb 25 15:17:55 ebox nrpe[10779]: Connection from 172.16.151.1 closed. #command check_tcp_host_port Feb 25 15:18:49 ebox nrpe[10827]: Connection from 172.16.151.1 port 25836 Feb 25 15:18:49 ebox nrpe[10827]: Handling the connection... Feb 25 15:18:49 ebox nrpe[10827]: Host is asking for command 'check_tcp_host_port' to be run... Feb 25 15:18:49 ebox nrpe[10827]: Running command: /usr/lib/nagios/plugins/check_tcp -H localhost -p 22 Feb 25 15:18:49 ebox nrpe[10827]: Command completed with return code 3 and output: Feb 25 15:18:49 ebox nrpe[10827]: Return Code: 3, Output: NRPE: Unable to read output Feb 25 15:18:49 ebox nrpe[10827]: Connection from 172.16.151.1 closed. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From marc at ena.com Thu Feb 25 21:37:08 2010 From: marc at ena.com (Marc Powell) Date: Thu, 25 Feb 2010 14:37:08 -0600 Subject: nrpe plugins fail when pass arguments In-Reply-To: <1267127786.4792.34.camel@debj5n.critical.pt> References: <1267127786.4792.34.camel@debj5n.critical.pt> Message-ID: On Feb 25, 2010, at 1:56 PM, Joao Ferreira gmail wrote: > bellow a log with details; the only difference between the 1st and the > second is that the 2nd receives args from the "client" nagios station > Feb 25 15:17:55 ebox nrpe[10779]: Running > command: /usr/lib/nagios/libexec/check_tcp -H localhost -p 22 > Feb 25 15:17:55 ebox nrpe[10779]: Command completed with return code 0 > Feb 25 15:18:49 ebox nrpe[10827]: Running > command: /usr/lib/nagios/plugins/check_tcp -H localhost -p 22 > Feb 25 15:18:49 ebox nrpe[10827]: Command completed with return code 3 You're also changing the path to the plugin in the command definition. I'm guessing it isn't valid any longer. -- Marc ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Thu Feb 25 21:28:23 2010 From: patrick.morris at hp.com (patrick.morris at hp.com) Date: Thu, 25 Feb 2010 12:28:23 -0800 Subject: nrpe plugins fail when pass arguments In-Reply-To: <1267127786.4792.34.camel@debj5n.critical.pt> References: <1267127786.4792.34.camel@debj5n.critical.pt> Message-ID: <20100225202823.GP4621@bakgwai.americas.hpqcorp.net> Are you sure your configs are correct? I noticed your config seems to specify two differnet locations for the check_tcp plugin: > Feb 25 15:17:55 ebox nrpe[10779]: Running > command: /usr/lib/nagios/libexec/check_tcp -H localhost -p 22 > Feb 25 15:18:49 ebox nrpe[10827]: Running > command: /usr/lib/nagios/plugins/check_tcp -H localhost -p 22 My guess is that the second one is erroring out because the plugin is not there. On Thu, 25 Feb 2010, Joao Ferreira gmail wrote: > Hello, > > I have a strange issue with nrpe > > - nrpe compiled with support for arguments > - option dont_blame_nrpe = 1 > > plugins fail if I pass arguments; tested with check_tcp, check_icmp and > check_load; none of them works if arguments come from the nagios > station; they all work if arguments are statically written in nrpe.cfg > > I've tested the same command: > - 1st with all args typed directly in nrpe.cfg: works fine > - 2nd with args passed from the "client": fails. > > bellow a log with details; the only difference between the 1st and the > second is that the 2nd receives args from the "client" nagios station > > Thank you > Joao > > > > #nrpe boot > Feb 25 15:17:27 ebox nrpe[10773]: Added > command[check_ssh_localhost]=/usr/lib/nagios/libexec/check_tcp -H > localhost -p 22 > Feb 25 15:17:27 ebox nrpe[10773]: Added > command[check_tcp_host_port]=/usr/lib/nagios/plugins/check_tcp -H $ARG1$ > -p $ARG2$ > Feb 25 15:17:27 ebox nrpe[10773]: INFO: SSL/TLS initialized. All network > traffic will be encrypted. > Feb 25 15:17:27 ebox nrpe[10774]: Starting up daemon > Feb 25 15:17:27 ebox nrpe[10774]: Warning: Daemon is configured to > accept command arguments from clients! > Feb 25 15:17:27 ebox nrpe[10774]: Listening for connections on port > 5666 > > #command check_ssh_localhost > Feb 25 15:17:55 ebox nrpe[10779]: Connection from 172.16.151.1 port > 23788 > Feb 25 15:17:55 ebox nrpe[10779]: Handling the connection... > Feb 25 15:17:55 ebox nrpe[10779]: Host is asking for command > 'check_ssh_localhost' to be run... > Feb 25 15:17:55 ebox nrpe[10779]: Running > command: /usr/lib/nagios/libexec/check_tcp -H localhost -p 22 > Feb 25 15:17:55 ebox nrpe[10779]: Command completed with return code 0 > and output: TCP OK - 0.030 second response time on port 22| > time=0.030150s;;;0.000000;10.000000 > Feb 25 15:17:55 ebox nrpe[10779]: Return Code: 0, Output: TCP OK - 0.030 > second response time on port 22|time=0.030150s;;;0.000000;10.000000 > Feb 25 15:17:55 ebox nrpe[10779]: Connection from 172.16.151.1 closed. > > #command check_tcp_host_port > Feb 25 15:18:49 ebox nrpe[10827]: Connection from 172.16.151.1 port > 25836 > Feb 25 15:18:49 ebox nrpe[10827]: Handling the connection... > Feb 25 15:18:49 ebox nrpe[10827]: Host is asking for command > 'check_tcp_host_port' to be run... > Feb 25 15:18:49 ebox nrpe[10827]: Running > command: /usr/lib/nagios/plugins/check_tcp -H localhost -p 22 > Feb 25 15:18:49 ebox nrpe[10827]: Command completed with return code 3 > and output: > Feb 25 15:18:49 ebox nrpe[10827]: Return Code: 3, Output: NRPE: Unable > to read output > Feb 25 15:18:49 ebox nrpe[10827]: Connection from 172.16.151.1 closed. > > > > ------------------------------------------------------------------------------ > Download Intel® Parallel Studio Eval > Try the new software tools for yourself. Speed compiling, find bugs > proactively, and fine-tune applications for parallel performance. > See why Intel Parallel Studio got high marks during beta. > http://p.sf.net/sfu/intel-sw-dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kaushalshriyan at gmail.com Fri Feb 26 08:43:46 2010 From: kaushalshriyan at gmail.com (Kaushal Shriyan) Date: Fri, 26 Feb 2010 13:13:46 +0530 Subject: customized init script Message-ID: <6b16fb4c1002252343pea33497p406ffb6fd3e0db19@mail.gmail.com> Hi, Any recommended method to monitor customized init scripts. I know check_tcp will only monitor ports. Please suggest. Thanks, Kaushal ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From kaushalshriyan at gmail.com Fri Feb 26 10:43:16 2010 From: kaushalshriyan at gmail.com (Kaushal Shriyan) Date: Fri, 26 Feb 2010 15:13:16 +0530 Subject: check_tcp Message-ID: <6b16fb4c1002260143h793eed29l1ffb60d9fc166654@mail.gmail.com> Hi, is it documented on how check_tcp plugin works ? Thanks, Kaushal ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From joao.miguel.c.ferreira at gmail.com Fri Feb 26 10:58:04 2010 From: joao.miguel.c.ferreira at gmail.com (Joao Ferreira gmail) Date: Fri, 26 Feb 2010 09:58:04 +0000 Subject: check_tcp In-Reply-To: <6b16fb4c1002260143h793eed29l1ffb60d9fc166654@mail.gmail.com> References: <6b16fb4c1002260143h793eed29l1ffb60d9fc166654@mail.gmail.com> Message-ID: <1267178284.5322.9.camel@debj5n.critical.pt> On Fri, 2010-02-26 at 15:13 +0530, Kaushal Shriyan wrote: > Hi, > > is it documented on how check_tcp plugin works ? try this: /PATH/TO/YOUR/PLUGINS/check_tcp --help or this http://nagiosplugins.org/man/check_tcp Joao > > Thanks, > > Kaushal > > ------------------------------------------------------------------------------ > Download Intel® Parallel Studio Eval > Try the new software tools for yourself. Speed compiling, find bugs > proactively, and fine-tune applications for parallel performance. > See why Intel Parallel Studio got high marks during beta. > http://p.sf.net/sfu/intel-sw-dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From martinm at op5.org Fri Feb 26 10:57:22 2010 From: martinm at op5.org (Martin Melin) Date: Fri, 26 Feb 2010 10:57:22 +0100 Subject: check_tcp In-Reply-To: <6b16fb4c1002260143h793eed29l1ffb60d9fc166654@mail.gmail.com> References: <6b16fb4c1002260143h793eed29l1ffb60d9fc166654@mail.gmail.com> Message-ID: <5f0bcb7c1002260157r63ab99dbv8d167a6aa8260dae@mail.gmail.com> What are you looking for that's not in the help output? http://nagiosplugins.org/man/check_tcp On Fri, Feb 26, 2010 at 10:43 AM, Kaushal Shriyan wrote: > Hi, > > is it documented on how check_tcp plugin works ? > > Thanks, > > Kaushal > > > ------------------------------------------------------------------------------ > Download Intel® Parallel Studio Eval > Try the new software tools for yourself. Speed compiling, find bugs > proactively, and fine-tune applications for parallel performance. > See why Intel Parallel Studio got high marks during beta. > http://p.sf.net/sfu/intel-sw-dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when > reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- Martin Melin ____________________________ op5 AB http://www.op5.com http://www.op5.org/ http://www.op5.com/op5/products/network-monitor/nagios/ -------------- next part -------------- An HTML attachment was scrubbed... URL: -------------- next part -------------- ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev -------------- next part -------------- _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From standalone.sysadmin at gmail.com Fri Feb 26 17:07:59 2010 From: standalone.sysadmin at gmail.com (Matt Simmons) Date: Fri, 26 Feb 2010 11:07:59 -0500 Subject: customized init script In-Reply-To: <6b16fb4c1002252343pea33497p406ffb6fd3e0db19@mail.gmail.com> References: <6b16fb4c1002252343pea33497p406ffb6fd3e0db19@mail.gmail.com> Message-ID: <5bcb62b61002260807p511a18f1g7754969a0d0b0f35@mail.gmail.com> Do you want to monitor whether they ran correctly, or whether the daemons they started are still running correctly? --Matt On Fri, Feb 26, 2010 at 2:43 AM, Kaushal Shriyan wrote: > Hi, > > Any recommended method to monitor customized init scripts. I know > check_tcp will only monitor ports. > > Please suggest. > > Thanks, > > Kaushal > > ------------------------------------------------------------------------------ > Download Intel® Parallel Studio Eval > Try the new software tools for yourself. Speed compiling, find bugs > proactively, and fine-tune applications for parallel performance. > See why Intel Parallel Studio got high marks during beta. > http://p.sf.net/sfu/intel-sw-dev > _______________________________________________ > Nagios-users mailing list > Nagios-users at lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nagios-users > ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. > ::: Messages without supporting info will risk being sent to /dev/null > -- LITTLE GIRL: But which cookie will you eat FIRST? COOKIE MONSTER: Me think you have misconception of cookie-eating process. COOKIE MONSTER: Boy, I wish I were a sysadmin so I could go to the NJ-PICC Sysadmin Conference! http://www.picconf.org ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From jim at jimavery.me.uk Fri Feb 26 20:57:33 2010 From: jim at jimavery.me.uk (Jim Avery) Date: Fri, 26 Feb 2010 19:57:33 +0000 Subject: customized init script In-Reply-To: <6b16fb4c1002252343pea33497p406ffb6fd3e0db19@mail.gmail.com> References: <6b16fb4c1002252343pea33497p406ffb6fd3e0db19@mail.gmail.com> Message-ID: <765d77c81002261157n62aecae4pe6614de636544183@mail.gmail.com> On 26 February 2010 07:43, Kaushal Shriyan wrote: > Any recommended method to monitor customized init scripts. I know > check_tcp will only monitor ports. I would edit the init script to use send_nsca to send an appropriate check result to Nagios. Of course you will need to install and configure nsca on your Nagios server and configure a passive service check in Nagios to receive the check results. ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null From patrick.morris at hp.com Sun Feb 28 09:09:17 2010 From: patrick.morris at hp.com (Morris, Patrick) Date: Sun, 28 Feb 2010 00:09:17 -0800 Subject: customized init script In-Reply-To: <6b16fb4c1002252343pea33497p406ffb6fd3e0db19@mail.gmail.com> References: <6b16fb4c1002252343pea33497p406ffb6fd3e0db19@mail.gmail.com> Message-ID: <4B8A24AD.5020505@hp.com> Kaushal Shriyan wrote: > Hi, > > Any recommended method to monitor customized init scripts. I know > check_tcp will only monitor ports. You may need to rephrase the question; it's not making sense to me, anyway. What about these scripts do you want to monitor? Whether they exist? When they last ran? If they're still running? If the app they start/stop is running? ------------------------------------------------------------------------------ Download Intel® Parallel Studio Eval Try the new software tools for yourself. Speed compiling, find bugs proactively, and fine-tune applications for parallel performance. See why Intel Parallel Studio got high marks during beta. http://p.sf.net/sfu/intel-sw-dev _______________________________________________ Nagios-users mailing list Nagios-users at lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nagios-users ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. ::: Messages without supporting info will risk being sent to /dev/null