Hi Marc (& others that answered my question),<br> I share your mindset that if using polling 1K instances every 10 seconds (or so) wouldn't make any sense.<br> When I post file locking might be an issue, I was not thinking on a pull mechanism for services (Nagios trying to retrieve information), but in Nagios being pushed by services for this kind of info. As far as I read from documentation, for a push schema, the monitored instance has to write a given file inside Nagios structure to update his status.<br> What do you think about this?<br> Regards,<br><br>Sebastian<br><br> <div class="MsoPlainText">-----Original Message-----<br> From: nagios-users-bounces@lists.sourceforge.net [mailto:nagios-users-bounces@lists.sourceforge.net] On Behalf Of Marc Powell<br> Sent: Martes, 10 de Abril de 2007 12:33 p.m.<br> To: nagios-users@lists.sourceforge.net<br> Subject: Re: [Nagios-users] Files locking
might be an issue for scalability?</div> <div class="MsoPlainText"><o:p> </o:p></div> <div class="MsoPlainText"><o:p> </o:p></div> <div class="MsoPlainText"><o:p> </o:p></div> <div class="MsoPlainText">> -----Original Message-----<o:p></o:p></div> <div class="MsoPlainText">> From: nagios-users-bounces@lists.sourceforge.net [mailto:nagios-users-<o:p></o:p></div> <div class="MsoPlainText">> bounces@lists.sourceforge.net] On Behalf Of Sebastian Ganame<o:p></o:p></div> <div class="MsoPlainText">> Sent: Tuesday, April 10, 2007 10:13 AM<o:p></o:p></div> <div class="MsoPlainText">> To: nagios-users@lists.sourceforge.net<o:p></o:p></div> <div class="MsoPlainText">> Subject: [Nagios-users] Files locking might be an issue for<o:p></o:p></div> <div class="MsoPlainText">scalability?<o:p></o:p></div> <div class="MsoPlainText">> <o:p></o:p></div> <div class="MsoPlainText"><o:p> </o:p></div> <div
class="MsoPlainText"><o:p> </o:p></div> <div class="MsoPlainText">><span style=""> </span>I know that Nagios is able to do this kind of monitoring,<o:p></o:p></div> <div class="MsoPlainText">but<o:p></o:p></div> <div class="MsoPlainText">> as far as I understood from the documentation, Nagios saves all cache<o:p></o:p></div> <div class="MsoPlainText">and<o:p></o:p></div> <div class="MsoPlainText">> status of the monitored environment in text files inside the file<o:p></o:p></div> <div class="MsoPlainText">system,<o:p></o:p></div> <div class="MsoPlainText">> and I was wondering how scalable would this be when trying to online<o:p></o:p></div> <div class="MsoPlainText">> monitor (lets say) 10 services per machine in a 1K machines<o:p></o:p></div> <div class="MsoPlainText">environment<o:p></o:p></div> <div class="MsoPlainText">> that might update their status every 10
seconds. Do you think<o:p></o:p></div> <div class="MsoPlainText">text-files<o:p></o:p></div> <div class="MsoPlainText">> locking might be an issue in this case? I cannot found architecture<o:p></o:p></div> <div class="MsoPlainText">> details on Nagios that might answer my question, and maybe some of you<o:p></o:p></div> <div class="MsoPlainText">can<o:p></o:p></div> <div class="MsoPlainText">> give me an insight on this matter.<o:p></o:p></div> <div class="MsoPlainText"><o:p> </o:p></div> <div class="MsoPlainText">AFAIK, there is only ever 1 nagios process that writes the status files<o:p></o:p></div> <div class="MsoPlainText">at the interval specified by status_update_interval presuming<o:p></o:p></div> <div class="MsoPlainText">aggregate_status_updates is enabled, and it should for that many<o:p></o:p></div> <div class="MsoPlainText">services. File contention isn't an issue, but how long it takes to<o:p></o:p></div> <div
class="MsoPlainText">actually write the file might be if you actually want to see<o:p></o:p></div> <div class="MsoPlainText">up-to-the-second status information in the web interface.<o:p></o:p></div> <div class="MsoPlainText"><o:p> </o:p></div> <div class="MsoPlainText">I personally think you're going to have other scalability issues when<o:p></o:p></div> <div class="MsoPlainText">trying to actually perform 10,000 checks every 10 seconds. That's<o:p></o:p></div> <div class="MsoPlainText">extremely aggressive IMHO and I believe you'd be breaking new ground.<o:p></o:p></div> <div class="MsoPlainText">I'm aware of other installations with that many services but I believe<o:p></o:p></div> <div class="MsoPlainText">they're at a more sane check interval of 5 minutes or longer.<o:p></o:p></div> <div class="MsoPlainText"><o:p> </o:p></div> <div class="MsoPlainText">--<o:p></o:p></div> <div class="MsoPlainText">Marc<o:p></o:p></div> <div
class="MsoPlainText"><o:p> </o:p></div> <div class="MsoPlainText">-------------------------------------------------------------------------<o:p></o:p></div> <div class="MsoPlainText">Take Surveys. Earn Cash. Influence the Future of IT<o:p></o:p></div> <div class="MsoPlainText">Join SourceForge.net's Techsay panel and you'll get the chance to share your<o:p></o:p></div> <div class="MsoPlainText">opinions on IT & business topics through brief surveys-and earn cash<o:p></o:p></div> <div class="MsoPlainText">http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV<o:p></o:p></div> <div class="MsoPlainText">_______________________________________________<o:p></o:p></div> <div class="MsoPlainText">Nagios-users mailing list<o:p></o:p></div> <div class="MsoPlainText">Nagios-users@lists.sourceforge.net<o:p></o:p></div> <div class="MsoPlainText">https://lists.sourceforge.net/lists/listinfo/nagios-users<o:p></o:p></div> <div
class="MsoPlainText">::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. <o:p></o:p></div> <div class="MsoPlainText">::: Messages without supporting info will risk being sent to /dev/null<o:p></o:p></div> <br><p>
<hr size=1>
<b>Preguntá. Respondé. Descubrí.</b><br>
Todo lo que querías saber, y lo que ni imaginabas,<br>
está en <b>Yahoo! Respuestas</b> (Beta).<br>
<b><a href="http://ar.answers.yahoo.com">Probalo ya!</a></b>