|Sent on:||Thursday, August 23, 2012 8:21 PM|
This thread was What is DevOps, but it morphed into a monitoring thread. I'll return to what is DevOps later, but for now I wanted to offer up some monitoring resources for everyone.Following up on what Craig said regarding the monitoring sucks project on github, this is the post that explains it all: http://lusislog.blogspot.com/2011/06/why-monitoring-sucks.htmlJohn Vincent, @lusis on Twitter, explains what made him start the repo and it's various purposes. The repo itself is here:For great explanations on how Etsy uses monitoring, their Code as Craft blog is invaluable. Here is a blog post where they talk about using Graphite:Follow their blog if you don't already.Someone mentioned New Relic as a company that can provide monitoring for your app. Along with New Relic, BlueStripe is a local company that also provides a way to monitor your app. Boundary is another company doing interesting things in the space, as they provide network monitoring. I personally have not used any of these company's apps, but perhaps they might prove useful to you.As far as doing monitoring you setup yourself, Nagios still appears to be king. Frequently I've seen Nagios paired with Munin or Cacti and more and more I'm seeing Graphite, sometimes paired with StatsD (Etsy blogged on Statsd here).There is now also a tool called Rorschach to allow you to do alerting off of Graphite data.Some of these things I've used, some of them I haven't. Clearly there is a lot going on in the monitoring space and it is evolving a lot after having stagnated for a while.Coincidently, the next meetup is on Bronto's use of some of these monitoring tools. If you're interested in monitoring it would be a good meetup to attend.- Mark
On Wednesday, August 15, 2012 at 9:43 AM, Michael Hale wrote:Where I work (heroku.com) we use a bit of a home grown approach to metrics/monitoring/visibility using tools like graphite and some that we have open-sourced: http://vimeo.com/43699577. One of the things I think is really cool is how we are able to alert on data that we trend to graphite. For example if service A has more than 100 connections over say 5 minutes we could trigger a nagios alert.
Please Note: If you hit "REPLY", your message will be sent to everyone on this mailing list ([address removed])
This message was sent by Mark Mzyk ([address removed]) from Triangle DevOps.
To learn more about Mark Mzyk, visit his/her member profile
Set my mailing list to email me As they are sent | In one daily email | Don't send me mailing list messages
Meetup, PO Box 4668 #37895 New York, New York[masked] | [address removed]