On code.hootsuite.com you can find an article I’ve written about measuring service latency with StatsD.
What are the key takeaways when monitoring service performance?
- Measuring average response times is a bad way of assessing service performance
- Maximum response time data offers much more useful information
- StatsD + Graphite is a match made in heaven (with some tweaking)
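
To make the first two takeaways concrete, here is a minimal Python sketch comparing the mean, a 90th percentile, and the maximum of a latency sample. The sample values and the `summarize` helper are hypothetical and purely illustrative; they are not taken from the article, and the percentile uses a simple nearest-rank method that may differ from how your metrics pipeline computes it.

```python
import math
import statistics


def summarize(latencies_ms):
    """Return the mean, nearest-rank 90th percentile, and max of a latency sample."""
    ordered = sorted(latencies_ms)
    # Nearest-rank percentile: 1-based rank of the value at or above 90% of samples.
    rank = math.ceil(len(ordered) * 90 / 100)
    return {
        "mean": statistics.mean(ordered),
        "p90": ordered[rank - 1],
        "max": ordered[-1],
    }


# Hypothetical sample: 98 fast requests (20 ms) and 2 very slow ones (2000 ms).
sample = [20] * 98 + [2000] * 2
summary = summarize(sample)

for name, value in summary.items():
    print(f"{name}: {value} ms")
```

With this made-up sample the mean comes out near 60 ms, a figure that matches neither the typical request (20 ms) nor the slowest one (2000 ms). Only the maximum reveals that some callers waited two full seconds.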