The 18 ghosts in your infrastructure stack that can cause failure (and how to avoid them)
Simple/hard metrics that help reduce MTTR when looking for a root cause
Health Checks and Graceful Degradation in Distributed Systems
Why We Chose Kafka For The Trello Socket Architecture
AWS Parameter Store