"Automating incident response enables organizations to detect and resolve issues faster, reducing the mean time to resolution and minimize the impact of incidents on the business."
Site Reliability Engineering
Reliability and Availability of Cloud Computing
The Site Reliability Workbook: Practical Ways to Implement SRE
The Phoenix Project: A Novel about IT, DevOPs, and helping Your Business win
The DevOps Handbook: How to Create World -Class Agility, Reliability, & Security in Technology Organizations
Implementing Service Level Objectives
DevSecOps: A leader's guide to producing secure software without compromising flow, feedback and continuous improvement
Team Topologies: Organizing Business and Technology Teams for Fast Flow
The Lean Startup
Reliability.org is a community for people interested in achieving better software application and infrastructure reliability.
Follow us on