Explore the fundamentals of Site Reliability Engineering (SRE) practices in this 50-minute Linux Foundation webinar. Delve into the three measurements central to maintaining a reliable and robust platform: service level agreements (SLAs), service level objectives (SLOs), and service level indicators (SLIs). Gain insights into establishing a culture of reliability and navigating your reliability journey. Learn about the three pillars of reliability, complex systems, and the idea that slowness is the new downtime. Compare DevOps and SRE approaches, see how SLAs relate to objectives and indicators, and discover the four golden signals of infrastructure management. Examine the current state of reliability, explore blameless practices, and understand the importance of root cause analysis. Cover topics such as availability, DevSecOps, and the role of different groups in leading reliability efforts.
Reliability, Everyone’s Responsibility - Intro to Site Reliability Engineering Practices