Главная
Study mode:
on
1
Intro
2
Nat Welch
3
Quick Aside: Context
4
Monitoring!
5
Grow monitoring to match your business needs
6
Incident Response
7
The fight against noise
8
Postmortems!
9
Google Compute Engine Postmortems
10
Testing & Releasing
11
Two quick stories
12
Capacity Planning
13
Pogostick
14
Development
15
Communication!
16
Examples
17
User Experience
18
References. Further Reading
Description:
Explore a conference talk that delves into the practical applications of the Dickerson Pyramid in Site Reliability Engineering. Learn how to implement each level of the hierarchy using real-life examples from Google, Hillary for America, and First Look Media. Discover how to define reliability for your organization and prevent future outages by focusing on monitoring, incident response, postmortems, testing and releasing, capacity planning, development, and product. Gain insights from Nat Welch's decade-long experience in software engineering and his role as Lead Site Reliability Engineer at First Look Media. Understand how SRE priorities differ from those of product engineers and how to apply these concepts to improve your organization's reliability practices.

Practical Applications of the Dickerson Pyramid

Strange Loop Conference
Add to list