Главная
Study mode:
on
1
Intro
2
And the problem space is complex.
3
Write workload, trailing year
4
Read workload, trailing year
5
Service Level Objectives (SLO)
6
Data storage engine and analytics flow
7
SLOs are user flows
8
Service-Level Objectives
9
Functional and visual testing.
10
Design for feature flag deployment.
11
Automated integration & human review.
12
Green button merge.
13
Auto-updates, rollbacks, & pins.
14
Observe behavior in prod.
15
Non-trivial savings.
16
Three case studies of failure
17
1 Shepherd: ingest API service
18
Honeycomb Ingest Outage
19
Now what?
20
Kafka: data bus
21
Our month of Kafka pain
22
Unexpected constraints
23
Take care of your people
24
Optimize for safety
25
Retriever: query service
26
Making progress carefully
27
Takeaways
28
Acknowledge hidden risks
29
Make experimentation routine!
30
Understand & control production.
Description:
Explore the challenges of modern distributed systems engineering and the importance of observability in a 58-minute conference talk by Charity Majors, CTO at Honeycomb.io. Delve into the complexities of modern development paradigms and their impact on operational futures. Examine the limitations of traditional debugging tools in the face of increasingly complex systems. Discover potential disasters awaiting distributed systems engineers and learn how instrumentation and observability can help mitigate these risks. Gain insights into adapting tooling and organizational culture to keep pace with evolving technologies. Follow case studies of failure, including issues with ingest API services, Kafka data buses, and query services. Understand the importance of acknowledging hidden risks, making experimentation routine, and maintaining control over production environments. Benefit from Majors' extensive experience in systems engineering and database management at companies like Facebook, Parse, and Linden Lab. Read more

Observability and the Future of Complex Systems

ChariotSolutions
Add to list