Главная
Study mode:
on
1
Introduction
2
Schematic diagram
3
High utilization vs low latency
4
Common problem with utilization
5
Request timeline
6
Service time
7
Measuring tail latency
8
Measuring utilization
9
Queueing effects
10
Takeaway
11
Conclusion
Description:
Explore a 28-minute conference talk from SREcon19 Asia/Pacific that delves into the trade-offs between server utilization and tail latency in large-scale systems. Learn about the fundamentals of queueing theory and its practical applications in system performance optimization. Discover how increasing average utilization impacts tail latency, and gain insights into measuring and analyzing these crucial metrics. Follow along with a schematic diagram, request timeline, and service time explanations to better understand the concepts. Acquire valuable takeaways and basic rules for balancing utilization and tail latency in your own systems. Presented by Julius Plenz from Google, this talk offers a concise yet comprehensive overview of this important topic in site reliability engineering.

How to Trade off Server Utilization and Tail Latency

USENIX
Add to list