Главная
Study mode:
on
1
Introduction
2
We are impatient people
3
Tail latency
4
Big data centers
5
Energy consumption
6
Cost savings
7
Efficiency
8
Server Architecture
9
Overview
10
Long Requests
11
Simplified Requests
12
Prior State of the Art
13
Memory Locations
14
Shared Locations
15
Log of Samples
16
Configuration
17
Program Counter
18
Shortening Queue
19
Parallelization
20
Slow to Fast
21
Pegasus
22
Results
23
Conclusion
24
Questions
Description:
Explore the intricacies of measuring and optimizing tail latency in data centers through this 52-minute conference talk from Strange Loop. Dive into the challenges of engineering interactive user request systems to optimize 99th percentile response times. Learn about a new tool and methodology for measuring performance at 1000 cycle granularities with minimal overhead. Discover root causes of tail latency and various optimization techniques, including scaling approaches for queuing delay and a novel dynamic adaptive parallelization method. Understand how these optimizations can improve server efficiency, benefiting users, profitability, and the environment. Gain insights from Kathryn McKinley, a Research Scientist at Google and Adjunct Professor at the University of Texas at Austin, as she shares her expertise in programming languages, compilers, runtime systems, and performance optimization.

Measuring and Optimizing Tail Latency

Strange Loop Conference
Add to list
0:00 / 0:00