Главная
Study mode:
on
1
Intro
2
SREs care about efficiency and performan
3
Tuning system configuration matters...
4
but it is getting harder and harder
5
Key requirements for a new approach
6
ML techniques for smart exploration
7
ML enables automated performance tuning
8
and a new performance tuning process
9
The target system: Online Boutique
10
Use Case: optimizing cost of K8s microservices while ensuring reliability
11
The reference architecture
12
The optimization goals & constraints
13
Best configuration found by ML in 24H improves cost efficiency by 77%
14
Best config: optimal resources assigned to microservices
15
Best config: higher performance & efficiency for the overall service Baseline vs Best Service throughout Baseline vs Best Service po response time
16
Use Case: maximizing service performance & efficiency with JVM tuning
17
Best config: +28% throughput, and meeting SLOS
18
Best config: optimal JVM options 8
19
Key takeaways
Description:
Explore a novel approach to automating performance tuning using machine learning in this 22-minute conference talk from SREcon21. Dive into the challenges faced by Site Reliability Engineers (SREs) in optimizing application performance, stability, and availability through configuration management. Learn how reinforcement learning techniques can be leveraged to find optimal configurations based on specific optimization goals, such as minimizing service latency or cloud costs. Examine a practical example of optimizing Kubernetes microservice cost and latency by tuning container resources and JVM options. Analyze the discovered optimal configurations, identify the most impactful parameters, and gain valuable insights for tuning microservices. Understand the key requirements for implementing this innovative approach and how it transforms the performance tuning process. Discover how machine learning enables smart exploration and automated performance tuning, potentially improving cost efficiency by up to 77% and increasing throughput by 28% while meeting Service Level Objectives (SLOs). Read more

Automating Performance Tuning with Machine Learning

USENIX
Add to list