Главная
Study mode:
on
1
Intro
2
Deep Learning System Stack
3
Introducing Compiler
4
TVM's Approach
5
Halide's Auto-scheduler
6
Challenges and our approach
7
Program Sampling
8
Sketch Generation Examples 1/2
9
Random Annotation Examples
10
Evolutionary Search
11
Learned Cost Model
12
Task Scheduler
13
Single Operator
14
Subgraph
15
Network
16
Ablation Study
17
Summary
Description:
Explore a groundbreaking approach to generating high-performance tensor programs for deep learning in this 20-minute conference talk from OSDI '20. Dive into Ansor, a novel framework that revolutionizes the optimization of tensor programs across various hardware platforms. Learn how Ansor's hierarchical search space representation, evolutionary search, and learned cost model outperform existing strategies, leading to significant performance improvements for deep neural networks on Intel CPUs, ARM CPUs, and NVIDIA GPUs. Discover the challenges in current deep learning systems and how Ansor's innovative task scheduler simultaneously optimizes multiple subgraphs. Follow the presentation's structure, covering topics such as the deep learning system stack, compiler approaches, program sampling techniques, and ablation studies, to gain a comprehensive understanding of this cutting-edge technology that pushes the boundaries of efficient deep learning execution.

Ansor - Generating High-Performance Tensor Programs for Deep Learning

USENIX
Add to list
0:00 / 0:00