Главная
Study mode:
on
1
Introduction
2
Modern Scientific Applications
3
Deinsum Workflow
4
Binary Operations
5
Minimize Communication
6
Example
7
Cartesian Process Grid
8
Optimal Tile Sizes
9
Annotation
10
Data Distribution
11
Code Generation
12
Experimental Results
13
Conclusion
Description:
Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only! Grab it Explore a conference talk on Deinsum, a framework for distributed multilinear algebra computations using Einstein notation. Delve into the challenges of optimizing data movement in massively-parallel systems and learn how Deinsum addresses these issues. Discover the framework's approach to deriving data movement-optimal tiling and generating distributed schedules, enhancing local computation performance. Examine the application of Deinsum to important tensor kernel classes, including Matricized Tensor Times Khatri-Rao Products and Tensor Times Matrix chains. Gain insights into the framework's workflow, binary operations, communication minimization techniques, and optimal tile size determination. Analyze experimental results showcasing Deinsum's performance on the Piz Daint supercomputer, demonstrating significant speedups over existing solutions.

Deinsum: Practically I/O Optimal Multi-Linear Algebra for Distributed Systems

Scalable Parallel Computing Lab, SPCL @ ETH Zurich
Add to list
0:00 / 0:00