What is RAPIDS? New GPU Accelerated Data Science Pipeline
4
RAPIDS End-to-End GPU-Accelerated Data Science
5
Learning from Apache Arrow
6
Data Science Workflow with RAPIDS
7
Ecosystem Partners
8
ML Technology Stack
9
Distributing Dask
10
Dask SVD Example
11
Numpy Array Function (NEP-18)
12
Python CUDA Array Interface
13
Interoperability for the Win
14
Challenges: Communication
15
SVD Benchmark
16
Scale up with RAPIDS
17
Road to 1.0
18
Additional Reading Material
Description:
Explore distributed multi-GPU computing using Dask, CuPy, and RAPIDS in this EuroPython 2019 conference talk. Discover how recent developments in NumPy community standards and protocols have simplified the integration of distributed and GPU computing libraries. Learn about GPU-accelerated clustering, the RAPIDS ecosystem for end-to-end GPU-accelerated data science, and the benefits of the Apache Arrow format. Dive into practical examples of distributed computing with Dask, including SVD benchmarks and scaling up with RAPIDS. Gain insights into the challenges of communication in distributed systems and explore the roadmap towards version 1.0 of these technologies. Enhance your understanding of high-performance computing techniques for data science applications.
Distributed Multi-GPU Computing with Dask, CuPy and RAPIDS