The Amazon ML stack: Broadest & deepest set of capabilities
5
Amazon EKS-run Kubernetes in cloud
6
Amazon EKS deployment
7
Getting started with Amazon EKS
8
Creating an EKS cluster using eksctl
9
GPUs for Machine Learning training • Training maps to matrix multiplications • Coupled with extremely high memory bandwidth
10
Set up K8s for ML-option 1
11
Create K8s cluster for ML-option 1
12
Scaling the cluster
13
Create K8s cluster for ML-option 2
14
Set up K8s for ML-option 2b
15
Challenges in setting up containers for ML
16
AWS deep learning containers
17
16 container images
18
ML on K8s—without KubeFlow
19
MNIST database
20
Fashion MNIST
21
AWS is the platform of choice to run TensorFlow
22
Machine Learning using TensorFlow on K8s
23
Apache MXNet
24
Distributed training using Horovod
25
Machine Learning pipeline for K8s
26
Machine Learning pipeline using SageMaker
Description:
Explore how to leverage Kubernetes for machine learning frameworks in this comprehensive conference talk from GOTO Chicago 2019. Discover why Kubernetes is well-suited for training and running machine learning models in production, with a focus on setting up open-source frameworks like TensorFlow, Apache MXNet, and PyTorch on a Kubernetes cluster. Learn about the isolation, auto-scaling, load balancing, flexibility, and GPU support that Kubernetes provides for computationally intensive and data-heavy machine learning models. Dive into the training, massaging, and inference phases of setting up a machine learning framework on Kubernetes, and gain insights into the Amazon ML stack and Amazon EKS for running Kubernetes in the cloud. Explore practical examples using the MNIST database and Fashion MNIST, and understand the challenges and solutions for setting up containers for machine learning. Get acquainted with AWS deep learning containers and learn how to create a machine learning pipeline using Kubernetes and SageMaker.
Read more