Главная
Study mode:
on
1
Overview
2
Introduction to TensorFlow
3
What is Machine Intelligence
4
What is TensorFlow
5
Open Source TensorFlow
6
What is DCOS
7
Highlevel picture of DCOS
8
What are DCOS
9
Deep Learning Overview
10
Training Phase
11
Recap
12
Demo Overview
13
Demo Setup
14
Demo Visualization
15
Summary
16
Demo
17
Cluster Overview
18
DCOS Catalog
19
DCOS Tensorflow Tools
20
DCOS Tensorflow Packages
21
Storage Bucket
22
CPUGPU Allocation
23
Beta Tensorflow
24
Deployment Phase
25
Developer Workflow
26
Moving to Distributed
27
The Real Challenge
28
What DCOS Does
29
Cluster Spec
30
Dealing with failures
31
Manual configuration
32
Highlevel service definition
33
Deployer responsibilities
34
Clean Read
35
DCOS Secrets
36
Runtime Configuration Dictionary
37
Single Framework
38
Meta Framework
39
Flow
40
Special Thanks
41
Questions Links
42
Discussion
Description:
Explore distributed TensorFlow implementation on DC/OS in this conference talk. Learn about the challenges of running distributed TensorFlow on personal infrastructure and discover an open-source framework designed to simplify machine learning with large models and datasets. Gain insights into TensorFlow on Mesos and DC/OS, and witness a live demonstration of the framework. Delve into topics such as machine intelligence, open-source TensorFlow, DC/OS architecture, deep learning overview, and the training phase. Examine the demo setup, visualization, and cluster overview, including DC/OS Catalog and TensorFlow tools. Understand CPU and GPU allocation, beta TensorFlow, deployment phase, and developer workflow. Investigate the challenges of moving to distributed systems, cluster specifications, failure handling, and manual configuration. Learn about high-level service definition, deployer responsibilities, DC/OS Secrets, runtime configuration, and framework types. Conclude with special thanks, questions, and links for further discussion. Read more

Running Distributed TensorFlow on DC/OS

Linux Foundation
Add to list
0:00 / 0:00