Explore distributed TensorFlow implementation on DC/OS in this conference talk. Learn about the challenges of running distributed TensorFlow on personal infrastructure and discover an open-source framework designed to simplify machine learning with large models and datasets. Gain insights into TensorFlow on Mesos and DC/OS, and witness a live demonstration of the framework. Delve into topics such as machine intelligence, open-source TensorFlow, DC/OS architecture, deep learning overview, and the training phase. Examine the demo setup, visualization, and cluster overview, including DC/OS Catalog and TensorFlow tools. Understand CPU and GPU allocation, beta TensorFlow, deployment phase, and developer workflow. Investigate the challenges of moving to distributed systems, cluster specifications, failure handling, and manual configuration. Learn about high-level service definition, deployer responsibilities, DC/OS Secrets, runtime configuration, and framework types. Conclude with special thanks, questions, and links for further discussion.
Read more