Explore how Booking.com deploys deep learning models in production in this 31-minute EuroPython 2017 conference talk. Follow the challenges of serving model predictions and the solutions adopted, including training models in Docker containers, automated retraining, and deployment using Kubernetes. Learn how prediction serving is optimized for both latency and throughput in a containerized environment. Trace the lifecycle of a model from initial training on a laptop to full-scale production deployment. See practical applications in image tagging and recommendation engines, and understand how to overcome bottlenecks in putting machine learning models into production. Suited to data scientists and engineers looking to bridge the gap between model development and real-world implementation.
How Booking.com Serves Deep Learning Model Predictions