An exploding ecosystem makes ML deployment difficult
6
ML-based optimizations
7
Why use Apache TVM?
8
TVM: Getting Optimal Performance
9
OctoML's Broad HW & Model Architecture Coverage Octomizer supports any model architecture with standard operators see lists here and here
10
Thank you Apache TVM community! 615+!
Description:
Explore a 30-minute talk on optimizing machine learning deployment using Apache TVM and OctoML Platform. Learn about graph- and operator-level optimizations for performance portability across diverse hardware back-ends. Discover how TVM's learning-based approach rapidly explores optimizations, saving engineering time and delivering top performance for edge and server use cases. Gain insights into TVM's broad model coverage and efficient hardware resource utilization. Get a preview of OctoML's Octomizer, a SaaS platform for continuous model optimization, benchmarking, and packaging. Understand the challenges of ML deployment in diverse hardware environments and how TVM and OctoML address the exploding ecosystem of ML workloads and hardware capabilities.
Bringing Choice, Automation and Performance to ML Deployment with Apache TVM and the OctoML Platform