Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only!
Grab it
Explore the challenges and opportunities of multimodal AI in this 55-minute podcast episode featuring Ethan Rosenthal, Member of Technical Staff at Runway. Dive into the complexities of managing and accelerating multimodal AI systems, from data management to efficient inference. Learn about the similarities and differences between tabular machine learning, large language models, and generative video systems. Discover effective setups and tools for supporting both research and productionization processes in the rapidly evolving field of AI. Gain insights into topics such as multimodal feature stores, large-scale distributed training, and the emerging Generative DevOps movement. Understand the challenges of bridging the gap between researchers and engineers in AI development and explore strategies for structuring teams to maximize efficiency in multimodal AI projects.
Accelerating Multimodal AI - From Tabular ML to Generative Video Systems