Play all

Description:

Save Big on Coursera Plus. 7,000+ courses at $160 off. Limited Time Only! Grab it Explore the latest advancements in OpenAI's text-to-speech (TTS) and GPT-4V models in this 15-minute video tutorial. Discover innovative applications of these technologies, including generating image descriptions and creating audio content. Learn how to produce voiceovers for images and videos using a combination of TTS and GPT-4V. Follow along as the presenter demonstrates practical examples and showcases novel ways developers have been utilizing these powerful tools. Gain insights into the potential of AI-driven content creation and enhance your understanding of cutting-edge language and vision models.

Creating Voiceovers with OpenAI's Text-to-Speech and Vision Models

Ian Wootten

Add to list

#Computer Science #Artificial Intelligence #OpenAI #Computer Vision #Text to Speech #Art & Design #Music #Music Production #Audio Production #Sound Engineering #Audio generation

0:00 / 0:00

Creating Voiceovers with OpenAI's Text-to-Speech and Vision Models

Intro