Explore generative model-based approaches to speech synthesis in this 38-minute conference talk by Heiga Zen from Google. Gain insight into how the naturalness of synthesized speech has improved, learn the probabilistic formulation of text-to-speech systems, and survey the acoustic models covered in the talk, including HMM-based, FFNN-based, and NN-based generative models. Delve into the architecture of WaveNet, a generative model for raw audio, and understand its advantages over conventional audio generative models. Finally, examine possible future directions for text-to-speech synthesis and its applications beyond traditional boundaries.