Explore the evolution of generative image models in this talk by Robin Rombach, co-creator of Stable Diffusion. Follow the progression from GANs to transformers and latent diffusion models, with a focus on high-resolution image synthesis techniques. Learn about two-stage generative models, the VQ-VAE architecture, Vision Transformers, and the Stable Diffusion model. Discover applications in text-to-image generation, semantic synthesis, upscaling, and creative uses such as text-to-color-palette conversion and video stylization. Rombach draws on his research experience and his role in developing widely used projects such as VQGAN (Taming Transformers) and Latent Diffusion Models.
Stable Diffusion and Friends - High-Resolution Image Synthesis via Two-Stage Generative Models