Главная
Study mode:
on
1
- Intro
2
- Paper Intro
3
- Training recipe overview
4
- Image and Video generation pipeline
5
- Temporal Auto-Encoder architecture
6
- Transformer backbone architecture
7
- Training ObjectiveOutlier Penalty Loss
8
- Tiled inference
9
- Superresolution model
10
- Training setting
11
- Parallelism for training
12
- Pre-training data
13
- Multi-stage training
14
- Fine-tuning
15
- Inference
16
- Extro
Description:
Explore a 23-minute technical video analysis breaking down Meta's latest video generation model, MovieGen, and its research paper. Dive deep into the model's sophisticated architecture, training methodology, and ambitious goals for AI-powered content creation. Learn about key components including the temporal auto-encoder architecture, transformer backbone, training objectives with outlier penalty loss, and the complete pipeline from pre-training through inference. Understand the technical intricacies of tiled inference, superresolution modeling, parallelism in training, and the multi-stage training approach. Presented by an experienced machine learning researcher with 15 years of software engineering background and expertise in computer vision and robotics, gain valuable insights into how this cutting-edge technology aims to revolutionize movie generation through artificial intelligence.

Meta MovieGen: Understanding Video Generation Model Architecture and Training

AI Bites
Add to list
0:00 / 0:00