Главная
Study mode:
on
1
Outline
2
Objective
3
Distributional RL
4
Return Distribution
5
Policy Evaluation
6
Convergence
7
Bellman Equation
8
C51 (Categorical DQN)
9
Advantage
10
Atari Results
11
Distributional Representations
Description:
Explore distributional reinforcement learning in this 23-minute video lecture from Pascal Poupart's CS885 course at the University of Waterloo. Delve into key concepts including return distribution, policy evaluation, convergence, and the Bellman equation. Examine the C51 (Categorical DQN) algorithm, its advantages, and its performance on Atari games. Gain insights into various distributional representations and their applications in reinforcement learning. Access accompanying slides from the course website to enhance your understanding of this advanced topic in machine learning and artificial intelligence.

Distributional RL

Pascal Poupart
Add to list