Главная
Study mode:
on
1
What is Q?
2
Q function explained
3
Q-learning update rule Bellman
4
Markov Decision Process
5
We compute Q
6
Residual Q-Learning Oct 2023
7
Policy customization, multi tasks
8
Residual Soft Actor Critic
9
Residual Max-Entropy MC
10
Q* a soft Q-function Oct 2023
11
Q* in Max Entropy RL
12
Q* dev by OpenAI & Berkeley
13
Maximum Entropy Policies w/ Q star
Description:
Learn about the technical foundations and recent developments of Q* (Q-star) in this 20-minute educational video that demystifies complex reinforcement learning concepts. Explore the evolution from basic Q-functions to advanced Q* applications, covering fundamental topics like Bellman equations, Markov Decision Processes, and entropy-based reinforcement learning. Delve into recent developments including Residual Q-Learning, policy customization, and maximum entropy policies, with particular focus on collaborative work between OpenAI and UC Berkeley. Gain clear explanations of how Q* relates to physics principles and agent behavior through imitation learning, dispelling common misconceptions about its connection to artificial general intelligence (AGI). Master technical concepts through structured segments that progress from basic Q-functions to advanced applications in maximum entropy reinforcement learning.

Understanding Q* (Q-star) - From Q-Learning to Maximum Entropy Reinforcement Learning

Discover AI
Add to list
0:00 / 0:00