1. Intro
2. Differential Equation
3. Reward Function
4. Value Function
5. Q Function
6. Discrete Time
7. Vector to Go
8. Results
Description:
This 30-minute video explains an approach to reinforcement learning for concurrent control, where an agent must think and act at the same time. The researchers reformulate Q-learning in continuous time, introduce concurrency, and then return to discrete time so that the method applies to real-world problems such as robotic control. The video covers the continuous-time formulation of the Bellman equations and their delay-aware discretization, which leads to a new class of approximate dynamic programming methods. It closes with the framework's application to simulated benchmark tasks and a large-scale robotic grasping problem, showing what "thinking while moving" means in practice.

Thinking While Moving - Deep Reinforcement Learning with Concurrent Control

Yannic Kilcher