Главная
Study mode:
on
1
Intro
2
Sequential Decision Making
3
Reinforcement Learning
4
Sample Efficiency
5
Value-based Algorithms
6
Exploration
7
Multi-armed Bandits
8
Upper Confidence Bound (UCB)
9
Q-learning with UCB
10
Beyond Tabular Setting
11
Linear Function Approximation
12
A Natural Algorithm
13
Linear MDP
14
Related Work
Description:
Explore provably efficient reinforcement learning with linear function approximation in this 28-minute lecture from the Workshop on Theory of Deep Learning. Delve into sequential decision making, sample efficiency, and value-based algorithms as Chi Jin, a Member of the School of Mathematics at the Institute for Advanced Study, presents cutting-edge research. Examine exploration techniques, including multi-armed bandits and Upper Confidence Bound (UCB), before moving beyond tabular settings to linear function approximation. Investigate linear MDPs and related work in this comprehensive overview of reinforcement learning theory and applications.

Provably Efficient Reinforcement Learning with Linear Function Approximation - Chi Jin

Institute for Advanced Study
Add to list