Главная
Study mode:
on
1
Introduction
2
Reinforcement Learning
3
Theoretical Questions
4
Notation
5
Outline
6
What if
7
Generalisation
8
Covariate Shift
9
Value Error
10
Common Tasks
11
Common Assumptions
12
Evaluation Criteria
13
Discussion Question
14
Important Sampling
15
Is this a form of wishful thinking
16
Unbiased policy estimates
17
Models
18
doubly robust estimators
Description:
Explore offline reinforcement learning in this 59-minute lecture by Emma Brunskill from Stanford University, presented at the Theory of Reinforcement Learning Boot Camp hosted by the Simons Institute. Delve into theoretical questions, notation, and key concepts such as generalization, covariate shift, and value error. Examine common tasks, assumptions, and evaluation criteria in batch reinforcement learning. Engage with discussion questions on important sampling, unbiased policy estimates, and models. Gain insights into doubly robust estimators and critically analyze the potential for wishful thinking in this approach to machine learning.

Batch Offline Reinforcement Learning - Part 1

Simons Institute
Add to list
0:00 / 0:00