Главная
Study mode:
on
1
Intro
2
Reproducibility refers to the ability of a researcher to duplicate the results of a prior study....
3
Reproducibility crisis in science (2016)
4
Reinforcement learning (RL)
5
Adaptive neurostimulation
6
RL via Policy gradient methods
7
Policy gradient papers
8
Policy gradient baseline algorithms
9
Robustness of policy gradient algorithms
10
Codebase comparison
11
An intricate interplay of hyperparameters!
12
Fair comparison is easy, right?
13
How should we measure performance of the learned policy?
14
From fair comparisons...
15
How about a reproducibility checklist?
16
The role of infrastructure on reproducibility
17
Myth or fact?
18
Generalization in RL
19
Natural world has incredible complexity!
20
Natural world = RL simulation
21
Real-world video = RL simulation
22
Step out into the real-world!
23
ICLR Reproducibility Challenge Second Edition, 2019
Description:
Explore a comprehensive lecture on reproducibility, reusability, and robustness in reinforcement learning delivered by Joelle Pineau from Facebook/McGill University at the Institute for Advanced Study. Delve into the reproducibility crisis in science, policy gradient methods in reinforcement learning, and the challenges of fair algorithm comparisons. Examine the intricate interplay of hyperparameters, performance measurement techniques, and the role of infrastructure in reproducibility. Investigate the myths and facts surrounding generalization in reinforcement learning, and understand the complexities of applying RL to real-world scenarios. Learn about the ICLR Reproducibility Challenge and gain insights into creating more reliable and robust reinforcement learning systems.

Reproducible, Reusable, and Robust Reinforcement Learning - Joelle Pineau

Institute for Advanced Study
Add to list
0:00 / 0:00