Главная
Study mode:
on
1
Introduction
2
Online Learning
3
Theory
4
ECube
5
RMax
6
General of U principle
7
Algorithm Design
8
Notation
9
MDPs
10
Optimal MDP
11
Questions
12
Bellman Equation
13
Bellman Theorem
14
Analysis
15
Optimal
Description:
Explore the fundamentals of online learning in Markov Decision Processes (MDPs) through this comprehensive lecture by Ambuj Tewari from the University of Michigan. Delve into key concepts such as online learning theory, E-Cube, R-Max, and the general U principle. Gain insights into algorithm design, notation, and MDPs. Understand optimal MDPs, Bellman equations, and Bellman's theorem. Analyze the optimal approach to online learning in MDPs. This talk, part of the Theory of Reinforcement Learning Boot Camp at the Simons Institute, provides a thorough introduction to the subject and addresses important questions in the field.

Online Learning in Markov Decision Processes - Part 1

Simons Institute
Add to list
0:00 / 0:00