Главная
Study mode:
on
1
- Introductions
2
- What is the Goal
3
- Exploit Versus Explore
4
- Greedy Versus Epsilon-Greedy
5
- Discount Factor “g”
6
- Calculating Rewards
7
- Pseudo Code
8
- Working With Q-Tables 1
9
- Working With Q-Tables 2
10
- Online Q-Table
11
- States & Actions
12
- TD Learning vs Monte Carlo
13
- From DRA to MDP
14
- Stochastic Actions
15
- OpenAI CartPole
16
- More Terminology
17
- Useful Links
Description:
Explore an intuition-based approach to reinforcement learning in this 42-minute talk by Oswald Campesato, co-founding CEO of iQuarkt and author of over 35 technical books. Gain insights into a framework that helps algorithms learn decision-making through environmental feedback, inspired by human and animal intuition. Delve into key concepts such as exploit versus explore, greedy versus epsilon-greedy strategies, discount factors, and reward calculations. Examine practical applications in game playing, robotic control, and autonomous driving. Learn about Q-tables, TD learning versus Monte Carlo methods, and the transition from DRA to MDP. Discover the OpenAI CartPole environment and gain access to useful resources for further exploration of reinforcement learning techniques.

An Intuition-Based Approach to Reinforcement Learning

Open Data Science
Add to list