---
title: Lecture Notes for Reinforcement Learning
showthedate: false
---

[Chapter 1: An Introduction](intro/)

[Chapter 2: Multi-armed Bandits](bandits/)

[Chapter 3: Markov Decision Processes](mdp/)

[Chapter 4: Dynamic Programming](dynamic/)

[Chapter 5: Monte Carlo Methods](mcmethods/)