website/content/research/reinforcementlearning/notes.md

13 lines
258 B
Markdown
Raw Normal View History

2020-01-16 02:51:49 +00:00
# Lecture Notes for Reinforcement Learning
[Chapter 1: An Introduction](intro)
[Chapter 2: Multi-armed Bandits](bandits)
[Chapter 3: Markov Decision Processes](mdp)
[Chapter 4: Dynamic Programming](dynamic)
[Chapter 5: Monte Carlo Methods](mcmethods)