mirror of
https://github.com/Brandon-Rozek/website.git
synced 2024-11-26 10:03:58 -05:00
# Lecture Notes for Reinforcement Learning

[Chapter 1: An Introduction](intro)

[Chapter 2: Multi-armed Bandits](bandits)

[Chapter 3: Markov Decision Processes](mdp)

[Chapter 4: Dynamic Programming](dynamic)

[Chapter 5: Monte Carlo Methods](mcmethods)