mirror of
https://github.com/Brandon-Rozek/website.git
synced 2024-12-01 18:41:03 -05:00
12 lines
258 B
Markdown
12 lines
258 B
Markdown
# Lecture Notes for Reinforcement Learning
|
|
|
|
[Chapter 1: An Introduction](intro)
|
|
|
|
[Chapter 2: Multi-armed Bandits](bandits)
|
|
|
|
[Chapter 3: Markov Decision Processes](mdp)
|
|
|
|
[Chapter 4: Dynamic Programming](dynamic)
|
|
|
|
[Chapter 5: Monte Carlo Methods](mcmethods)
|
|
|