mirror of
https://github.com/Brandon-Rozek/website.git
synced 2025-02-03 00:42:37 +00:00
258 B
258 B
Lecture Notes for Reinforcement Learning
Chapter 2: Multi-armed Bandits
Chapter 3: Markov Decision Processes