Lecture Notes for Reinforcement Learning Chapter 1: An Introduction Chapter 2: Multi-armed Bandits Chapter 3: Markov Decision Processes Chapter 4: Dynamic Programming Chapter 5: Monte Carlo Methods