Lecture Notes for Reinforcement Learning

Chapter 1: An Introduction

Chapter 2: Multi-armed Bandits

Chapter 3: Markov Decision Processes

Chapter 4: Dynamic Programming

Chapter 5: Monte Carlo Methods