<p>Reinforcement learning is the art of analyzing situations and mapping them to actions in order to maximize a numerical reward signal.</p>
<p>In this independent study, Dr. Stephen Davies and I will explore the reinforcement learning problem and its subproblems. We will go over the bandit problem and Markov decision processes, and discover how best to frame a problem in order to <strong>make decisions</strong>.</p>
<p>I have provided a list of topics that I wish to explore in a <a href="index.html%3Fresearch%252FReinforcementLearning%252Fsyllabus.html">syllabus</a>.</p>
<h2>Readings</h2>
<p>To spend more time learning, I decided to follow a textbook this time.</p>
<p>The notes for this course will be an extremely condensed summary of the textbook. There will also be notes on whatever tangents Dr. Davies and I explore.</p>
<p>I wrote a short, lighthearted report describing the bandit problem. It is a good way to learn about the common considerations in reinforcement learning problems.</p>
<p>If you want to see agents I've created to solve some OpenAI environments, take a look at this folder in the GitHub repository.</p>