mirror of
https://github.com/Brandon-Rozek/website.git
synced 2025-12-08 11:10:25 +00:00
Fixed titles, math rendering, and links on some pages
This commit is contained in:
parent
9f096a8720
commit
330ace0de9
61 changed files with 303 additions and 115 deletions
|
|
@ -1,4 +1,8 @@
|
|||
# Chapter 2: Multi-armed Bandits
|
||||
---
|
||||
title: Chapter 2 - Multi-armed Bandits
|
||||
showthedate: false
|
||||
math: true
|
||||
---
|
||||
|
||||
Reinforcement learning *evaluates* the actions taken rather than accepting $instructions$ of the correct actions. This creates the need for active exploration.
|
||||
|
||||
|
|
@ -141,4 +145,4 @@ This is an example of an *associative search* task, so called because it involve
|
|||
|
||||
If actions are allowed to affect the next situation as well as the reward, then we have the full reinforcement learning problem. This will be presented in the next chapter of the book with its ramifications appearing throughout the rest of the book.
|
||||
|
||||

|
||||

|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue