From e98ed73083b0bbc6c78a1b9a97699a598643e938 Mon Sep 17 00:00:00 2001 From: Brandon Rozek Date: Tue, 22 Jan 2019 10:31:36 -0500 Subject: [PATCH] Cleaned up the formatting --- README.md | 10 +++++++--- 1 file changed, 7 insertions(+), 3 deletions(-) diff --git a/README.md b/README.md index 755caf6..5a3cea0 100644 --- a/README.md +++ b/README.md @@ -4,7 +4,7 @@ This is a walk through my journey of deep reinforcement learning. This will high ## Deep Q-Networks (DQN) -Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, Martin Riedmiller. **Playing Atari with Deep Reinforcement Learning**. +[1] Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, Martin Riedmiller. **Playing Atari with Deep Reinforcement Learning**. https://www.cs.toronto.edu/~vmnih/docs/dqn.pdf @@ -14,7 +14,9 @@ My implementation: https://github.com/Brandon-Rozek/DeepRL/blob/master/PoleBalan -Ziyu Wang, Tom Schaul, Matteo Hessel, Hado van Hasselt, Marc Lanctot, Nando de Freitas. **Dueling Network Architectures for Deep Reinforcement Learning**. + + +[2] Ziyu Wang, Tom Schaul, Matteo Hessel, Hado van Hasselt, Marc Lanctot, Nando de Freitas. **Dueling Network Architectures for Deep Reinforcement Learning**. https://arxiv.org/abs/1511.06581 @@ -24,7 +26,9 @@ My implementation: https://github.com/Brandon-Rozek/DeepRL/blob/master/DuelingPo -Tom Schaul, John Quan, Ioannis Antonoglou, David Silver. **Prioritized Experience Replay**. + + +[3] Tom Schaul, John Quan, Ioannis Antonoglou, David Silver. **Prioritized Experience Replay**. https://arxiv.org/abs/1511.05952