From e98ed73083b0bbc6c78a1b9a97699a598643e938 Mon Sep 17 00:00:00 2001
From: Brandon Rozek <brozek@mail.umw.edu>
Date: Tue, 22 Jan 2019 10:31:36 -0500
Subject: [PATCH] Clean up README formatting and number the paper references

---
 README.md | 10 +++++++---
 1 file changed, 7 insertions(+), 3 deletions(-)

diff --git a/README.md b/README.md
index 755caf6..5a3cea0 100644
--- a/README.md
+++ b/README.md
@@ -4,7 +4,7 @@ This is a walk through my journey of deep reinforcement learning. This will high
 
 ## Deep Q-Networks (DQN)
 
-Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, Martin Riedmiller. **Playing Atari with Deep Reinforcement Learning**.
+[1] Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, Martin Riedmiller. **Playing Atari with Deep Reinforcement Learning**.
 
 https://www.cs.toronto.edu/~vmnih/docs/dqn.pdf
 
@@ -14,7 +14,9 @@ My implementation: https://github.com/Brandon-Rozek/DeepRL/blob/master/PoleBalan
 
 
 
-Ziyu Wang, Tom Schaul, Matteo Hessel, Hado van Hasselt, Marc Lanctot, Nando de Freitas. **Dueling Network Architectures for Deep Reinforcement Learning**.
+
+
+[2] Ziyu Wang, Tom Schaul, Matteo Hessel, Hado van Hasselt, Marc Lanctot, Nando de Freitas. **Dueling Network Architectures for Deep Reinforcement Learning**.
 
 https://arxiv.org/abs/1511.06581
 
@@ -24,7 +26,9 @@ My implementation: https://github.com/Brandon-Rozek/DeepRL/blob/master/DuelingPo
 
 
 
-Tom Schaul, John Quan, Ioannis Antonoglou, David Silver. **Prioritized Experience Replay**.
+
+
+[3] Tom Schaul, John Quan, Ioannis Antonoglou, David Silver. **Prioritized Experience Replay**.
 
 https://arxiv.org/abs/1511.05952