Reinforcement Learning Framework for PyTorch

Brandon Rozek a47f3f6037 Seed documentation		2020-03-20 19:38:35 -04:00
docs	Created documentation for memory module	2020-03-20 19:31:09 -04:00
examples	Added improvements to the REINFORCE algorithm	2019-03-04 17:10:24 -05:00
rltorch	Seed documentation	2020-03-20 19:38:35 -04:00
tests	Added templates for unit testing and sphinx documentation	2020-03-15 14:27:56 -04:00
.gitignore	Added templates for unit testing and sphinx documentation	2020-03-15 14:27:56 -04:00
license.md	Added license	2019-03-30 16:32:57 -04:00
Readme.md	Initial Commit	2019-01-31 23:34:32 -05:00
setup.py	Added templates for unit testing and sphinx documentation	2020-03-15 14:27:56 -04:00
tox.ini	Added templates for unit testing and sphinx documentation	2020-03-15 14:27:56 -04:00

rltorch

A reinforcement learning framework with the primary purpose of learning and cleaning up personal scripts.

Installation

From GitHub

pip install git+https://github.com/brandon-rozek/rltorch

The configuration is a dictionary shared among the different components. It contains hyperparameters and other configuration values.
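As a minimal sketch of the shared-dictionary idea, the snippet below passes one dictionary object to two components so that both see the same values. The key names and the `Component` class are hypothetical and chosen only for illustration; they are not taken from rltorch.

```python
# Hypothetical hyperparameter names; rltorch's actual keys may differ.
config = {
    "learning_rate": 1e-3,
    "discount_rate": 0.99,
    "batch_size": 32,
}


class Component:
    """Stand-in for any part of the framework that reads the configuration."""

    def __init__(self, name, config):
        self.name = name
        self.config = config  # keeps a reference to the same dict, not a copy


agent = Component("agent", config)
memory = Component("memory", config)

# Because the dictionary is shared, a change is visible to every component.
config["batch_size"] = 64
```

Sharing a reference keeps all components consistent, at the cost that any component can also modify values that the others rely on.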

The environment needs to support the standard OpenAI Gym functions reset and step.
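To illustrate what supporting reset and step can look like, here is a toy environment written without any dependencies. It assumes the classic Gym convention in which reset returns an observation and step returns an (observation, reward, done, info) tuple; newer Gymnasium releases use different return values. `CoinFlipEnv` and `run_episode` are hypothetical names used only for this sketch.

```python
import random


class CoinFlipEnv:
    """Toy environment: guess a hidden coin flip for a fixed number of steps."""

    def __init__(self, episode_length=5, seed=0):
        self.episode_length = episode_length
        self.rng = random.Random(seed)
        self.steps_taken = 0
        self.coin = self.rng.randint(0, 1)

    def reset(self):
        """Start a new episode and return the initial observation."""
        self.steps_taken = 0
        self.coin = self.rng.randint(0, 1)
        return self.steps_taken

    def step(self, action):
        """Apply an action and return (observation, reward, done, info)."""
        reward = 1.0 if action == self.coin else 0.0
        self.coin = self.rng.randint(0, 1)  # flip a new coin for the next step
        self.steps_taken += 1
        done = self.steps_taken >= self.episode_length
        return self.steps_taken, reward, done, {}


def run_episode(env, policy):
    """Run one episode with a policy mapping observations to actions."""
    obs = env.reset()
    total, done = 0.0, False
    while not done:
        obs, reward, done, _info = env.step(policy(obs))
        total += reward
    return total
```

Any object exposing these two methods with the same return shapes could, under this assumption, stand in for a Gym environment.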

For TensorBoard to work, you need to define a logger that will (optionally) later be passed to the network, runner, and agent/trainer.

Due to issues with multiprocessing, the Logger is a shared dictionary of lists that are appended to, and the LogWriter writes them out on the main thread.
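The split between collecting values and writing them can be sketched as follows. This is not rltorch's implementation: `SimpleLogger`, `SimpleLogWriter`, and the `write_scalar` callback are hypothetical, and a plain in-process dictionary is used here, whereas a real multi-process setup would need a process-shared container (for example one created through `multiprocessing.Manager`).

```python
from collections import defaultdict


class SimpleLogger:
    """Collects values under string tags; performs no I/O itself."""

    def __init__(self):
        self.log = defaultdict(list)  # tag -> list of pending values

    def append(self, tag, value):
        self.log[tag].append(value)


class SimpleLogWriter:
    """Drains the logger and hands each value to a writer callback."""

    def __init__(self, logger, write_scalar):
        self.logger = logger
        self.write_scalar = write_scalar  # called as write_scalar(tag, value, step)
        self.steps = defaultdict(int)     # next step index for each tag

    def flush(self):
        """Write out and clear everything collected so far."""
        for tag in list(self.logger.log.keys()):
            values = self.logger.log.pop(tag)
            for value in values:
                self.write_scalar(tag, value, self.steps[tag])
                self.steps[tag] += 1
```

In this sketch, workers only ever call `append`, while `flush` is meant to be called from the main thread, where the callback could forward values to a TensorBoard writer.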

A network takes a PyTorch nn.Module, a PyTorch optimizer, the configuration, and an optional logger.

A target network takes in a network and provides methods to sync a copy of the original network.
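The idea of syncing a copy can be shown without PyTorch by using a dictionary of floats as a stand-in for network weights. The class and method names below are hypothetical, and the partial (weighted-average) sync is a common variant included as an assumption, not something the text above confirms rltorch provides. With real PyTorch modules, a full sync is usually done with `target.load_state_dict(source.state_dict())`.

```python
import copy


class TargetCopy:
    """Keeps a separate copy of some weights and syncs it on request."""

    def __init__(self, weights):
        self.source = weights                 # dict: name -> float (stand-in for parameters)
        self.target = copy.deepcopy(weights)  # independent copy

    def sync(self):
        """Full sync: overwrite the copy with the current source weights."""
        self.target = copy.deepcopy(self.source)

    def partial_sync(self, tau):
        """Move each copied weight a fraction tau of the way toward the source."""
        for name, value in self.source.items():
            self.target[name] = tau * value + (1 - tau) * self.target[name]
```

Usage: after the source weights change, the copy stays at its old values until one of the sync methods is called.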

An action selector typically takes in a network, which it then uses to decide which action to take.

For example, the ArgMaxSelector chooses the action that produces the highest entry in the output vector of the network.
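The arg-max rule described above can be sketched in a few lines. `GreedySelector` and its `act` method are hypothetical names, not rltorch's ArgMaxSelector API, and the "network" is assumed to be any callable that maps a state to a list of action values.

```python
class GreedySelector:
    """Chooses the action whose output value is largest."""

    def __init__(self, network):
        self.network = network  # callable: state -> list of action values

    def act(self, state):
        values = self.network(state)
        # Index of the largest entry; on ties the lowest index is returned.
        return max(range(len(values)), key=lambda i: values[i])


def toy_network(state):
    """Stand-in network: fixed action values regardless of the state."""
    return [0.1, 0.7, 0.2]


selector = GreedySelector(toy_network)
chosen = selector.act(state=None)
```

A purely greedy rule never explores, which is why it is often combined with some source of randomness during training.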

The memory stores experiences gathered during simulations of the environment, which are useful for later training.
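One common way to store experiences is a fixed-size replay buffer that drops the oldest entries and supports random sampling. The sketch below assumes that design; `Transition`, `ReplayBuffer`, and the field names are hypothetical and not taken from rltorch's memory module.

```python
import random
from collections import deque, namedtuple

# One experience: what was seen, what was done, and what happened next.
Transition = namedtuple(
    "Transition", ["state", "action", "reward", "next_state", "done"]
)


class ReplayBuffer:
    """Fixed-capacity store of transitions with uniform random sampling."""

    def __init__(self, capacity):
        self.buffer = deque(maxlen=capacity)  # oldest items are dropped when full

    def append(self, state, action, reward, next_state, done):
        self.buffer.append(Transition(state, action, reward, next_state, done))

    def sample(self, batch_size):
        """Return batch_size distinct transitions chosen uniformly at random."""
        return random.sample(list(self.buffer), batch_size)

    def __len__(self):
        return len(self.buffer)
```

Sampling at random rather than in order is a typical choice because consecutive experiences from one episode tend to be strongly correlated.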

An agent takes in a network and performs some sort of training on it.