Giter Club home page Giter Club logo

reinforcepy's Introduction

ReinforcePy

ReinforcePy is meant to be an easy to use collection of helpers, networks, and learners for reinforcement learning. Right now the project is mainly focused on implementing papers from DeepMind and neural network based methods. There have been a ton of new papers published about deep reinforcement learning but no combined place or package of implementations, that is the main purpose of this library.

Current Status & Usage

Currently supported methods are:

  • DQN kinda old code, replaced by async paper.
  • Async 1 step DQN & SARSA, N-step, A3C, Recurrent support is a WIP. Paper
  • Async Double Q-Learning, Double N-step [no paper]
  • Async with Experience Replay, supports all Async methods but A3C

Upcoming features can be found in the wiki roadmap.

Example usage, trained models, and results can be found under examples/ALE/. A more in depth look at implementation details can be found in the wiki.

Installation (requires Python 3)

If you don't already I recommend using Anaconda to manage python environments, it also makes installation of Numpy & Scipy a breeze. Required packages:

Then simply:

    git clone https://github.com/Islandman93/reinforcepy
    cd reinforcepy/
    python setup.py install

Windows support

NEW: TensorFlow supports windows, the ALE uses cmake but I was unable to get it working with windows. A Visual Studio port can be found here

Documentation

Documentation is a work in progress available at: http://reinforcepy.readthedocs.org/.

For support, please submit an issue.

Development

All pull requests are welcome, this project is not tied to any specific reinforcement learning method so feel free to submit any published method or environment.

To hack on the code simply use:

    python setup.py develop

reinforcepy's People

Contributors

islandman93 avatar

Watchers

 avatar paper2code - bot avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.