Acting in Delayed Environments with Non-Stationary Markov Policies

This repository contains the implementation of the Delayed, Augmented, Oblivious, and RNN agents from the paper "Acting in Delayed Environments with Non-Stationary Markov Policies" by Esther Derman*, Gal Dalal*, and Shie Mannor (*equal contribution), published at ICLR 2021.

The agents in this repository support the Atari environments. A simpler agent that supports CartPole and Acrobot is available separately.
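
For readers new to the setting, the snippet below is a minimal sketch of what an execution delay means: an action chosen at step t only takes effect `delay` steps later. It is not the code from this repository. `CountingEnv`, `DelayedActionWrapper`, and the `default_action` parameter are hypothetical names introduced purely for illustration, and the toy environment stands in for an Atari game.

```python
from collections import deque


class CountingEnv:
    """Hypothetical toy environment: the state is the sum of executed actions."""

    def reset(self):
        self.state = 0
        return self.state

    def step(self, action):
        self.state += action
        reward = float(action)
        done = False
        return self.state, reward, done, {}


class DelayedActionWrapper:
    """Hypothetical wrapper: executes each action `delay` steps after it was chosen."""

    def __init__(self, env, delay, default_action=0):
        self.env = env
        self.delay = delay
        self.default_action = default_action  # assumption: a no-op fills the initial gap
        self.pending = deque()

    def reset(self):
        # Pre-fill the queue so the first `delay` steps execute the default action.
        self.pending = deque([self.default_action] * self.delay)
        return self.env.reset()

    def step(self, action):
        # Queue the newly chosen action and execute the oldest pending one.
        self.pending.append(action)
        executed = self.pending.popleft()
        return self.env.step(executed)


env = DelayedActionWrapper(CountingEnv(), delay=2)
env.reset()
states = [env.step(a)[0] for a in [1, 1, 1, 1]]
print(states)  # [0, 0, 1, 2]
```

With `delay=2`, the first two steps execute the default action, so the effect of the agent's first choice only appears in the third observed state. An agent that ignores this gap acts on stale information, which is the difficulty the agents listed above are meant to address.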

Installation

This is a fork of Stable-Baselines (v2.10.1, based on TensorFlow), with the addition of the delayed agent.

To set up the environment, follow the installation instructions for Stable-Baselines.

Running the code

To run an experiment, use the run_experiment_rl_delay.py script.

Citing the Project

To cite this repository in publications:

@article{derman2021acting,
  title={Acting in delayed environments with non-stationary markov policies},
  author={Derman, Esther and Dalal, Gal and Mannor, Shie},
  journal={International Conference on Learning Representations (ICLR)},
  year={2021}
}

Happy delaying!
