Giter Club home page Giter Club logo

dynamic-sparse-training-for-deep-reinforcement-learning's Introduction

Dynamic Sparse Training for Deep Reinforcement Learning

This is the Pytorch implementation for the Dynamic Sparse Training for Deep Reinforcement Learning paper.

Abstract

In this paper, we introduce dynamic sparse training for Deep Reinforcement Learning (DRL). In particular, we propose a new training algorithm to train DRL agents using sparse neural networks from scratch and dynamically optimize the sparse topology jointly with the parameters. We integrated our proposed method with the Twin Delayed Deep Deterministic policy gradient (TD3) algorithm and introduce "Dynamic Sparse training for TD3 (DS-TD3)".

Our proposed method is tested on MuJoCo continuous control tasks in OpenAI gym. The experimental results show the effectiveness of our training algorithm in boosting the learning speed of the agent and achieving higher performance. Moreover, DS-TD3 offers a 50% reduction in the network size and floating-point operations (FLOPs).

Requirements

Usage

For DS-TD3: Dynamic Sparse training of TD3 algorithm

python main.py --env HalfCheetah-v3 --policy DS-TD3

For Static-TD3

python main.py --env HalfCheetah-v3 --policy StaticSparseTD3

For TD3

python main.py --env HalfCheetah-v3 --policy TD3

Results

Reference

If you use this code, please cite our paper:

 @article{sokar2021dynamic,
  title={Dynamic Sparse Training for Deep Reinforcement Learning},
  author={Sokar, Ghada and Mocanu, Elena and Mocanu, Decebal Constantin and Pechenizkiy, Mykola and Stone, Peter},
  journal={arXiv preprint arXiv:2106.04217},
  year={2021}
}

Acknowledgments

We start from the official code of the TD3 method from the following repository

TD3

dynamic-sparse-training-for-deep-reinforcement-learning's People

Contributors

ghadasokar avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.