Giter Club home page Giter Club logo

dm_hard_eight's Introduction

dm_hard_eight: DeepMind Hard Eight Task Suite

DeepMind Hard Eight Tasks is a set of 8 diverse machine-learning tasks that require exploration in partially observable environments to solve.

Hard Eight video

Overview

These tasks are provided through pre-packaged Docker containers.

This package consists of support code to run these Docker containers. You interact with the task environment via a dm_env Python interface.

Please see the documentation for more detailed information on the available tasks, actions and observations.

Requirements

The Hard Eight tasks are intended to be run on Linux and are not officially supported on Mac and Windows. However, they can in principle be run on any platform. In particular, on Windows, you may need to run the Python code from within WSL.

dm_hard_eight requires Docker, Python 3.6.1 or later and a x86-64 CPU with SSE4.2 support. We do not attempt to maintain a working version for Python 2.

Note: We recommend using Python virtual environment to mitigate conflicts with your system's Python environment.

Download and install Docker:

Installation

You can install dm_hard_eight by cloning a local copy of our GitHub repository:

$ git clone https://github.com/deepmind/dm_hard_eight.git
$ pip install ./dm_hard_eight

To also install the dependencies for the examples/, install with:

$ pip install ./dm_hard_eight[examples]

Usage

Once dm_hard_eight is installed, to instantiate a dm_env instance run the following:

import dm_hard_eight

settings = dm_hard_eight.EnvironmentSettings(seed=123,
    level_name='ball_room_navigation_cubes')
env = dm_hard_eight.load_from_docker(settings)

Citing

If you use dm_hard_eight in your work, please cite the accompanying paper:

@article{paine2019making,
  title={Making Efficient Use of Demonstrations to Solve Hard Exploration Problems},
  author={Tom Le Paine and
          Caglar Gulcehre and
          Bobak Shahriari and
          Misha Denil and
          Matt Hoffman and
          Hubert Soyer and
          Richard Tanburn and
          Steven Kapturowski and
          Neil Rabinowitz and
          Duncan Williams and
          Gabriel Barth-Maron and
          Ziyu Wang and
          Nando de Freitas and
          Worlds Team}
  journal={arXiv preprint arXiv:1909.01387},
  year={2019}
}

Notice

This is not an officially supported Google product.

dm_hard_eight's People

Contributors

rsfbarreira avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

dm_hard_eight's Issues

Non Docker version

Hello, thank you for creating and releasing this environment and tasks.

Unfortunately our compute cluster does not allow the use of containers such as Docker. Is the source code for this environment available for compilation?

Demonstration data available?

Hello, thank you so much for releasing such a good environment.

I notice in the paper Making Efficient Use of Demonstrations to Solve Hard Exploration Problems, you said

We collected a total of 100 demonstrations for each task spread across three different experts (each expert contributed roughly one third of the demonstrations for each task). Demonstrations for the tasks were collected using keyboard and mouse controls mapped to the agent’s exact action space, which was necessary to enable both behaviour cloning and learning from demonstrations.

So I want to know whether it is possible to release those demostration data as well?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.