Giter Club home page Giter Club logo

dcil_i_xpag's Introduction

DCIL_I_XPAG

Implementation of the first version of the DCIL algorithm (paper) based on jax-based XPAG library.

๐Ÿ“˜ An improved version of the DCIL algorithm ((paper) is available here: https://github.com/AlexandreChenu/DCIL_XPAG

Install

  1. Clone DCIL repo,
git clone https://github.com/AlexandreChenu/DCIL_I_XPAG.git
  1. Create virtual environment dcil_env from environment.ylm,
cd DCIL_XPAG
conda env create --name dcil_env --file environment.yml
  1. Clone + install XPAG (+ Jax),
git clone https://github.com/perrin-isir/xpag.git
cd xpag
git checkout 9ef7dd74b74fc71cee83c6a476adfebe4b977814
pip install -e .

Check this Repo for more instructions.

  1. Clone + install maze or humanoid environments
git clone https://github.com/AlexandreChenu/gmaze_dcil.git

OR

git clone https://github.com/AlexandreChenu/gfetch_dcil.git

OR

git clone https://github.com/AlexandreChenu/ghumanoid_dcil.git

and

cd <env_directory>
pip install -e .
```-e .

Dependencies for the Fetch environment (from First Return Then Explore)

To run DCIL in the fetch environment, please clone Go-Explore repo:

git clone https://github.com/uber-research/go-explore.git

move to the robustified directory:

cd go-explore/robustified

From there, clone uber-research/atari-reset:

git clone https://github.com/uber-research/atari-reset.git atari_reset

Finally, update the PYTHONPATH to include robustified:

export PYTHONPATH=$PYTHONPATH:path_to_goexplore/go-explore/robustified

with path_to_goexplore, the absolute path to the go-explore repo.

Run Dubins Experiment

python test_DCIL_I_XPAG_dubins.py --demo_path ./demos/dubins_convert/1.demo --save_path </path/to/save/path>

Run Fetch Experiment

python test_DCIL_I_XPAG_fetch.py --demo_path ./demos/fetch_convert/6.demo --save_path </path/to/save/path>

Run Humanoid Experiment

python test_DCIL_I_XPAG_humanoid.py --demo_path ./demos/humanoid_convert/1.demo --save_path </path/to/save/path> --eps_state 0.5 --value_clipping 1

Visual logs produced in /path/to/save/path

  • trajs_it_- : training rollouts + skill-chaining evaluation
  • value_skill_-it- : value for x-y position of skill starting state for different orientations
  • transitions_- : sampled training transitions + segment between true desired goal and relabelled desired goal

dcil_i_xpag's People

Contributors

alexandrechenu avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.