Giter Club home page Giter Club logo

elicit's Introduction

Sharpening DAggers

Code, Experimental Configurations, and Visualization Tools for "Sharpening DAggers." Spins up both toy environments and simulated robosuite environments, and provides utilities for collecting demonstrations using various "algorithmic experts," from human user inputs, and from reinforcement learned policies (code for training these also included).

Core Question: How does the type, or quality of interventions (corrections, annotation) affect interactive imitation learning algorithms like DAgger, LazyDAgger, or ThriftyDAgger?

Side Questions:

  • Can we identify a clear way of scoring/ranking various demonstrations given an existing policy, task, and environment?
  • Can we elicit better demonstrations from human users using this score?

Quickstart

Clones thriftydagger (TODO: rename repo) to the working directory, then walks through dependency setup, mostly using the environment-<arch>.yaml files.

Shared Environments (ILIAD Cluster)

Conda environment sharp-daggers already exists on the ILIAD cluster, and should work for any GPU-capable nodes. The only necessary steps to take are cloning this repository, and activating the environment.

Local CPU Development (Mac OS)

Note: Assumes that conda (Miniconda or Anaconda are both fine) is installed and on your path. Use the -cpu environment file.

git clone https://github.com/madelineliao/thriftydagger
cd thriftydagger
conda env create -f environments/environment-cpu.yaml
conda activate sharp-daggers

Start-Up (from Scratch)

Use these commands if you're starting a repository from scratch (this shouldn't be necessary, but is documented in the case that building from conda .yaml breaks in the future).

Generally, if you're just trying to run/use this code, look at the Quickstart section above.

Local CPU Environment (Mac OS)

conda create --name sharp-daggers python=3.8
conda install pytorch==1.8.0 torchvision==0.9.0 torchaudio==0.8.0 ipython -c pytorch

pip install black gym h5py isort matplotlib pandas pyyaml tqdm

# Install Robosuite
pip install robosuite

GPU & Cluster Environments (ILIAD Cluster - CUDA 11.1, PyTorch 1.8)

conda create --name sharp-daggers python=3.8
conda install pytorch==1.8.0 torchvision==0.9.0 torchaudio==0.8.0 cudatoolkit=11.1 -c pytorch -c conda-forge
conda install ipython

pip install black gym h5py isort matplotlib pandas pyyaml tqdm

# Install Robosuite
pip install robosuite

elicit's People

Contributors

kanishkg avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.