Multi-Agent Continuous Control in Collaborative-Competitive environment

Introduction

For this project, I am using slightly modified versions of Unity Tennis environment. See the animation with trained agent in comparison to "dumb" agent below

Trained Agent:

Random Agent

In this environment, two agents control rackets to bounce a ball over a net. If an agent hits the ball over the net, it receives a reward of +0.1. If an agent lets a ball hit the ground or hits the ball out of bounds, it receives a reward of -0.01. Thus, the goal of each agent is to keep the ball in play. The observation space consists of 8 variables corresponding to the position and velocity of the ball and racket. Each agent receives its own, local observation. Two continuous actions are available, corresponding to movement toward (or away from) the net, and jumping.

The task is episodic, and in order to solve the environment, agents must get an average score of +0.5 (over 100 consecutive episodes, after taking the average over both agents).

Getting Started

Note: This guide was only tested for macOS with pyenv for python versions management.

Please follow these steps to be able to run this project:

Install build tools (such as C++ compiler and etc.) by installing Xcode and then Xcode command-line tools following one of the various guides .
Install dependencies. It is highly recommended to install all dependencies in virtual environment (see guide).
- Install Unity ML-Agents Toolkit following instruction from this page (official GitHub of Unity ML-Agents Toolkit). It is very likely that most of you will only need to install mlagents and unityagents packages with the following command:
```
pip install mlagents unityagents
```
  It is highly recommended to use Python not higher then 3.7, because TesnsorFlow (one of the dependency for mlagents) is only compatible with Python 3.7).
- Install PyTorch with
```
pip insall torch torchvision
```
  Please see official installation guide for more information.
Alternatively, it is possible to install required dependencies using requirements.txt. To do that jsut run the following command in your terminal (preferably in project's virtual environment):
```
pip install -r requirements.txt
```
Please note: this method is a bit of an overkill and has some packages that are not really used in this project. In fact this is a "R&D" environment for the experiments and testing.
Download the environment from one of the links below. You need only select the environment that matches your operating system:
- Linux: click here
- Mac OSX: click here
- Windows (32-bit): click here
- Windows (64-bit): click here
Place the file in the root of the project repository.
Run main.py from terminal with
```
python main.py
```
or simply run main.py in your IDE.
Run visualize.py to see intelligent agent with
```
python vizualize.py
```
or, again, simply run this file in you IDE.

Technical report

Check out technical report for implementation details.

andre1m / multi-agent-ddpg Goto Github PK

multi-agent-ddpg's Introduction

Multi-Agent Continuous Control in Collaborative-Competitive environment

Introduction

Getting Started

Technical report

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent