Deep Implicit Coordination Graphs (DICG)

Installation

Recommend to run code within a conda virtual environment. Create a virtual environment by

conda create -n dicg python=3.7

Activate the virtual environment by running

conda activate dicg

Install dependencies by running the following command in the root directory of this repo (in the virtual environment):

pip install -r requirements.txt

The predator-prey environment and the traffic junction environment are included in this repo. To run the StarCraft Multi-agent Challenge (SMAC) environment, please follow the instructions here. You will need to install the full StarCraft II game as well as the SMAC maps.

Running Experiments

Predator-Prey

Experiment runners are in /exp_runners/predatorprey/, detailed argument specification list can be found in the included runner files (e.g. detailed environment size, number of agents, network sizes, algorithm hyperparemters, etc.). To train in Predator-Prey, please cd into /exp_runners/predatorprey/ and run following commands by replacing penalty P with desired values (0, -1, -1.25 or -1.5):

Approach	Command
DICG-CE-MLP	`python dicg_ce_predatorprey_runner.py --penalty P`
DICG-DE-MLP	`python dicg_de_predatorprey_runner.py --penalty P`
CENT-MLP	`python cent_predatorprey_runner.py --penalty P`
DEC-MLP	`python dec_predatorprey_runner.py --penalty P`

The model checkpoints will be saved in /exp_runners/predatorprey/data/local/.

Reported average return (average of 5 random seeds, utill 7.5e6 environment steps):

Approach	`P = 0`	`P = -1`	`P = -1.25`	`P = -1.5`
DICG-CE-MLP	78.6 ± 0.1	70.2 ± 0.7	72.0 ± 1.0	72.0 ± 0.7
DICG-DE-MLP	78.2 ± 0.0	70.5 ± 2.7	72.4 ± 1.1	70.0 ± 2.7
CENT-MLP	75.0 ± 0.1	-39.8 ± 3.8	-52.28 ± 1.3	-65.8 ± 2.9
DEC-MLP	74.4 ± 0.3	56.6 ± 0.7	56.7 ± 1.1	57.9 ± 0.3

SMAC

Experiment runners are in /exp_runners/smac/, detailed argument specification list can be found in the included runner files. To train in SMAC, please cd into /exp_runners/smac/ and run following commands by replacing MAP with desired scenario strings (8m_vs_9m, 3s_vs_5z or 6h_vs_8z):

Approach	Command
DICG-CE-LSTM	`python dicg_ce_smac_runner.py --map MAP`
DICG-DE-LSTM	`python dicg_de_smac_runner.py --map MAP`
CENT-LSTM	`python cent_smac_runner.py --map MAP`
DEC-LSTM	`python dec_smac_runner.py --map MAP`

The model checkpoints will be saved in /exp_runners/smac/data/local/.

Reported average win rate (average of 5 random seeds, utill 9e6 environment steps for 8m_vs_9m and 1.8e7 environment steps for the other two maps):

Approach	`8m_vs_9m`	`3s_vs_5z`	`6h_vs_8z`
DICG-CE-LSTM	72 ± 11 %	96 ± 3 %	9 ± 9 %
DICG-DE-LSTM	87 ± 6 %	99 ± 1 %	0
CENT-LSTM	42 ± 6 %	0	0
DEC-LSTM	65 ± 16 %	94 ± 5 %	0

Traffic Junction

Experiment runners are in /exp_runners/traffic/, detailed argument specification list can be found in the included runner files.

To train in Traffic Junction, please cd into /exp_runners/traffic/ and run following commands by replacing difficulty D with desired setting strings (easy, medium or hard):

Approach	Command
DICG-CE-MLP	`python dicg_ce_traffic_runner.py --difficulty D`
DICG-DE-MLP	`python dicg_de_traffic_runner.py --difficulty D`
CENT-MLP	`python cent_traffic_runner.py --difficulty D`
DEC-MLP	`python dec_traffic_runner.py --difficulty D`

The model checkpoints will be saved in /exp_runners/traffic/data/local/.

Reported average success rate (average of 5 random seeds, utill 2.5e6 environment steps):

Approach	`easy`	`medium`	`hard`
DICG-CE-MLP	98.1 ± 1.9 %	80.5 ± 6.8 %	22.8 ± 4.6 %
DICG-DE-MLP	95.6 ± 1.5 %	90.8 ± 2.9 %	82.2 ± 6.0 %
CENT-MLP	97.7 ± 0.9 %	0	0
DEC-MLP	90.2 ± 6.5 %	81.3 ± 4.8 %	69.4 ± 4.9 %

minglou1984 / dicg Goto Github PK

dicg's Introduction

Deep Implicit Coordination Graphs (DICG)

Installation

Running Experiments

Predator-Prey

SMAC

Traffic Junction

dicg's People

Contributors

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent