PPO_tf
Implementation of proximal policy optimization(PPO) using tensorflow
environment
CartPole-v0 of open ai gym
state space: continuous
action space: discrete
dependencies
python3.6
tensorflow v1.4
open ai gym
Training
python main.py
Test trained policy
python test_policy.py
Tensorboard
tensorboard --logdir=log
LICENSE
MIT ICENSE