Giter Club home page Giter Club logo

supermariobros-ppo-pytorch's Introduction

Supermariobros-PPO-pytorch

基于超级玛丽游戏的pytorch版本强化学习实践教程

rl(ppo) course with super-mario-bros

你可以直接在jupyter notebook中开始学习(course.ipynb、course2.ipynb)

run the code with docker (推荐)

play with docker (ON your local computer with display),just run:

推荐使用docker直接运行,可以无需关注软件环境

docker run --gpus all -v /tmp/.X11-unix:/tmp/.X11-unix registry.cn-shanghai.aliyuncs.com/tcc-public/super-mario-ppo:localdisplay 

if you want debuge the code and exec into container ,command like this:

docker run --gpus all -it -v /tmp/.X11-unix:/tmp/.X11-unix registry.cn-shanghai.aliyuncs.com/tcc-public/super-mario-ppo:localdisplay  /bin/bash

train the model:

python ppo_lstm.py

test on super-mario-bros(see the video of agent):

python test_lstm.py

run the code witch conda

conda create -n ppo python=3.7
conda activate ppo

python request:

torch torchvision
gym_super_mario_bros
spinup(要用源码安装:https://spinningup.openai.com/en/latest/user/installation.html)
opencv-python

train:

python ppo_lstm.py

test:

python test_lstm.py

learn the course in jupyter notebook:

the notebook can be find at course.ipynb、course2.ipynb

jion the rl Communication group,contact us:

remarks(添加请备注):github rl

learn more in our DRL Training camp (aliyun tianchi)

you can find some ppo info on https://tianchi.aliyun.com/specials/promotion/aicamprl

supermariobros-ppo-pytorch's People

Contributors

gaoxiaos avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

supermariobros-ppo-pytorch's Issues

这模型效果不大好~

阿里天池过来的,用你的权重进行验证发现失败率还是很高的(没成功过),总是突然不动等待时间超时,没有惩罚机制学会偷懒了

镜像拉取失败

input:
docker pull registry.cn-shanghai.aliyuncs.com/tcc-public/super-mario-ppo:race
output:

Pulling repository registry.cn-shanghai.aliyuncs.com/tcc-public/super-mario-ppo
Error: image tcc-public/super-mario-ppo:race not found

course.ipynb 缺少依赖python模块 net_py

ModuleNotFoundError Traceback (most recent call last)
in
1 #导入实验需要的包
----> 2 from nes_py.wrappers import JoypadSpace
3 import gym_super_mario_bros
4 from gym_super_mario_bros.actions import SIMPLE_MOVEMENT

ModuleNotFoundError: No module named 'nes_py'

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.