Giter Club home page Giter Club logo

proximal-policy-optimization-lunarlander-v2's Introduction

近端策略优化-月球登录训练实例

关于程序

如果你有足够的数学知识,这是一个相当简单,且有趣的项目。
不得不说这个项目的效果超出了我的预期,对于新手而言可以尝试一番。
命名都为中文,尽可能地贴近其所描述的含义。
是我仿照自《强化学习实战系列(2020最新)》唐老师的视频课程所提供的英文源代码(这里没有它)。

文件说明

《强化学习数学公式.docx》这个文件里面是基础的数学公式,我有对公式的组成进行说明;
《主要.py》这里是用来来训练的;
《测试模型.py》这个是用来测试模型的;
《近端策略优化_LunarLander-v2.pth》这个是我自己训练好的模型。

其他

gym官网,用于强化学习的标准 API,以及各种参考环境的集合,里面集成了不少游戏。

proximal-policy-optimization-lunarlander-v2's People

Contributors

zozero avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.