Giter Club home page Giter Club logo

rl-book's Introduction

强化学习:理论与Python实现

本书介绍强化学习理论及其 Python 实现。

  • 理论方面:全书用一套完整的数学体系,严谨地讲授强化学习的理论基础,主要定理均给出证明过程。各章内容循序渐进,覆盖了所有主流强化学习算法,包括资格迹等经典算法和柔性执行者/评论者等深度强化学习算法。
  • 实现方面:在您最爱的操作系统(包括 Windows、macOS、Linux)上,基于最新的 Python 3.7、Gym 0.12 和 Tensorflow 2(兼容 Tensorflow 1),实现强化学习算法。全书实现统一规范,体积小、重量轻。第 1~9 章给出了算法的配套实现,环境部分只依赖于 Gym 的最小安装,在没有 GPU 的计算机上也可运行;第 10~12 章介绍了多个热门综合案例,涵盖 Gym 的完整安装和自定义扩展,在有普通 GPU 的计算机上即可运行。

目录

  1. 初识强化学习
  2. Markov决策过程
  3. 有模型数值迭代
  4. 回合更新价值迭代
  5. 时序差分价值迭代
  6. 函数近似方法
  7. 回合更新策略梯度方法
  8. 执行者/评论者方法
  9. 连续动作空间的确定性策略
  10. 综合案例:电动游戏
  11. 综合案例:棋盘游戏
  12. 综合案例:自动驾驶

Reinforcement Learning: Theory and Python Implementation

We support Python 3.7, Gym 0.12 and Tensorflow 2.

Table of Contents

  1. Introduction of Reinforcement Learning
  2. Markov Decision Process
  3. Model-based Numeric Iteration
  4. Monte-Carlo Learning
  5. Temporal Difference Learning
  6. Function Approximation
  7. Policy Gradient
  8. Actor-Critic
  9. Deterministic Policy Gradient
  10. Case Study: Video Game
  11. Case Study: Board Game
  12. Case Study: Self-Driving Car

BibTeX

@book{xiao2019,
 title     = {Reinforcement Learning: Theory and {Python} Implementation},
 author    = {Zhiqing Xiao}
 year      = 2019,
 publisher = {China Machine Press},
}

rl-book's People

Contributors

zhiqingxiao avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.