Giter Club home page Giter Club logo

reinforcement-learning-an-introduction-chinese's Introduction

说明

因为官方翻译版本已经出版,本项目进入不定期更新维护。 请前往查看食用官方翻译版本:强化学习

reinforcement-learning-an-introduction-chinese

本项目为《Reinforcement Learning: An Introduction》(第二版)中文翻译,旨在帮助喜欢 强化学习(Reinforcement Learning)的各位能更好的学习交流。

中文在线阅读地址:《强化学习导论》 英文原版地址:Reinforcement Learning: An Introduction

cover

翻译进度:

  • 第二版前言
  • 第一版前言
  • 符号说明
  • 第1章(粗译,粗校)
  • 第2章(粗译)
  • 第3章(粗译)
  • 第4章(粗译)
  • 第5章(粗译)
  • 第6章(粗译)
  • 第7章(粗译)
  • 第8章(粗译)
  • 第9章(粗译)
  • 第10章(粗译)
  • 第11章(粗译)
  • 第12章
  • 第13章
  • 第14章
  • 第15章
  • 第16章
  • 第17章

reinforcement-learning-an-introduction-chinese's People

Contributors

andyli386 avatar cfeng avatar cuter44 avatar qiwihui avatar ritchiehuang avatar ynjxsjmh avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

reinforcement-learning-an-introduction-chinese's Issues

(第2章) 2.5 追踪非平稳问题

公式(7)下面的“注意,对于样本平均情况……”一行所说的“恒定步长参数的情况“下,α_n(α)=n应该改为α_n(α)=α?

第五章 21点例子一处翻译错误

“ 这种情况下就算庄家也是natural也判玩家赢,这种情况叫draw,游戏结束。” 翻译错了,原文是
He then wins unless the dealer also has a natural, in which case the
game is a draw.

应该翻译为:他将赢得比赛,除非庄家也有natural,这种情况下,游戏将是平局。

另外,natural翻译为:天然,天然牌,天牌?

section 10.1 的小bug

10.1的pseudocode里面 ,"如果 𝑆′不是终点"大概是译反了,多了一个否定词,应该是"如果 𝑆′终点" (更新w去下一个episode)

希望作者有空更新一下

官方译文是老师给学生安排的一人一章翻译任务,整体看来质量并不是那么好,希望作者也能跟进翻译,支持作者

译文的小bug?

第一张强化学习中的dynamical system theory theory是否应该译为动力系统理论而非动态规划(dynamic programming)?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.