Reinforcement Learning: An Introduction

Python code for Sutton & Barto's book Reinforcement Learning: An Introduction (2nd Edition)

If anything in the code is unclear or you want to report a bug, please open an issue instead of emailing me directly.

Chapter 1

Tic-Tac-Toe

Chapter 2

Chapter 3

Chapter 4

Chapter 5

Chapter 6

Chapter 7

Figure 7.2: Performance of n-step TD methods on 19-state random walk
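For readers who want a feel for what the Figure 7.2 experiment computes, here is a minimal sketch of n-step TD prediction on the 19-state random walk. It is not the repository's implementation. It assumes the setup from the book: start in the middle state, move left or right with equal probability, receive -1 on leaving the left end and +1 on leaving the right end, with no discounting. The names `n_step_td_episode` and `rms_error` are hypothetical, and only the standard library is used.

```python
import math
import random

N_STATES = 19                      # non-terminal states are 1..19; 0 and 20 are terminal
START = 10                         # every episode starts in the middle state
# True state values under the random policy: -0.9, -0.8, ..., 0.9 (terminals are 0).
TRUE_VALUES = [0.0] + [(s - 10) / 10 for s in range(1, N_STATES + 1)] + [0.0]


def n_step_td_episode(values, n, alpha, rng, gamma=1.0):
    """Run one episode of n-step TD prediction, updating `values` in place."""
    states = [START]
    rewards = [0.0]                # rewards[t] is the reward received on entering states[t]
    T = math.inf                   # episode length, unknown until termination
    t = 0
    while True:
        if t < T:
            next_state = states[t] + rng.choice((-1, 1))
            if next_state == 0:
                reward = -1.0
            elif next_state == N_STATES + 1:
                reward = 1.0
            else:
                reward = 0.0
            states.append(next_state)
            rewards.append(reward)
            if next_state in (0, N_STATES + 1):
                T = t + 1
        tau = t - n + 1            # time step whose state estimate is updated now
        if tau >= 0:
            # n-step return: discounted rewards, plus a bootstrapped value if not terminated.
            G = 0.0
            for i in range(tau + 1, min(tau + n, T) + 1):
                G += gamma ** (i - tau - 1) * rewards[i]
            if tau + n < T:
                G += gamma ** n * values[states[tau + n]]
            s = states[tau]
            values[s] += alpha * (G - values[s])
        if tau == T - 1:
            break
        t += 1


def rms_error(values):
    """Root-mean-square error against the true values, over non-terminal states."""
    total = sum((values[s] - TRUE_VALUES[s]) ** 2 for s in range(1, N_STATES + 1))
    return math.sqrt(total / N_STATES)


if __name__ == "__main__":
    rng = random.Random(0)
    values = [0.0] * (N_STATES + 2)
    for _ in range(10):
        n_step_td_episode(values, n=4, alpha=0.4, rng=rng)
    print("RMS error after 10 episodes:", rms_error(values))
```

The figure in the book averages this error over the first 10 episodes and many independent runs while sweeping `n` and `alpha`. The sketch runs a single configuration.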

Chapter 8

Chapter 9

Figure 9.1: Gradient Monte Carlo algorithm on the 1000-state random walk task
Figure 9.2: Semi-gradient n-step TD algorithm on the 1000-state random walk task
Figure 9.5: Fourier basis vs polynomials on the 1000-state random walk task
Figure 9.8: Example of feature width’s effect on initial generalization and asymptotic accuracy
Figure 9.10: Single tiling and multiple tilings on the 1000-state random walk task
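As a rough illustration of the task these Chapter 9 figures share, the sketch below runs gradient Monte Carlo with state aggregation on a 1000-state random walk. It is not the repository's code. It assumes the book's description of the task: start at state 500, jump uniformly to one of the 100 neighbours on either side, treat jumps past an edge as termination, and give -1 on the left and +1 on the right. The grouping into 10 blocks of 100 states, the step size, and the names `step` and `gradient_mc_episode` are choices made for this example.

```python
import random

N_STATES = 1000        # non-terminal states are 1..1000; 0 and 1001 are terminal
START = 500
MAX_JUMP = 100         # each move jumps 1..100 states to the left or right
GROUP_SIZE = 100       # state aggregation: 10 groups of 100 states each
N_GROUPS = N_STATES // GROUP_SIZE


def step(state, rng):
    """Take one random-walk step; jumps past either edge end in the terminal state."""
    jump = rng.randint(1, MAX_JUMP) * rng.choice((-1, 1))
    next_state = max(0, min(N_STATES + 1, state + jump))
    if next_state == 0:
        return next_state, -1.0
    if next_state == N_STATES + 1:
        return next_state, 1.0
    return next_state, 0.0


def gradient_mc_episode(weights, alpha, rng):
    """Run one episode and apply the gradient Monte Carlo update to `weights` in place."""
    state = START
    visited = []
    reward = 0.0
    while 0 < state <= N_STATES:
        visited.append(state)
        state, reward = step(state, rng)
    # Undiscounted task with a single non-zero reward at the end,
    # so the return from every visited state equals the final reward.
    G = reward
    for s in visited:
        group = (s - 1) // GROUP_SIZE
        # With state aggregation the gradient is 1 for the active group and 0 elsewhere.
        weights[group] += alpha * (G - weights[group])


if __name__ == "__main__":
    rng = random.Random(0)
    weights = [0.0] * N_GROUPS
    for _ in range(2000):
        gradient_mc_episode(weights, alpha=0.001, rng=rng)
    print([round(w, 2) for w in weights])
```

The learned weights form a staircase approximation of the value function, one value per group. The step size here is far larger than a careful experiment would use. It was picked so the sketch converges in a few thousand episodes.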

Chapter 10

Chapter 11

Chapter 12

Chapter 13

Environment

python 3.6
numpy
matplotlib
seaborn
tqdm

Usage

All files are self-contained, so each script can be run directly:

python any_file_you_want.py

Contribution

If you want to contribute some missing examples or fix some bugs, feel free to open an issue or make a pull request.

The following figures/examples are missing:

Figure 12.14: The effect of λ
