Implementation of various RL algorithms and learning resources
- Foundations of RL
- Markov Decision Process
- Bellman Equation
- Value Iteration
- Policy Iteration
- Q Tables
- Deep Q-Networks(DQN)
- Deep Deterministic Policy Gradient(DDPG)
- POMDPS
- Proximal Policy Optimization(PPO)
- Temporal Difference Learning