Giter Club home page Giter Club logo

reinforcement_learning's Introduction

Reinforcement_Learning

Here is the list of course projects done during Reinforcement Learning course.

  1. cs18s038_PA1 Course Project

    • Implementation of epsilon-greedy and related plots.
    • Implementation of soft-max algorithm and related plots.
    • Implementation of UCB1 algorithm and comparing with epsilon-greedy and soft-maxand related plots.
    • Implementation of Median algorithm and comparing with epsilon-greedy, soft-maxand UCB1 and related plots.
    • Comparison of the above four algorithm as the number of arm grows.

    Note: For more detailed analysis of each observation and inferernce drawn from it, Please have a look at full report Click here.

  2. cs18s038_PA2 Course Project

    PUDDLE WORLD!

    picture alt

    • Implementation of Sarsa for different goals and related plots in puddle world.
    • Implementation of Q-Learning for different goals and related plots in puddle world.
    • Implementation of Sarsa Lambda for different goals and related plots in puddle world.

    picture alt

    • Implementation of Policy gradient on above environments. Experiment involves-

      • Hyper parameter tuning.
      • Value function visualisation.
      • Trajectory and policy.
      • Inference from observation .

      Note: For more detailed analysis of each observation and inferernce drawn from it, Please have a look at full report Click here.

3.cs18s038_PA3 Course Project

Learning with options

picture alt

  • Implementation of SMDP-Q Learning.
  • Implementation of Intra option Q-Learning.
  • Visualisation of Q-values.
  • Visualisation of V-values.
  • Analysis of Observations and Inference

picture alt

Balancing CartPole

Problem solved using DQN model. Experiment involves-

  • Implementation of DQN and related plots.
  • Best hyperparameters.
  • Observations and inference.
  • Playing with Replay memory and Target network .

Note: For more detailed analysis of each observation and inferernce drawn from it, Please have a look at full report Click here.

reinforcement_learning's People

Contributors

rajanskumarsoni avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.