Giter Club home page Giter Club logo

pymc3's Introduction

GithubProfile

SUMMARY

Staff design engineer with expertise in developing reinforcement learning algorithms to automate analog IC design process. Additionally, I have worked on side projects pertaining to other areas of deep learning such as NLP and computer vision -- some of which are highlighted in this repository.

Relevant Course Work: Advanced Analog Design, Power Electronics, Deep Reinforcement Learning, Deep Learning for NLP, Deep Learning for Computer Vision, Advanced Robotics, Intro to AI, Intro to Machine Learning, Algorithms & Data Structures, Software Fundamentals for Engineering Systems

EXPERTISE

Deep Learning

  • Transformer based networks such as BERT and GPT as well as traditional LSTM based networks
  • Improve inference performance using graph optimization and weight quantization
  • CNN architectures such as AlexNet, VGGNet, GoogLeNet for image classification as well as object detection using YOLO and SSD
  • Semantic segmentation using Fully Convolutional Networks (https://amitp-ai.medium.com/fcn-571881788e70)
  • Synthesize new images using Generative Adversarial Networks (GAN) and Variational Auto Encoders (VAE)

Deep Reinforcement Learning

  • Dynamic Programming, Bayesian Optimization, Thompson Sampling, Monte-Carlo (MC) learning
  • Temporal Difference (TD) learning: SARSA, Q-Learning, Expected SARSA, Deep Q Network (DQN), Double DQN
  • Policy Gradient Methods: Advantage Actor Critic (A2C), Deep Deterministic Policy Gradient (https://medium.com/@amitp-ai/policy-gradients-1edbbbc8de6b)

Natural Language Processing

RELEVANT PROJECTS

Banana Collection Agent (Fall 2018)

  • Trained a robot to pick the maximum number of good bananas while avoiding bad bananas.
  • Received a reward of +1 for picking a good banana and -1 for picking a bad banana.
  • State augmentation by including previous observations to transform the problem from POMDP to MDP.
  • Trained the agent (end-to-end) from raw pixels to q-values using CNN based double DQN learning algorithm.
  • For faster training, batch normalization technique was used.
  • Trained using PyTorch on Google Cloud, achieving a 100-episode average reward of 12.

Text Summarization

  • Input text was first pre-processed followed by data wrangling and data exploration.
  • Thereafter experimented with various encoder-decoder type of models using LSTM, attention based LSTM, transformers, and memory efficient transformers. Memory efficient transformers performed the best with Rouge-1 and Rouge-2 scores of 38.3 and 13.3.
  • Productionized using a Docker container deployed on an AWS EC2 instance and served using a Flask based API.

Question-Answering System on the SQuAD2.0 Dataset

  • As part of Stanford’s CS224N’s final project, I experimented with a few different architectures for this task.
  • Using Bi-Directional Attention Flow (BiDAF) network, achieved an F1 score of 62 on the validation set.
  • Then added character level embeddings (in addition to word embeddings) to BiDAF and achieved F1 of 65.
  • Thereafter built the transformer based QANet to further improve the F1 score to 70.
  • Lastly used a pretrained BERT network to further improve the F1 score.

pymc3's People

Contributors

aloctavodia avatar apatil avatar aseyboldt avatar austinrochford avatar bjedwards avatar bwengals avatar canyon289 avatar colcarroll avatar denadai2 avatar domenzain avatar eigenfoo avatar ericmjl avatar ferrine avatar gbokiau avatar hvasbath avatar jimenofonseca avatar jonathanhfriedman avatar jordan-melendez avatar jsalvatier avatar junpenglao avatar kiudee avatar kyleabeauchamp avatar kyleam avatar michaelosthege avatar springcoil avatar superbobry avatar takluyver avatar taku-y avatar twiecki avatar wesm avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.