Name: Vidyasagar Sadhu
Type: User
Company: SRI International
Bio: I am passionate about research being done at the intersection of deep learning (CNNs, LSTMs), reinforcement learning (e.g., A3C, TRPO) and multi-agent systems.
Twitter: vshssvs7
Location: Menlo Park, CA, USA
Vidyasagar Sadhu's Projects
Ray tutorials from Anyscale
Scenarios, tutorials and demos for Autonomous Driving
A curated list of awesome Deep Learning tutorials, projects and communities.
BabyAI platform. A testbed for training agents to understand and execute language commands.
Let us control diffusion models
Fast trajectory replanning
Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)
A PyTorch implementation of Deep SAD, a deep Semi-supervised Anomaly Detection method.
A set of Deep Reinforcement Learning Agents implemented in TensorFlow.
High-quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
PDF downloads of all educative.io courses available through the free student subscription in the GitHub Student Pack
PyTorch tutorials and best practices.
gpt4all: a chatbot trained on a massive collection of clean assistant data including code, stories and dialogue
Code for "Text Generation from Knowledge Graphs with Graph Transformers"
PyTorch implementation of Guided Image Filtering
Minimalistic gridworld package for OpenAI Gym
Public implementation of Heterogeneous Policy Networks (HetNet) from AAMAS'22 -- Paper Title: Learning Efficient Diverse Communication for Cooperative Heterogeneous Teaming
Code for ICLR 2019 paper: Learning when to Communicate at Scale in Multiagent Cooperative and Competitive Tasks
ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models (ICCV 2021 Oral)
Release for Improved Denoising Diffusion Probabilistic Models
(NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation
Goal-driven language generation using knowledge graph A2C agents
KG-BERT: BERT for Knowledge Graph Completion
A collection of multi-agent environments based on OpenAI Gym.
Implementation of "MADiff: Offline Multi-agent Learning with Diffusion Models"
Repo containing code for multi-agent deep reinforcement learning (MADRL).