shenjiede Goto Github PK
Name: shenjie
Type: User
Name: shenjie
Type: User
Code for MOPO: Model-based Offline Policy Optimization
Coding Demos from the School of AI's Move37 Course
Automatic object XML generation for Mujoco
A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm
multi agent RL for traffic light control in Sumo using distributed PPO
PyTorch implements multi-agent reinforcement learning algorithms, including QMIX, Independent PPO, Centralized PPO, Grid Wise Control, Grid Wise Control+PPO, Grid Wise Control+DDPG.
Hello, I pushed some python environments for Multi Agent Reinforcement Learning.
Multi & Single Agent Reinforcement Learning for Traffic Signal Control Problem
Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.
Mycroft Core, the Mycroft Artificial Intelligence platform.
Visualizer for neural network, deep learning, and machine learning models
Nyxt - the hacker's browser.
Benchmarked implementations of Offline RL Algorithms.
This is the official implementation of Multi-Agent PPO (MAPPO).
PyTorch implementation of the Option-Critic framework, Harb et al. 2016
Otter Browser aims to recreate the best aspects of the classic Opera (12.x) UI using Qt5
The fastest knowledge base for growing teams. Beautiful, realtime collaborative, feature packed, and markdown compatible.
Web application where humans can play Overcooked with AI agents.
Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)
PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tuning, ad-hoc coordination, and more.
Prioritized Experience Replay (PER) implementation in PyTorch
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
PFRL: a PyTorch-based deep reinforcement learning library
Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods.
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
Unmodified Postgres with some useful plugins
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
A clean and robust Pytorch implementation of PPO on Discrete action space
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
Managed Frappe Hosting
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.