williamyuanv0 Goto Github PK
Name: William
Type: User
Name: William
Type: User
An unoffical implementation of AlphaHoldem. 1v1 nl-holdem AI.
🧑🏫 59 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
In (discrete) Colonel Blotto game, a special strategy is proposed and proved to be an approximate equilibrium. The following numerical experiments are used to evaluate the approximation error in using this strategy.
A curated list of Meta Learning papers, code, books, blogs, videos, datasets and other resources.
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
程序员应该访问的最佳网站中文版
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Implemented the CFR+ and PureCFR algorithms in Python to find the optimal strategies to 2-player extensive-form games, which was also used in Libratus, the best poker AI in the world
Scalable Implementation of Deep CFR and Single Deep CFR
PyTorch implementations of deep reinforcement learning algorithms and environments
Notes about courses Dive into Deep Learning by Mu Li
Example implementation of the DeepStack algorithm for no-limit Leduc poker
Implementation of ensemble methods in Pytorch to boost the performance of your deep learning model.
Check out the new game server:
Implementing Algorithms for Computing Stackelberg Equilibria in Security Games
Appendix and Code for Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning
https://hrl.boyuai.com/
just another repositoty
In this project, we implement a PPO-based RL agent for no-limit texas hold'em. This is joint work with Rahul Bhatia, Steven Friedman, and Nicole Giannopoulou as part of the Advanced Machine Learning class at Columbia University in Fall 2020.
kmeans clustering with multi-GPU capabilities
The official GitHub page for the survey paper "A Survey of Large Language Models".
A general-purpose multi-agent training framework.
A Multi-agent Learning Framework
Paper list of multi-agent reinforcement learning (MARL)
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.