williamyuanv0 Goto Github PK

followers: 4.0 following: 5.0 repos: 63.0 gists: 0.0

Name: William

Type: User

William's Projects

alphanlholdem

An unoffical implementation of AlphaHoldem. 1v1 nl-holdem AI.

annotated_deep_learning_paper_implementations

🧑‍🏫 59 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

approx_discrete_blotto

In (discrete) Colonel Blotto game, a special strategy is proposed and proved to be an approximate equilibrium. The following numerical experiments are used to evaluate the approximation error in using this strategy.

ashe

awesome-meta-learning

A curated list of Meta Learning papers, code, books, blogs, videos, datasets and other resources.

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

best-websites-a-programmer-should-visit-zh

程序员应该访问的最佳网站中文版

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

counterfactual-regret-minimization

Implemented the CFR+ and PureCFR algorithms in Python to find the optimal strategies to 2-player extensive-form games, which was also used in Libratus, the best poker AI in the world

decisionholdem

deep-cfr

Scalable Implementation of Deep CFR and Single Deep CFR

deep-reinforcement-learning-algorithms-with-pytorch

PyTorch implementations of deep reinforcement learning algorithms and environments

deepholdem

deeplearning-muli-notes

Notes about courses Dive into Deep Learning by Mu Li

deepstack-leduc

Example implementation of the DeepStack algorithm for no-limit Leduc poker

diverse_psro

ensemble-pytorch

Implementation of ensemble methods in Pytorch to boost the performance of your deep learning model.

football

Check out the new game server:

game_theory

Implementing Algorithms for Computing Stackelberg Equilibria in Security Games

gr2

Appendix and Code for Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning

hands-on-rl

https://hrl.boyuai.com/

hello_world

just another repositoty

holdem-agent

In this project, we implement a PPO-based RL agent for no-limit texas hold'em. This is joint work with Rahul Bhatia, Steven Friedman, and Nicole Giannopoulou as part of the Advanced Machine Learning class at Columbia University in Fall 2020.

kmeans

kmeans clustering with multi-GPU capabilities

llmsurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

maca

malib

A general-purpose multi-agent training framework.

malib_deprecated

A Multi-agent Learning Framework

marl-papers

Paper list of multi-agent reinforcement learning (MARL)

williamyuanv0 Goto Github PK

William's Projects

Recommend Projects

Recommend Topics

Recommend Org