
William's Projects

alphanlholdem

An unofficial implementation of AlphaHoldem, a 1v1 no-limit hold'em AI.

annotated_deep_learning_paper_implementations

🧑‍🏫 59 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

approx_discrete_blotto

In the (discrete) Colonel Blotto game, a special strategy is proposed and proved to be an approximate equilibrium. The numerical experiments in this repository evaluate the approximation error of using this strategy.

awesome-meta-learning

A curated list of Meta Learning papers, code, books, blogs, videos, datasets and other resources.

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

cleanrl

High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

counterfactual-regret-minimization

A Python implementation of the CFR+ and PureCFR algorithms for finding optimal strategies in 2-player extensive-form games. Counterfactual regret minimization was also used in Libratus, the best poker AI in the world.

deep-cfr

Scalable Implementation of Deep CFR and Single Deep CFR

deepstack-leduc

Example implementation of the DeepStack algorithm for no-limit Leduc poker

ensemble-pytorch

Implementation of ensemble methods in PyTorch to boost the performance of your deep learning model.

game_theory

Implementing Algorithms for Computing Stackelberg Equilibria in Security Games

gr2

Appendix and Code for Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning

holdem-agent

In this project, we implement a PPO-based RL agent for no-limit Texas hold'em. This is joint work with Rahul Bhatia, Steven Friedman, and Nicole Giannopoulou as part of the Advanced Machine Learning class at Columbia University in Fall 2020.

kmeans

K-means clustering with multi-GPU capabilities

llmsurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

malib

A general-purpose multi-agent training framework.

marl-papers

Paper list of multi-agent reinforcement learning (MARL)
