Topic: bandit-algorithms

Something interesting about bandit-algorithms

👇 Here are 85 public repositories matching this topic...

aayushmanghosh / rl-algorithms-for-ibmi-applications

bandit-algorithms,Official repository for Reinforcement Learning Decoders used for intra-cortical brain machine interfaces - IEEE TNNLS 2023

User: aayushmanghosh

brain-machine-interface hardware-optimization reinforcement-learning bandit-algorithms neural-network

alanthink / banditpylib

bandit-algorithms,A lightweight Python library for bandit algorithms

User: alanthink

Home Page: https://alanthink.github.io/banditpylib-doc/

bandit-algorithms

albertopirillo / ola-project-2023

bandit-algorithms,Pricing and advertising strategy for the e-commerce of an airline company, based on Multi-Armed Bandits (MABs) algorithms and Gaussian Processes. Simulations include non-stationary environments.

User: albertopirillo

bandit-algorithms marketing-automation online-learning reinforcement-learning

amirbalef / ps_momab

bandit-algorithms,Multi-Objective Multi-Armed Bandit

User: amirbalef

bandit-algorithms multi-armed-bandit multi-objective non-stationary ucb-algorithm

amirhosein-mesbah / reinforcement_learning

bandit-algorithms,This repository contains the implementation of a wide variety of Reinforcement Learning Projects in different applications of Bandit Algorithms, MDPs, Distributed RL and Deep RL. These projects include university projects and projects implemented due to interest in Reinforcement Learning.

User: amirhosein-mesbah

bandit-algorithms deep-reinforcement-learning deeprl distributed-reinforcement-learning mdp multi-agent-reinforcement-learning network-routing off-policy on-policy reinforcement-learning

anishacharya / bandits-online-learning

bandit-algorithms,Simple implementations of bandit algorithms in Python

User: anishacharya

online-learning online-learning-python online-learning-algorithms bandit bandit-algorithms bandit-learning bandits multi-armed-bandits

babaniyi / deep-contextual-bandits

bandit-algorithms,A benchmark to test decision-making algorithms for contextual bandits. The library implements a variety of algorithms (many of them based on approximate Bayesian neural networks and Thompson sampling), and a number of real and synthetic data problems exhibiting a diverse set of properties.

User: babaniyi

bandit-algorithms bandits multiarmed-bandits

c-bata / goptuna

bandit-algorithms,A hyperparameter optimization framework, inspired by Optuna.

User: c-bata

Home Page: https://pkg.go.dev/github.com/c-bata/goptuna

bayesian-optimization bandit-algorithms evolution-strategies blackbox-optimization

chunjenpeng / pybandit

bandit-algorithms,Bandit and Evolutionary Algorithms using Python

User: chunjenpeng

python evolutionary-algorithms pso aco bandit-algorithms optimization cmaes

dkimpara / bandit_oco

bandit-algorithms,Extending Agarwal, Dekel, and Xiao (2010) to the online convex optimization setting with experiments.

User: dkimpara

bandit-algorithms convex-optimization online-convex-optimization

doerlbh / banditzoo

bandit-algorithms,Python library of bandits and RL agents in different real-world environments

User: doerlbh

bandit bandit-algorithms bandits reinforcement-learning simulation

doerlbh / minivox

bandit-algorithms,Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".

User: doerlbh

speaker-diarization paper speaker-recognition online-learning bandit-algorithms contextual-bandits interspeech2020 interspeech acml self-supervised-learning online-speaker-diarization

duongnhatthang / meta-bandit

bandit-algorithms,Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms

User: duongnhatthang

bandit meta-learning python3 partial-monitoring sequential-decisions sequential-decision-making-problems multi-task bandit-algorithms meta-bandit

duruii / replica-aucb

bandit-algorithms,🐯REPLICA of "Auction-based combinatorial multi-armed bandit mechanisms with strategic arms"

User: duruii

aucb aution bandit-algorithms bandits cmab mab multi-armed-bandit

enigmadata / rlinc

bandit-algorithms,Reinforcement Learning Starters Package for Multi-arm Bandits Problem

User: enigmadata

bandit-algorithms python reinforcement-learning-agent starter-kit

gdmarmerola / advanced-bandit-problems

bandit-algorithms,More about the exploration-exploitation tradeoff with harder bandits

User: gdmarmerola

machine-learning bandit-algorithms multi-armed-bandit

gjjvdburg / thompsonsampling

bandit-algorithms,Source code for blog post on Thompson Sampling

User: gjjvdburg

Home Page: https://gertjanvandenburg.com/blog/thompson_sampling/

thompson-sampling bandit-algorithms multi-armed-bandit multiarmed-bandits

gohjiayi / beer_recommender

bandit-algorithms,Building a beer recommender using collaborative filtering and bandit algorithms, and evaluating the best performing technique.

User: gohjiayi

bandit-algorithms collaborative-filtering machine-learning

gokceuludogan / interactive-music-recommendation

bandit-algorithms,Personalized and Interactive Music Recommendation with Bandit approach

User: gokceuludogan

music-recommendation bandit-algorithms bayes-ucb exploration-exploitation

guptav96 / bandit-algorithms

bandit-algorithms,A short implementation of bandit algorithms - ETC, UCB, MOSS and KL-UCB

User: guptav96

reinforcement-learning bandit-algorithms exploration-exploitation

hins-hu / bandit-algorithms

bandit-algorithms,An illustrative project including some multi-armed bandit algorithms and contextual bandit algorithms

User: hins-hu

bandit-algorithms multi-armed-bandit contextual-bandit

jayrcausal / essential3crl

bandit-algorithms,Research on causality-based reinforcement learning. This repository includes the needed fundamentals, a summary of past work, and some of the most recent developments.

User: jayrcausal

causal-inference causality reinforcement-learning bandit-algorithms covariate-shift domain-adaptation

jia-yi-chen / bandit-and-reinforcement-learning

bandit-algorithms,Python implementation for Reinforcement Learning algorithms -- Bandit algorithms, MDP, Dynamic Programming (value/policy iteration), Model-free Control (off-policy Monte Carlo, Q-learning)

User: jia-yi-chen

reinforcement-learning bandit-algorithms q-learning monte-carlo dynamic-programming markov-decision-processes grid-world multi-armed-bandit

junjiedong / warfarin-bandit

bandit-algorithms,Contextual Bandit algorithms for Warfarin Treatment

User: junjiedong

bandit bandit-algorithms warfarin

kkeishiro / yahoo_recommendation

bandit-algorithms,Yahoo! news article recommendation system using LinUCB

User: kkeishiro

linucb contextual-bandit bandit-algorithms recommendation-system

kulinshah98 / multi-armed-bandit-algorithms

bandit-algorithms,Python implementation of UCB, EXP3 and Epsilon greedy algorithms

User: kulinshah98

multi-armed-bandits bandit-algorithms stochastic-bandit-algorithms upper-confidence-bounds epsilon-greedy adversarial-bandit-algorithms exp3-algorithm

luke-davidson / reinforcementlearning

bandit-algorithms,Programming assignments completed for my Reinforcement Learning course: Topics include Bandit Algorithms, Dynamic Programming, policy iteration, Monte-Carlo methods, SARSA, Q-Learning, Dyna-Q/Dyna-Q+, gradient control methods, state aggregation methods, and Deep Q-Learning Networks (DQNs).

User: luke-davidson

bandit-algorithms deep-learning deep-q-network deep-reinforcement-learning dyna-q dynamic-programming gradient-descent-algorithm monte-carlo policy-gradient policy-iteration

maxencegiraud / machinelearningalgos

bandit-algorithms,Personal reimplementation of some ML algorithms for learning purposes

User: maxencegiraud

machine-learning machine-learning-algorithms deep-learning lda qda knn naive-bayes decision-tree random-forest clustering

mifa-lab / ldpbandit2020

bandit-algorithms,Implementation for NeurIPS 2020 paper "Locally Differentially Private (Contextual) Bandits Learning" (https://arxiv.org/abs/2006.00701)

Organization: mifa-lab

differential-privacy bandit-algorithms numpy

mknbv / zamburak

bandit-algorithms,Bandit algorithms in OCaml

User: mknbv

bandit-algorithms adversarial-bandit ucb exp3 trading stochastic-bandit ocaml

mmalekzadeh / privacy-preserving-bandits

bandit-algorithms,Privacy-Preserving Bandits (MLSys'20)

User: mmalekzadeh

Home Page: https://proceedings.mlsys.org/paper/2020/hash/42a0e188f5033bc65bf8d78622277c4e-Abstract.html

bandit-algorithms differential-privacy machine-learning online-machine-learning reinforcement-learning contextual-bandits privacy-preserving-machine-learning privacy-preserving-bandits criteo-dataset federated-learning

naereen / kullback-leibler-divergences-and-kl-ucb-indexes

bandit-algorithms,🐍 🔬 Fast Python implementation of various Kullback-Leibler divergences for 1D and 2D parametric distributions. Also provides optimized code for kl-UCB indexes

User: naereen

Home Page: https://naereen.github.io/Kullback-Leibler-divergences-and-kl-UCB-indexes/docs/index.html

kullback-leibler-divergence kl-ucb bandit-algorithms divergence numba cython python-library

naereen / kullbackleibler.jl

bandit-algorithms,💫 Fast Julia implementation of various Kullback-Leibler divergences for 1D parametric distributions. 🏋 Also provides optimized code for kl-UCB indexes

User: naereen

Home Page: https://naereen.github.io/KullbackLeibler.jl/docs/index.html

kullback-leibler-divergence kl-ucb bandit-algorithms divergence julia-package

ngutowski / algossim

bandit-algorithms,This repository aims to help readers learn the most popular MAB and CMAB algorithms and watch how they run. It is intended for those who want to start learning these topics.

User: ngutowski

bandit-algorithms artificial-intelligence-algorithms recommendation-system contextual-bandits

nicoleorzan / multi-armed-bandit-rl

bandit-algorithms,C++ implementation of Multi-Armed Bandits (Gaussian and Bernoulli)

User: nicoleorzan

multi-armed-bandits reinforcement-learning softmax-policy bernoulli-bandit gaussian-bandit softmax ucb bandit-algorithms

niffler92 / bandit

bandit-algorithms,Bandit algorithms

User: niffler92

multiarm-bandit contextual-bandit bandit-algorithms thompson-sampling simulation linucb

niravnb / multi-armed-bandit-algortihms

bandit-algorithms,Implementation of well-known bandit algorithms in Python: Explore-then-Commit, UCB, and Thompson sampling.

User: niravnb

bandit-algorithms

rasros / combo

bandit-algorithms,

User: rasros

kotlin kotlin-library optimization bandit-algorithms bandit-learning ab-testing genetic-algorithm

rssalessio / dpe

bandit-algorithms,DPE code - Code used in "Optimal Algorithms for Multiplayer Multi-Armed Bandits" (AISTATS 2020)

User: rssalessio

dpe bandit-algorithms multi-armed-bandits multiplayer-multi-armed-bandits aistats aistats-2020

rssalessio / reading-list

bandit-algorithms,This is a collection of interesting papers that I have read so far or want to read. Note that the list is not up-to-date. Topics: reinforcement learning, deep learning, mathematics, statistics, bandit algorithms, optimization.

User: rssalessio

bandit-algorithms deep-learning machine-learning reading-list reinforcement-learning learning optimization statistics

sagarnandeshwar / bandit_algorithms

bandit-algorithms,Reinforcement Learning (COMP 579) Project

User: sagarnandeshwar

bandit-algorithms bernoulli-distribution epsilon-greedy exploration-exploitation reinforcement-learning thompson-sampling ucb

shashankp914 / over-the-wire-wargames-solutions

bandit-algorithms,Detailed solutions to the OverTheWire wargames, currently covering Bandit, with more planned in the future.

User: shashankp914

ctf cybersecurity linux open-source overthewire bandit bandit-algorithms bandit-learning

smpybandits / smpybandits

bandit-algorithms,🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorithms for single-player (UCB, KL-UCB, Thompson...) and multi-player (MusicalChair, MEGA, rhoRand, MCTop/RandTopM, etc.). Available on PyPI: https://pypi.org/project/SMPyBandits/ and documentation on

Organization: smpybandits

Home Page: https://SMPyBandits.github.io/

research multi-arm-bandits internet-of-things simulations python open-source learning-theory cognitive-radio bandit-algorithms multi-armed-bandit