Movatterモバイル変換

Skip to content

#

thompson-sampling

Here are 107 public repositories matching this topic...

Language:All

Filter by language

All107 Python52 Jupyter Notebook34 Go3 JavaScript2 MATLAB2 TypeScript2 C++1 D1 Julia1 PHP1

Sort:Most stars

Sort options

Most stars Fewest stars Most forks Fewest forks Recently updated Least recently updated

sail-sg /oat

🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.

thompson-sampling alignment reasoning distributed-training ppo dueling-bandits dpo distributed-rl llm online-rl rlhf llm-aligment online-alignment llm-exploration grpo r1-zero

UpdatedApr 20, 2025
Python

alison-carrera /onn

Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)

reinforcement-learning neural-network pytorch thompson-sampling reinforcement-learning-algorithms machine-learning-library neural-architecture-search contextual-bandits mab pytorch-implemention multiarmed-bandits pytorch-implementation thompson-algorithm

UpdatedDec 11, 2019
Python

alison-carrera /mabalgs

👤 Multi-Armed Bandit Algorithms Library (MAB) 👮

arm algorithm reinforcement-learning simulation monte-carlo rank thompson-sampling reinforcement-learning-algorithms ucb reward multi-armed-bandit montecarlo-simulation contextual-bandits ranking-algorithm mab ranked-mab

UpdatedSep 6, 2022
Python

Eric-Bradford /TS-EMO

This repository contains the source code for “Thompson sampling efficient multiobjective optimization” (TSEMO).

machine-learning matlab thompson-sampling multi-objective-optimization genetic-algorithms black-box-optimization gaussian-processes bayesian-optimization kriging expensive-to-evaluate-functions surrogate-based-optimization spectral-sampling

UpdatedJun 19, 2020
MATLAB

mab

stitchfix /mab

Library for multi-armed bandit selection strategies, including efficient deterministic implementations of Thompson sampling and epsilon-greedy.

go golang data-science reinforcement-learning thompson-sampling experimentation multi-armed-bandits multi-armed-bandit thompson multiarmed-bandits

UpdatedApr 8, 2025
Go

andrecianflone /thompson

Thompson Sampling Tutorial

reinforcement-learning thompson-sampling bandit bandit-algorithm

UpdatedJan 25, 2019
Jupyter Notebook

farhanchoudhary /Machine_Learning_A-Z_All_Codes_and_Templates

All codes, both created and optimized for best results from the SuperDataScience Course

natural-language-processing reinforcement-learning deep-learning clustering cross-validation naive-bayes-classifier thompson-sampling neural-networks classification dimensionality-reduction grid-search principal-component-analysis clustering-algorithm upper-confidence-bounds k-fold xgboost-algorithm association-rule-learning machine-learning-az

UpdatedNov 5, 2017
Python

Nikronic /Machine-Learning-Models

In This repository I made some simple to complex methods in machine learning. Here I try to build template style code.

reinforcement-learning random-forest svm naive-bayes linear-regression cnn thompson-sampling xgboost pca logistic-regression apriori lda ann decision-tree nlp-machine-learning k-nn support-vector-regression kernel-pca eclat upper-confidence-bound

UpdatedNov 7, 2021
Python

niffler92 /Bandit

Bandit algorithms

simulation thompson-sampling multiarm-bandit contextual-bandit bandit-algorithms linucb

UpdatedOct 12, 2017
Python

michaelosthege /pyrff

pyrff: Python implementation of random fourier feature approximations for gaussian processes

thompson-sampling gaussian-processes bayesian-optimization

UpdatedMar 18, 2025
Jupyter Notebook

antoine-hochart /bandit_algo_evaluation

Offline evaluation of multi-armed bandit algorithms

thompson-sampling epsilon-greedy policy-evaluation multi-armed-bandit upper-confidence-bound

UpdatedDec 1, 2020
Python

RonyAbecidan /Neural-Thompson-Sampling

Study of the paper 'Neural Thompson Sampling' published in October 2020

neural-network thompson-sampling multi-armed-bandits non-linear-optimization contextual-bandits neural-tangent-kernel neural-thompson-sampling

UpdatedSep 27, 2022
Jupyter Notebook

nphdang /Bandit-BO

Bayesian Optimization for Categorical and Continuous Inputs

machine-learning optimization thompson-sampling hyperparameter-optimization hyperopt gaussian-processes bayesian-optimization multi-armed-bandits hyperparameter-tuning automl automated-machine-learning smac categorical-variables continuous-variable acquisition-functions gpyopt batch-bayesian-optimization

UpdatedJul 20, 2020
Python

v-i-s-h /MAB.jl

A Julia Package for providing Multi Armed Bandit Experiments

reinforcement-learning julia julia-language thompson-sampling reinforcement-learning-algorithms multi-arm-bandits ucb julia-package exp julialang mab bandit-experiments

UpdatedJul 19, 2018
Julia

akshaykhadse /reinforcement-learning

Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay

reinforcement-learning linear-programming thompson-sampling epsilon-greedy ucb policy-evaluation mdps multi-armed-bandits policy-iteration randomised-algorithms reinforcement-learning-excercises kl-divergence markovian-epidemic-processes reinforcement-learning-analysis multiarm-bandit ucb1 howards-pi batch-switching randomized-policy-iteration

UpdatedMay 21, 2018
Python

ZIYU-DEEP /Awesome-Papers-on-Combinatorial-Semi-Bandit-Problems

A curated list on papers about combinatorial multi-armed bandit problems.

thompson-sampling multi-armed-bandit combinatorial-optimization bandit-algorithms combinatorial-bandit

UpdatedMay 10, 2021

aijunbai /thompson-sampling

Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs

mdp mcts thompson-sampling pomdps

UpdatedJun 20, 2016
C++

Correlated-AoI-Bandits

ishank-juneja /Correlated-AoI-Bandits

Author's implementation of the paper Correlated Age-of-Information Bandits.

thompson-sampling ucb multi-armed-bandit aoi age-of-information correlated-multi-armed-bandits correlated-arms aoi-regret

UpdatedJun 19, 2021
Python

sharmaroshan /Ads-Optimization

Optimizing the best Ads using Reinforcement learning Algorithms such as Thompson Sampling and Upper Confidence Bound.

data-science reinforcement-learning eda data-visualization thompson-sampling data-analysis beginner upper-confidence-bound

UpdatedMay 24, 2019
Jupyter Notebook

atse0612 /Machine-Learning-A-Z

python r random-forest numpy linear-regression regression pandas random-generation naive-bayes-classifier thompson-sampling logistic-regression matplotlib kernel-support polynomial-regression upper-confidence-bounds bayesian-statistics model-building

UpdatedMar 29, 2018
Jupyter Notebook

Improve this page

Add a description, image, and links to thethompson-sampling topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thethompson-sampling topic, visit your repo's landing page and select "manage topics."

[8]ページ先頭

©2009-2025 Movatter.jp