thompson-sampling
Here are 107 public repositories matching this topic...
🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.
- Updated Apr 20, 2025 - Python
Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)
- Updated Dec 11, 2019 - Python
👤 Multi-Armed Bandit Algorithms Library (MAB) 👮
- Updated Sep 6, 2022 - Python
This repository contains the source code for “Thompson sampling efficient multiobjective optimization” (TSEMO).
- Updated Jun 19, 2020 - MATLAB
Library for multi-armed bandit selection strategies, including efficient deterministic implementations of Thompson sampling and epsilon-greedy.
- Updated Apr 8, 2025 - Go
Thompson Sampling Tutorial
- Updated Jan 25, 2019 - Jupyter Notebook
All code, both original and optimized for best results, from the SuperDataScience course
- Updated Nov 5, 2017 - Python
This repository implements machine learning methods ranging from simple to complex, written as template-style code.
- Updated Nov 7, 2021 - Python
Bandit algorithms
- Updated Oct 12, 2017 - Python
pyrff: Python implementation of random Fourier feature approximations for Gaussian processes
- Updated Mar 18, 2025 - Jupyter Notebook
Offline evaluation of multi-armed bandit algorithms
- Updated Dec 1, 2020 - Python
Study of the paper 'Neural Thompson Sampling' published in October 2020
- Updated Sep 27, 2022 - Jupyter Notebook
Bayesian Optimization for Categorical and Continuous Inputs
- Updated Jul 20, 2020 - Python
A Julia package providing multi-armed bandit experiments
- Updated Jul 19, 2018 - Julia
Implementations of basic concepts covered under the reinforcement learning umbrella. This project is a collection of assignments in CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay.
- Updated May 21, 2018 - Python
A curated list of papers about combinatorial multi-armed bandit problems.
- Updated May 10, 2021
Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs
- Updated Jun 20, 2016 - C++
Author's implementation of the paper Correlated Age-of-Information Bandits.
- Updated Jun 19, 2021 - Python
Optimizing ad selection using reinforcement learning algorithms such as Thompson Sampling and Upper Confidence Bound.
- Updated May 24, 2019 - Jupyter Notebook