ucb1

Star

Here are 16 public repositories matching this topic...

Language:All

Filter by language

All16 Jupyter Notebook7 Python5 C#1 Go1 JavaScript1 Kotlin1

Sort:Most stars

Sort options

Most stars Fewest stars Most forks Fewest forks Recently updated Least recently updated

alextanhongpin /go-bandit

Sponsor

Star29

Multi-Armed Bandit (MAB) algorithm implementation in go

go ucb1 mulit-arm-bandit greedy-epsilon

UpdatedNov 25, 2019
Go

akshaykhadse /reinforcement-learning

Star17

Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay

reinforcement-learning linear-programming thompson-sampling epsilon-greedy ucb policy-evaluation mdps multi-armed-bandits policy-iteration randomised-algorithms reinforcement-learning-excercises kl-divergence markovian-epidemic-processes reinforcement-learning-analysis multiarm-bandit ucb1 howards-pi batch-switching randomized-policy-iteration

UpdatedMay 21, 2018
Python

viswanath57 /Bandit-Algorithms

Star10

algorithms epsilon-greedy multiarm-bandit softmax-algorithm ucb1

UpdatedApr 5, 2021
Jupyter Notebook

gokhanmeteerturk /adaptive-shots

Star7

Few-shot prompting using Contextual Combinatorial Bandit optimizations

python reinforcement-learning ai contextual-bandits few-shot ucb1

UpdatedDec 19, 2024
Python

HoangTran0410 /Reversi-mcts

Star4

Reversi (Othello) AI game in C#. Using Monte Carlo Tree Search algorithm AND BTMM algorithm.

board-game machine-learning csharp bitboard mcts monte-carlo-tree-search othello-game reversi-game ucb1 othello-ai mcts-algorithm

UpdatedMay 31, 2021
C#

Pegah-Ardehkhani /Reinforcement-Learning-Algorithms-from-Scratch

Star4

Explore key RL algorithms with detailed explanations and fully commented Python code implementations

reinforcement-learning monte-carlo q-learning thompson-sampling epsilon-greedy reinforcement-learning-algorithms sarsa rl policy-iteration value-iteration deep-q-learning reinforcement-learning-agent ucb1 td-lambda reinforcement-learning-environments td-0 optimistic-inital-values iterative-policy-evaluation

UpdatedDec 8, 2024
Jupyter Notebook

kochlisGit /Reinforcement-Learning-Algorithms

Star3

This project focuses on comparing different Reinforcement Learning Algorithms, including monte-carlo, q-learning, lambda q-learning epsilon-greedy variations, etc.

python reinforcement-learning monte-carlo openai-gym q-learning policy rl-agents epsilon-greedy dynamic-programming markov-chains approximation-algorithms ucb1 q-lambda exploration-exploitation thomson-sampling frozen-lake multi-bandit-army

UpdatedFeb 15, 2022
Python

mykeels /multi-armed-bandit-problem

Star3

An implementation of solvers for the multi-armed-bandit-problem in JavaScript.

thompson-sampling epsilon-greedy multi-armed-bandit ucb1

UpdatedApr 25, 2019
JavaScript

leiluk1 /stat-techniques

Star2

This repository is focused on my assignments solutions for the Statistical Techniques for Data Science course at Innopolis University.

statistics thompson-sampling multi-armed-bandit ucb1 dna-replication mrl98-quantile-algo

UpdatedMay 18, 2023
Jupyter Notebook

sanxore /py-mcts

Star1

Python implementation of Monte Carlo Tree Search

mcts uct monte-carlo-tree-search ucb1

UpdatedJan 4, 2020
Python

zzmtsvv /ml_sandbox

Star0

regression calibration gan style-transfer classification mlp self-organizing-map knearest-neighbor-algorithm gradient-boosting variational-autoencoder cyclegan ucb1 spectral-normalization vq-vae cnn-visualization self-normalizing-neural-networks diffusion-models

UpdatedSep 3, 2022
Jupyter Notebook

Nikita-Kudrin /funcorp-bandit

Star0

REST service, that returns content sorted by UCB1 algorithm.(Multi-Armed Bandit algorithm). Spring Boot, Kotlin

kotlin spring-boot ucb1

UpdatedJan 29, 2022
Kotlin

Stepan-Makarenko /Multi-armed-bandit-research

Star0

multi-armed-bandits ucb1 e-greedy

UpdatedDec 17, 2023
Jupyter Notebook

Twice22 /Reinforcement-Learning

Star0

My reports for the reinforcement learning class given at the ENS

reinforcement-learning policy-gradient reinforce policy-iteration value-iteration ucb1

UpdatedJan 16, 2018
Jupyter Notebook

VladMarianCimpeanu /OLA_project

Star0

Reinforcement learning techniques applied to solve pricing problems in e-commerce applications. Final project for "Online learning applications" course (2021-2022)

reinforcement-learning pricing thompson-sampling multi-armed-bandit montecarlo-simulation mab ucb1 online-learning-applications

UpdatedOct 30, 2022
Jupyter Notebook

EmanuelAlogna /Data-Intelligence-Applications

Star0

Pricing and Social Influence Maximization using Reinforcement Learning algorithms in Data Intelligence Applications projects from Politechnic of Milan

reinforcement-learning social-network pricing thompson-sampling reinforcement-learning-algorithms multi-armed-bandit ucb1 social-influence

UpdatedFeb 12, 2020
Python

Improve this page

Add a description, image, and links to theucb1 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with theucb1 topic, visit your repo's landing page and select "manage topics."

Learn more

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ucb1

Here are 16 public repositories matching this topic...

alextanhongpin /go-bandit

akshaykhadse /reinforcement-learning

viswanath57 /Bandit-Algorithms

gokhanmeteerturk /adaptive-shots

HoangTran0410 /Reversi-mcts

Pegah-Ardehkhani /Reinforcement-Learning-Algorithms-from-Scratch

kochlisGit /Reinforcement-Learning-Algorithms

mykeels /multi-armed-bandit-problem

leiluk1 /stat-techniques

sanxore /py-mcts

zzmtsvv /ml_sandbox

Nikita-Kudrin /funcorp-bandit

Stepan-Makarenko /Multi-armed-bandit-research

Twice22 /Reinforcement-Learning

VladMarianCimpeanu /OLA_project

EmanuelAlogna /Data-Intelligence-Applications

Improve this page

Add this topic to your repo