Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
#

ucb1

Here are 16 public repositories matching this topic...

Multi-Armed Bandit (MAB) algorithm implementation in go

  • UpdatedNov 25, 2019
  • Go

Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay

  • UpdatedMay 21, 2018
  • Python

Few-shot prompting using Contextual Combinatorial Bandit optimizations

  • UpdatedDec 19, 2024
  • Python

Reversi (Othello) AI game in C#. Using Monte Carlo Tree Search algorithm AND BTMM algorithm.

  • UpdatedMay 31, 2021
  • C#

This project focuses on comparing different Reinforcement Learning Algorithms, including monte-carlo, q-learning, lambda q-learning epsilon-greedy variations, etc.

  • UpdatedFeb 15, 2022
  • Python

An implementation of solvers for the multi-armed-bandit-problem in JavaScript.

  • UpdatedApr 25, 2019
  • JavaScript

This repository is focused on my assignments solutions for the Statistical Techniques for Data Science course at Innopolis University.

  • UpdatedMay 18, 2023
  • Jupyter Notebook

Python implementation of Monte Carlo Tree Search

  • UpdatedJan 4, 2020
  • Python

REST service, that returns content sorted by UCB1 algorithm.(Multi-Armed Bandit algorithm). Spring Boot, Kotlin

  • UpdatedJan 29, 2022
  • Kotlin
  • UpdatedDec 17, 2023
  • Jupyter Notebook

My reports for the reinforcement learning class given at the ENS

  • UpdatedJan 16, 2018
  • Jupyter Notebook

Reinforcement learning techniques applied to solve pricing problems in e-commerce applications. Final project for "Online learning applications" course (2021-2022)

  • UpdatedOct 30, 2022
  • Jupyter Notebook

Pricing and Social Influence Maximization using Reinforcement Learning algorithms in Data Intelligence Applications projects from Politechnic of Milan

  • UpdatedFeb 12, 2020
  • Python

Improve this page

Add a description, image, and links to theucb1 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with theucb1 topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2025 Movatter.jp