# reward

Here are 126 public repositories matching this topic...

tigerneil/awesome-deep-rl

For deep RL and the future of AI.

game reinforcement-learning deep-reinforcement-learning agi planning artificial-general-intelligence theoretical-computer-science reward aaai ijcai hierarchical-reinforcement-learning iclr icml distributional multiagent-reinforcement-learning aamas exploration-exploitation inverse-rl aistats uai

Updated Mar 1, 2024
HTML

aleju/mario-ai

Playing Mario with Deep Reinforcement Learning

agent mario machine-learning deep-learning deep-reinforcement-learning torch reward

Updated May 26, 2016
Lua

yanm1ng/hexo-theme-vexo

🍟 Vexo is a Hexo theme inspired by Vue's official website.

theme vuejs vue hexo stylus ejs hexo-theme reward hexo-blog hexoblog

Updated Nov 22, 2022
JavaScript

henry-fun/hanshan-lottery

An amazing lottery app created for the world

html5 lottery reward

Updated Jan 14, 2020
HTML

greedying/tctip

javascript tips reward tctip

Updated Dec 7, 2022
JavaScript

drallgood/jpasskit

jPasskit is a Java™ implementation of the Apple™ PassKit Web Service.

java webservice apple generator server wallet reward passkit boardingpass

Updated Jul 18, 2024
Java

ecency/ecency-mobile

Ecency Mobile - reimagined social blogging, contribute and get rewarded (for Android and iOS)

react android ios social-media mobile react-native crypto hive blockchain esteem reward rewarding epoint hiveio ecency

Updated Mar 15, 2025
TypeScript

Prem-ium/BingRewards

🤖 Automate Bing Searches 🔍, Quizzes 🧪, Polls 📝, & more for Bing Rewards. 💸

python bot docker automation bing script docker-container proxy selenium rewards reward gift-cards giftcards proxy-checker bing-search passive-income webautomation reward-points multiple-accounts bing-rewards

Updated Dec 4, 2024
Python

Miraclemarvel55/ChatGLM-RLHF

Apply RLHF directly to ChatGLM to raise or lower the probability of target outputs | Modify ChatGLM output with only RLHF

custom similarity reward nickname ppo rlhf chatglm

Updated May 23, 2023
Python

alison-carrera/mabalgs

👤 Multi-Armed Bandit Algorithms Library (MAB) 👮

arm algorithm reinforcement-learning simulation monte-carlo rank thompson-sampling reinforcement-learning-algorithms ucb reward multi-armed-bandit montecarlo-simulation contextual-bandits ranking-algorithm mab ranked-mab

Updated Sep 6, 2022
Python

WFCD/warframe-drop-data

💰 Warframe Drop Data in an easier to parse format.

game data mod warframe relic enemies play-game reward drop-data drop-sorting

Updated Feb 23, 2025
JavaScript

NiuTrans/Vision-LLM-Alignment

This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.

vision alignment multi-model reward ppo sft dpo llm rlhf mllm llava llama3-vision

Updated Oct 16, 2024
Python

bulwark-crypto/Bulwark

The primary development repository for the Bulwark project

qt privacy bitcoin tor cryptocurrency proof-of-stake reward proof-of-work masternodes bulwark

Updated May 13, 2020
C++

iGoodie/TwitchSpawn

👾 TwitchSpawn is a Minecraft mod, which is designed for Twitch streamers using 3rd party streaming tools! (comes with its own language!)

game minecraft dsl mod streamer minecraft-mod donations streamlabs reward twitch-streamers tsl twitch-nicks multiple-streamers

Updated Aug 4, 2024
Java

powerpool-finance/powerindex

📈📉 Power Index is an ecosystem product of PowerPool. Its main feature is the ability to create special pools with unique governance and design.

ethereum solidity reward governance cvp defi liquidity-providers powerpool

Updated May 28, 2024
JavaScript

ihoey/Playing-reward

A great-looking tipping (reward) feature ~ Demo link

demo wechat qq reward wechatpay

Updated Feb 6, 2024
CSS

khinthandarkyaw98/Optimizing-UAV-trajectory-for-maximum-data-rate-via-Q-Learning

During our participation in the Internship Exchange Program, my friend and I collaborated with the guidance of our esteemed supervisor from NTHU.

reinforcement-learning uav reward data-rate uav-trajectory

Updated May 18, 2024
Python

ssbuild/chatglm_rlhf

chatglm_rlhf_finetuning

chat lora reward finetuning rlhf chatglm qlora

Updated Oct 10, 2023
Python

anarkrypto/P2PoW

A P2P Delegated Proof of Work solution for Nano cryptocurrency

api client website demo library web serverless worker proof p2p nano mining miner pow reward delegated trustless worker-api nano-cryptocurrency dpow

Updated May 22, 2023
JavaScript

ssbuild/llm_rlhf

Reinforcement learning training for LLMs such as GPT-2, LLaMA, BLOOM, and others

lora reward trl llm rlhf trlx llm-rlhf

Updated Sep 19, 2023
Python
