Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
#

reward

Here are 126 public repositories matching this topic...

awesome-deep-rl

Playing Mario with Deep Reinforcement Learning

  • UpdatedMay 26, 2016
  • Lua

🍟 Vexo is a Hexo theme inspired by Vue's official website.

  • UpdatedNov 22, 2022
  • JavaScript

An amazing lottery app created for the world

  • UpdatedJan 14, 2020
  • HTML
  • UpdatedDec 7, 2022
  • JavaScript

jPasskit is an Java™ implementation of the Apple™ PassKit Web Service.

  • UpdatedJul 18, 2024
  • Java

Ecency Mobile - reimagined social blogging, contribute and get rewarded (for Android and iOS)

  • UpdatedMar 15, 2025
  • TypeScript
BingRewards

对ChatGLM直接使用RLHF提升或降低目标输出概率|Modify ChatGLM output with only RLHF

  • UpdatedMay 23, 2023
  • Python

💰 Warframe Drop Data in an easier to parse format.

  • UpdatedFeb 23, 2025
  • JavaScript

This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.

  • UpdatedOct 16, 2024
  • Python

The primary development repository for the Bulwark project

  • UpdatedMay 13, 2020
  • C++
TwitchSpawn

👾 TwitchSpawn is a Minecraft mod, which is designed for Twitch streamers using 3rd party streaming tools! (comes with its own language!)

  • UpdatedAug 4, 2024
  • Java

📈📉Power Index is an ecosystem product of PowerPool. The main feature of Power Index is a possibility to create special pools with unique governance and design.

  • UpdatedMay 28, 2024
  • JavaScript

超好看的打赏功能~ 演示地址

  • UpdatedFeb 6, 2024
  • CSS

During our participation in the Internship Exchange Program, my friend and I collaborated with the guidance of our esteemed supervisor from NTHU.

  • UpdatedMay 18, 2024
  • Python

chatglm_rlhf_finetuning

  • UpdatedOct 10, 2023
  • Python

A P2P Delegated Proof of Work solution for Nano cryptocurrency

  • UpdatedMay 22, 2023
  • JavaScript

realize the reinforcement learning training for gpt2 llama bloom and so on llm model

  • UpdatedSep 19, 2023
  • Python

Improve this page

Add a description, image, and links to thereward topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thereward topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2025 Movatter.jp