#preference-learning

Here are 43 public repositories matching this topic...

RewardBench: the first evaluation tool for reward models.

  • Updated Feb 27, 2025
  • Python

Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasoning elevation🍓 and hallucination alleviation🍄.

  • Updated Dec 7, 2024
  • Jupyter Notebook

The MAGICAL benchmark suite for robust imitation learning (NeurIPS 2020)

  • Updated Dec 5, 2023
  • Python

This repository contains the source code for our paper: "NaviSTAR: Socially Aware Robot Navigation with Hybrid Spatio-Temporal Graph Transformer and Preference Learning". For more details, please refer to our project website at https://sites.google.com/view/san-navistar.

  • Updated Mar 8, 2025
  • Python
metis

Python-based GUI for collecting chemists' feedback on molecules

  • Updated Oct 15, 2024
  • Python

Official implementation of Bootstrapping Language Models via DPO Implicit Rewards

  • Updated Jul 29, 2024
  • Python

Code for the paper "Aligning LLM Agents by Learning Latent Preference from User Edits".

  • Updated Nov 23, 2024
  • Python

Official code for the ICML 2024 Spotlight paper "RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences"

  • Updated Oct 15, 2024
  • Python

PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms

  • Updated Mar 24, 2025
  • Python

Data and models for the paper "Configurable Safety Tuning of Language Models with Synthetic Preference Data"

  • Updated Jul 27, 2024
  • Python

Code for "Monocular Depth Estimation via Listwise Ranking using the Plackett-Luce Model" as published at CVPR 2021.

  • Updated Feb 3, 2024
  • Python

This repository contains the source code for our paper: "Feedback-efficient Active Preference Learning for Socially Aware Robot Navigation", accepted to IROS 2022. For more details, please refer to our project website at https://sites.google.com/view/san-fapl.

  • Updated Oct 17, 2022
  • Python

Preference Learning with Gaussian Processes and Bayesian Optimization

  • Updated Aug 10, 2017
  • Python

[ICLR 2025 Spotlight] Weak-to-strong preference optimization: stealing reward from weak aligned model

  • Updated Feb 24, 2025
  • Python
GAN-Assisted-Preference-Based-Learning

A paper under AAAI-20 review

  • Updated Aug 27, 2019
  • Python

[P]reference and [R]ule [L]earning algorithm implementation for Python 3 (https://arxiv.org/abs/1812.07895)

  • Updated Mar 17, 2019
  • Python

Code for the project "Analysis of Recommendation Systems Based on User Preferences".

  • Updated Mar 6, 2018
  • Python
