preference-learning
Here are 43 public repositories matching this topic...
RewardBench: the first evaluation tool for reward models.
- Updated Feb 27, 2025 - Python
Free and open source code of the https://tournesol.app platform. Meet the community on Discord: https://discord.gg/WvcSG55Bf3
- Updated Mar 30, 2025 - Python
Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasoning elevation🍓 and hallucination alleviation🍄.
- Updated Dec 7, 2024 - Jupyter Notebook
The MAGICAL benchmark suite for robust imitation learning (NeurIPS 2020)
- Updated Dec 5, 2023 - Python
This repository contains the source code for our paper: "NaviSTAR: Socially Aware Robot Navigation with Hybrid Spatio-Temporal Graph Transformer and Preference Learning". For more details, please refer to our project website at https://sites.google.com/view/san-navistar.
- Updated Mar 8, 2025 - Python
Python-based GUI for collecting chemists' feedback on molecules
- Updated Oct 15, 2024 - Python
Official implementation of Bootstrapping Language Models via DPO Implicit Rewards
- Updated Jul 29, 2024 - Python
Code for the paper "Aligning LLM Agents by Learning Latent Preference from User Edits".
- Updated Nov 23, 2024 - Python
Official code for the ICML 2024 Spotlight paper "RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences"
- Updated Oct 15, 2024 - Python
A Survey of Direct Preference Optimization (DPO)
- Updated Mar 18, 2025
PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms
- Updated Mar 24, 2025 - Python
Data and models for the paper "Configurable Safety Tuning of Language Models with Synthetic Preference Data"
- Updated Jul 27, 2024 - Python
Code for "Monocular Depth Estimation via Listwise Ranking using the Plackett-Luce Model" as published at CVPR 2021.
- Updated Feb 3, 2024 - Python
This repository contains the source code for our paper: "Feedback-efficient Active Preference Learning for Socially Aware Robot Navigation", accepted to IROS 2022. For more details, please refer to our project website at https://sites.google.com/view/san-fapl.
- Updated Oct 17, 2022 - Python
Preference Learning with Gaussian Processes and Bayesian Optimization
- Updated Aug 10, 2017 - Python
Java framework for Preference Learning
- Updated Mar 5, 2018 - Java
[ICLR 2025 Spotlight] Weak-to-strong preference optimization: stealing reward from weak aligned model
- Updated Feb 24, 2025 - Python
A paper under AAAI-20 review
- Updated Aug 27, 2019 - Python
[P]reference and [R]ule [L]earning algorithm implementation for Python 3 (https://arxiv.org/abs/1812.07895)
- Updated Mar 17, 2019 - Python
Code for the project "Analysis of Recommendation Systems Based on User Preferences".
- Updated Mar 6, 2018 - Python