# lm-alignment
Here is 1 public repository matching this topic...
Notus is a collection of LLMs fine-tuned using SFT, DPO, SFT+DPO, and/or other RLHF techniques, always with a data-first approach.
- Updated Jan 15, 2024 (Python)