#
sft-data
Here are 4 public repositories matching this topic...
ReasonFlux Series - A family of LLM post-training algorithms focusing on data selection, reinforcement learning, and inference scaling
- Updated
Jul 3, 2025 - Python
GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation
- Updated
Jul 8, 2025 - Python
代码大模型 预训练&微调&DPO 数据处理 业界处理pipeline sota
- Updated
Jul 25, 2024 - Python
ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates
- Updated
Mar 23, 2025
Improve this page
Add a description, image, and links to thesft-data topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thesft-data topic, visit your repo's landing page and select "manage topics."