sft
Here are 68 public repositories matching this topic...
Language:All
Sort:Most stars
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.
- Updated
Mar 18, 2025 - Python
Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).
- Updated
Mar 17, 2025 - Python
chatglm 6b finetuning and alpaca finetuning
- Updated
Mar 9, 2025 - Python
聚宝盆(Cornucopia): 中文金融系列开源可商用大模型,并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)
- Updated
Jun 30, 2023 - Python
tensorflow를 사용하여 텍스트 전처리부터, Topic Models, BERT, GPT, LLM과 같은 최신 모델의 다운스트림 태스크들을 정리한 Deep Learning NLP 저장소입니다.
- Updated
Sep 6, 2024 - Jupyter Notebook
Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"
- Updated
Oct 29, 2024 - Python
An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to inference, as well as some practical experiences and conclusions. (DeepSeek-V3/R1 满血版 671B 全参数微调的开源解决方案,包含从训练到推理的完整代码和脚本,以及实践中积累一些经验和结论。)
- Updated
Mar 13, 2025 - Python
Ethereum Semi Fungible Standard (ERC-1155)
- Updated
Dec 4, 2024 - TypeScript
🚀LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
- Updated
Dec 3, 2024 - Python
SEA is an automated paper review framework capable of generating comprehensive and high-quality review feedback with high consistency for papers, thereby assisting researchers in improving the quality of their work.
- Updated
Nov 25, 2024 - Python
本项目旨在结合以往研究人员的代表性工作,从多个维度评估sft数据,并自动化过滤sft数据。
- Updated
Feb 28, 2024 - Python
Fine-Tuning Dataset Auto-Generation for Graph Query Languages.
- Updated
Mar 10, 2025 - Python
Elven Tools CLI - command line tool for launching NFTs collections on the MultiversX blockchain (Plus other tools).
- Updated
Jul 14, 2024 - TypeScript
LDPC MATLAB simulation using BPSK + AWGN modulation decoded using Sum Product and Min Sum Algorithm
- Updated
Jun 4, 2024 - MATLAB
Improve this page
Add a description, image, and links to thesft topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thesft topic, visit your repo's landing page and select "manage topics."