Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
#

sft

Here are 68 public repositories matching this topic...

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.

  • UpdatedMar 18, 2025
  • Python

Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2.5, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL2, Phi3.5-Vision, GOT-OCR2, ...).

  • UpdatedMar 17, 2025
  • Python

chatglm 6b finetuning and alpaca finetuning

  • UpdatedMar 9, 2025
  • Python

聚宝盆(Cornucopia): 中文金融系列开源可商用大模型,并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)

  • UpdatedJun 30, 2023
  • Python

tensorflow를 사용하여 텍스트 전처리부터, Topic Models, BERT, GPT, LLM과 같은 최신 모델의 다운스트림 태스크들을 정리한 Deep Learning NLP 저장소입니다.

  • UpdatedSep 6, 2024
  • Jupyter Notebook

Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"

  • UpdatedOct 29, 2024
  • Python

An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to inference, as well as some practical experiences and conclusions. (DeepSeek-V3/R1 满血版 671B 全参数微调的开源解决方案,包含从训练到推理的完整代码和脚本,以及实践中积累一些经验和结论。)

  • UpdatedMar 13, 2025
  • Python

Awesome-RAG: Collect typical RAG papers and systems.

  • UpdatedJan 23, 2025

Ethereum Semi Fungible Standard (ERC-1155)

  • UpdatedDec 4, 2024
  • TypeScript

ERC-3525 Reference Implementation

  • UpdatedDec 4, 2023
  • Solidity

This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.

  • UpdatedOct 16, 2024
  • Python

🚀LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training

  • UpdatedDec 3, 2024
  • Python

SEA is an automated paper review framework capable of generating comprehensive and high-quality review feedback with high consistency for papers, thereby assisting researchers in improving the quality of their work.

  • UpdatedNov 25, 2024
  • Python

moss chat finetuning

  • UpdatedApr 23, 2024
  • Python

本项目旨在结合以往研究人员的代表性工作,从多个维度评估sft数据,并自动化过滤sft数据。

  • UpdatedFeb 28, 2024
  • Python

Fine-Tuning Dataset Auto-Generation for Graph Query Languages.

  • UpdatedMar 10, 2025
  • Python

Elven Tools CLI - command line tool for launching NFTs collections on the MultiversX blockchain (Plus other tools).

  • UpdatedJul 14, 2024
  • TypeScript

This repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).

  • UpdatedJul 2, 2024
  • Python

Improve this page

Add a description, image, and links to thesft topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thesft topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2025 Movatter.jp