pipeline-parallelism
Here are 28 public repositories matching this topic...
Language:All
Sort:Most stars
Making large AI models cheaper, faster and more accessible
- Updated
Dec 8, 2025 - Python
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
- Updated
Dec 17, 2025 - Python
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
- Updated
Sep 7, 2024 - Python
A GPipe implementation in PyTorch
- Updated
Jul 25, 2024 - Python
Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train your own 8B/14B LLaVA-training-like MLLM on RTX3090/4090 24GB.
- Updated
Mar 10, 2025 - Jupyter Notebook
飞桨大模型开发套件,提供大语言模型、跨模态大模型、生物计算大模型等领域的全流程开发工具链。
- Updated
May 24, 2024 - Python
InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.
- Updated
Aug 21, 2025 - Python
LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training
- Updated
Jul 31, 2025 - Python
Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.
- Updated
Mar 31, 2023 - Python
A curated list of awesome projects and papers for distributed training or inference
- Updated
Oct 8, 2024
Serving Inside Pytorch
- Updated
Dec 11, 2025 - Python
Decentralized LLMs fine-tuning and inference with offloading
- Updated
Dec 15, 2025 - Python
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
- Updated
Dec 14, 2023 - Python
An Efficient Pipelined Data Parallel Approach for Training Large Model
- Updated
Dec 11, 2020 - Python
Chimera: bidirectional pipeline parallelism for efficiently training large-scale models.
- Updated
Mar 20, 2025 - Python
gLLM: Global Balanced Pipeline Parallelism System for Distributed LLM Serving with Token Throttling
- Updated
Dec 18, 2025 - Python
A Throughput-Optimized Pipeline Parallel Inference System for Large Language Models
- Updated
Aug 5, 2025 - Python
FTPipe and related pipeline model parallelism research.
- Updated
May 16, 2023 - Python
Implementation of autoregressive language model using improved Transformer and DeepSpeed pipeline parallelism.
- Updated
Jan 2, 2022 - Python
Official implementation of DynPartition: Automatic Optimal Pipeline Parallelism of Dynamic Neural Networks over Heterogeneous GPU Systems for Inference Tasks
- Updated
May 5, 2023 - Python
Improve this page
Add a description, image, and links to thepipeline-parallelism topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thepipeline-parallelism topic, visit your repo's landing page and select "manage topics."