data-parallelism
Here are 54 public repositories matching this topic...
Language:All
Sort:Most stars
Making large AI models cheaper, faster and more accessible
- Updated
Mar 27, 2025 - Python
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
- Updated
Mar 27, 2025 - Python
Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
- Updated
Jul 25, 2018 - Python
A state-of-the-art multithreading runtime: message-passing based, fast, scalable, ultra-low overhead
- Updated
Jun 29, 2024 - Nim
飞桨大模型开发套件,提供大语言模型、跨模态大模型、生物计算大模型等领域的全流程开发工具链。
- Updated
May 24, 2024 - Python
LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training
- Updated
Mar 21, 2025 - Python
Easy Parallel Library (EPL) is a general and efficient deep learning framework for distributed model training.
- Updated
Mar 31, 2023 - Python
Distributed Keras Engine, Make Keras faster with only one line of code.
- Updated
Oct 3, 2019 - Python
Ternary Gradients to Reduce Communication in Distributed Deep Learning (TensorFlow)
- Updated
Nov 19, 2018 - Python
Orkhon: ML Inference Framework and Server Runtime
- Updated
Feb 1, 2021 - Rust
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
- Updated
Dec 14, 2023 - Python
Distributed training (multi-node) of a Transformer model
- Updated
Apr 10, 2024 - Python
SC23 Deep Learning at Scale Tutorial Material
- Updated
Sep 16, 2024 - Python
Deep Learning at Scale Training Event at NERSC
- Updated
Mar 3, 2025 - Python
WIP. Veloce is a low-code Ray-based parallelization library that makes machine learning computation novel, efficient, and heterogeneous.
- Updated
Aug 4, 2022 - Python
♨️ Optimized Gaussian blur filter on CPU.
- Updated
Dec 12, 2017 - C++
This repository provides hands-on labs on PyTorch-based Distributed Training and SageMaker Distributed Training. It is written to make it easy for beginners to get started, and guides you through step-by-step modifications to the code based on the most basic BERT use cases.
- Updated
Jul 18, 2023 - Jupyter Notebook
☕Implement of Parallel Matrix Multiplication Methods Using FOX Algorithm on Peking University's High-performance Computing System
- Updated
Jan 28, 2019 - C
Fast and easy distributed model training examples.
- Updated
Nov 26, 2024 - Python
Improve this page
Add a description, image, and links to thedata-parallelism topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thedata-parallelism topic, visit your repo's landing page and select "manage topics."