Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

distributed-training

Here are 209 public repositories matching this topic...

Made-With-ML

Learn how to design, develop, deploy and iterate on production-grade ML applications.

  • UpdatedAug 18, 2024
  • Jupyter Notebook

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

  • UpdatedOct 12, 2025
  • Python

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

  • UpdatedOct 13, 2025
  • C++
PaddleNLPmetaflow

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 17+ clouds, or on-prem).

  • UpdatedOct 13, 2025
  • Python
Fengshenbang-LM

Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。

  • UpdatedAug 13, 2024
  • Python

FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, TensorOpera AI (https://TensorOpera.ai) is your generative AI platform at scale.

  • UpdatedAug 11, 2025
  • Python

A high performance and generic framework for distributed DNN training

  • UpdatedOct 3, 2023
  • Python

Fast and flexible AutoML with learning guarantees.

  • UpdatedNov 30, 2023
  • Jupyter Notebook
determined

Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.

  • UpdatedMar 20, 2025
  • Go

Training and serving large-scale neural networks with auto parallelization.

  • UpdatedDec 9, 2023
  • Python

Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world.

  • UpdatedOct 12, 2025
  • Python

DLRover: An Automatic Distributed Deep Learning System

  • UpdatedSep 29, 2025
  • Python

Collective communications library with various primitives for multi-machine training.

  • UpdatedSep 12, 2025
  • C++

Library for Fast and Flexible Human Pose Estimation

  • UpdatedMar 25, 2023
  • Python

DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted in incubation in LF AI & Data Foundation.

  • UpdatedJan 21, 2025
  • C++

Efficient Deep Learning Systems course materials (HSE, YSDA)

  • UpdatedApr 23, 2025
  • Jupyter Notebook

Best practice for training LLaMA models in Megatron-LM

  • UpdatedJan 2, 2024
  • Python

🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

  • UpdatedOct 2, 2025
  • Python

Improve this page

Add a description, image, and links to thedistributed-training topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thedistributed-training topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2025 Movatter.jp