Original author(s) | Microsoft Research |
---|---|
Developer(s) | Microsoft |
Initial release | May 18, 2020 |
Stable release | v0.16.2 / December 18, 2024 |
Repository | github |
Written in | Python, CUDA, C++ |
Type | Software library |
License | Apache License 2.0 |
Website | deepspeed |
DeepSpeed is an open source deep learning optimization library for PyTorch.[1]
The library is designed to reduce computing power and memory use and to train large distributed models with better parallelism on existing computer hardware.[2][3] DeepSpeed is optimized for low-latency, high-throughput training. It includes the Zero Redundancy Optimizer (ZeRO) for training models with 1 trillion or more parameters.[4] Features include mixed precision training; single-GPU, multi-GPU, and multi-node training; and custom model parallelism. The DeepSpeed source code is licensed under the Apache License 2.0 and available on GitHub.[5]
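As a rough illustration of how these features are exposed, the sketch below wraps a small PyTorch model with `deepspeed.initialize`, enabling ZeRO stage 2 and fp16 mixed precision through a configuration dictionary. The model, layer sizes, batch size, and optimizer settings are placeholder values chosen for the example (not recommended settings from the DeepSpeed documentation), and a CUDA-capable GPU is assumed.

```python
import torch
import deepspeed

# Illustrative DeepSpeed configuration (placeholder values, not tuned settings):
# ZeRO stage 2 partitions optimizer states and gradients across GPUs,
# and the "fp16" section enables mixed precision training.
ds_config = {
    "train_batch_size": 32,
    "optimizer": {"type": "Adam", "params": {"lr": 1e-4}},
    "fp16": {"enabled": True},
    "zero_optimization": {"stage": 2},
}

# A toy model standing in for a large network.
model = torch.nn.Sequential(
    torch.nn.Linear(1024, 4096),
    torch.nn.ReLU(),
    torch.nn.Linear(4096, 1024),
)

# deepspeed.initialize wraps the model in an engine that manages mixed
# precision, ZeRO partitioning, and data-parallel communication.
model_engine, optimizer, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config=ds_config,
)

# One training step: the engine's backward() and step() replace the usual
# loss.backward() / optimizer.step() calls of plain PyTorch.
inputs = torch.randn(32, 1024, device=model_engine.device, dtype=torch.half)
loss = model_engine(inputs).float().pow(2).mean()
model_engine.backward(loss)
model_engine.step()
```

In multi-GPU or multi-node settings, a script like this is typically started with the `deepspeed` launcher, which spawns one process per GPU and sets up the distributed environment before `deepspeed.initialize` is called.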
The team claimed to achieve up to a 6.2x throughput improvement, 2.8x faster convergence, and 4.6x less communication.[6]