multi-gpu
Here are 79 public repositories matching this topic...
The Forge Cross-Platform Framework: PC Windows, Steam Deck (native), Ray Tracing, macOS / iOS, Android, Xbox, PS4, PS5, Switch, Quest 2
Updated Apr 11, 2025 - C++
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Updated May 11, 2021 - Python
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
Updated Jan 31, 2025 - Python
Multi-threaded GUI manager for mass creation of AI-generated art with support for multiple GPUs.
Updated Aug 9, 2024 - Python
Face recognition system for ID photos
Updated Oct 17, 2018 - Python
GPU-ready Dockerfile to run the Stability.AI stable-diffusion model v2 with a simple web interface. Includes multi-GPU support.
Updated Jun 21, 2024 - Python
Package for writing high-level code for parallel high-performance stencil computations that can be deployed on both GPUs and CPUs
Updated Apr 2, 2025 - Julia
A PyTorch implementation of the 'FaceNet' paper for training a facial recognition model with Triplet Loss using the glint360k dataset. A pre-trained model using Triplet Loss is available for download.
Updated Sep 16, 2021 - Python
Distributed tensors and Machine Learning framework with GPU and MPI acceleration in Python
Updated Apr 28, 2025 - Python
Code for training py-faster-rcnn and py-R-FCN on multiple GPUs in Caffe
Updated Jun 6, 2017 - Jupyter Notebook
Chains stable-diffusion-webui instances together to facilitate faster image generation.
Updated Feb 24, 2025 - Python
Almost trivial distributed parallelization of stencil-based GPU and CPU applications on a regular staggered grid
Updated Apr 28, 2025 - Julia
The world's first CUDA implementation of Weakly-Compressible Smoothed Particle Hydrodynamics
Updated Jan 28, 2024 - C++
Multi-GPU pre-training of BERT on a single machine without Horovod (data parallelism)
Updated Mar 15, 2025 - Python
Efficient and scalable physics-informed deep learning and scientific machine learning on top of TensorFlow for multi-worker distributed computing
Updated Mar 1, 2022 - Python
Multi-device OpenCL kernel load balancer and pipeliner API for C#. Uses a shared-distributed memory model to keep GPUs updated quickly while running the same kernel on all devices (for simplicity).
Updated Jul 7, 2022 - C#
A dual-GPU DEM solver with complex grain geometry support
Updated Apr 24, 2025 - C++