slurm
Here are 676 public repositories matching this topic...
Language:All
Sort:Most stars
Machine Learning Engineering Open Book
- Updated
Mar 9, 2025 - Python
Slurm: A Highly Scalable Workload Manager
- Updated
Mar 14, 2025 - C
A DSL for data-driven computational pipelines
- Updated
Mar 16, 2025 - Groovy
A scalable, efficient, cross-platform (Linux/macOS) and easy-to-use workflow engine in pure Python.
- Updated
Mar 15, 2025 - Python
Best practices & guides on how to write distributed pytorch training code
- Updated
Feb 24, 2025 - Python
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and support for E2E production ML pipelines when you're ready.
- Updated
Mar 17, 2025 - Python
A Slurm cluster using docker-compose
- Updated
Sep 27, 2024 - Dockerfile
Lightweight fast function pipeline (DAG) creation in pure Python for scientific workflows 🕸️🧪
- Updated
Mar 14, 2025 - Python
A scheduler for GPU/CPU tasks
- Updated
Mar 6, 2024 - C
Simplify HPC and Batch workloads on Azure
- Updated
Mar 20, 2023 - Python
Prometheus exporter for performance metrics from Slurm.
- Updated
Jun 20, 2024 - Go
An open-source toolkit for deploying and managing high performance clusters for HPC, AI, and data analytics workloads.
- Updated
Mar 17, 2025 - YAML
Run Slurm in Kubernetes
- Updated
Mar 14, 2025 - Go
SEML: Slurm Experiment Management Library
- Updated
Nov 7, 2024 - Python
Tools for computation on batch systems
- Updated
Jan 9, 2024 - R
Improve this page
Add a description, image, and links to theslurm topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with theslurm topic, visit your repo's landing page and select "manage topics."