sparsity
Here are 137 public repositories matching this topic...
Language:All
Sort:Most stars
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
- Updated
Apr 25, 2025 - Python
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
- Updated
Aug 1, 2024 - Python
PyTorch native quantization and sparsity for training and inference
- Updated
Apr 26, 2025 - Python
PaddleSlim is an open-source library for deep model compression and architecture search.
- Updated
Dec 4, 2024 - Python
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
- Updated
Feb 10, 2025 - Python
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
- Updated
Apr 25, 2025 - Python
Neural Network Compression Framework for enhanced OpenVINO™ inference
- Updated
Apr 25, 2025 - Python
Network Slimming (Pytorch) (ICCV 2017)
- Updated
Nov 6, 2020 - Python
More readable and flexible yolov5 with more backbone(gcn, resnet, shufflenet, moblienet, efficientnet, hrnet, swin-transformer, etc) and (cbam,dcn and so on), and tensorrt
- Updated
Aug 19, 2024 - Python
[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
- Updated
Aug 1, 2024 - Python
Caffe for Sparse and Low-rank Deep Neural Networks
- Updated
Mar 8, 2020 - C++
Reference ImageNet implementation of SelecSLS CNN architecture proposed in the SIGGRAPH 2020 paper "XNect: Real-time Multi-Person 3D Motion Capture with a Single RGB Camera". The repository also includes code for pruning the model based on implicit sparsity emerging from adaptive gradient descent methods, as detailed in the CVPR 2019 paper "On i…
- Updated
Jul 23, 2020 - Python
Sparse Optimisation Research Code
- Updated
Jan 17, 2025 - Python
Always sparse. Never dense. But never say never. A Sparse Training repository for the Adaptive Sparse Connectivity concept and its algorithmic instantiation, i.e. Sparse Evolutionary Training, to boost Deep Learning scalability on various aspects (e.g. memory and computational time efficiency, representation and generalization power).
- Updated
Jul 21, 2021 - Python
[CVPR 2021] Exploring Sparsity in Image Super-Resolution for Efficient Inference
- Updated
Oct 18, 2021 - Python
Sparse and structured neural attention mechanisms
- Updated
Aug 31, 2020 - Python
Learning both Weights and Connections for Efficient Neural Networkshttps://arxiv.org/abs/1506.02626
- Updated
Nov 10, 2022 - Jupyter Notebook
A research library for pytorch-based neural network pruning, compression, and more.
- Updated
Nov 28, 2022 - Shell
Improve this page
Add a description, image, and links to thesparsity topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thesparsity topic, visit your repo's landing page and select "manage topics."