pruning
Here are 492 public repositories matching this topic...
Language:All
Sort:Most stars
《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
- Updated
Apr 3, 2025 - Jupyter Notebook
Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research.https://intellabs.github.io/distiller
- Updated
Apr 24, 2023 - Jupyter Notebook
Sparsity-aware deep learning inference runtime for CPUs
- Updated
Jul 19, 2024 - Python
[CVPR 2023] DepGraph: Towards Any Structural Pruning
- Updated
Apr 25, 2025 - Python
A curated list of neural network pruning resources.
- Updated
Apr 4, 2024
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
- Updated
Apr 25, 2025 - Python
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
- Updated
Apr 24, 2025 - Python
micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference)、Low-Bit(≤2b)/Ternary and Binary(TWN/BNN/XNOR-Net); post-training-quantization(PTQ), 8-bit(tensorrt); 2、 pruning: normal、reg…
- Updated
Apr 16, 2025 - Python
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
- Updated
Aug 1, 2024 - Python
PaddleSlim is an open-source library for deep model compression and architecture search.
- Updated
Dec 4, 2024 - Python
Practical course about Large Language Models.
- Updated
Apr 22, 2025 - Jupyter Notebook
OpenMMLab Model Compression Toolbox and Benchmark.
- Updated
Jun 11, 2024 - Python
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
- Updated
Feb 10, 2025 - Python
Config driven, easy backup cli for restic.
- Updated
Mar 31, 2025 - Go
Efficient computing methods developed by Huawei Noah's Ark Lab
- Updated
Nov 5, 2024 - Jupyter Notebook
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.
- Updated
Oct 7, 2024 - Python
Neural Network Compression Framework for enhanced OpenVINO™ inference
- Updated
Apr 25, 2025 - Python
PyTorch Implementation of [1611.06440] Pruning Convolutional Neural Networks for Resource Efficient Inference
- Updated
Jul 12, 2019 - Python
mobilev2-yolov5s剪枝、蒸馏,支持ncnn,tensorRT部署。ultra-light but better performence!
- Updated
Sep 26, 2022 - Jupyter Notebook
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
- Updated
Mar 12, 2025 - Python
Improve this page
Add a description, image, and links to thepruning topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thepruning topic, visit your repo's landing page and select "manage topics."