model-compression

Star

Here are 339 public repositories matching this topic...

Language:All

Filter by language

All339 Python226 Jupyter Notebook52 JavaScript3 C2 C++2 HTML2 Makefile2 C#1 Go1 Java1

Sort:Most stars

Sort options

Most stars Fewest stars Most forks Fewest forks Recently updated Least recently updated

microsoft /nni

Star14.3k

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

python data-science machine-learning deep-learning neural-network tensorflow machine-learning-algorithms pytorch distributed hyperparameter-optimization feature-engineering nas bayesian-optimization hyperparameter-tuning automl automated-machine-learning model-compression neural-architecture-search deep-neural-network mlops

UpdatedJul 3, 2024
Python

huawei-noah /Efficient-AI-Backbones

Star4.4k

Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.

tensorflow pytorch transformer imagenet convolutional-neural-networks pretrained-models model-compression efficient-inference ghostnet vision-transformer

UpdatedMar 15, 2025
Python

dkozlov /awesome-knowledge-distillation

Star3.8k

Awesome Knowledge Distillation

deep-learning knowledge-distillation teacher-student knowledge-transfer co-training model-compression distillation kd knowldge-distillation distillation-model model-distillation

UpdatedOct 23, 2025

VainF /Torch-Pruning

Star3.2k

[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.

transformers vision pruning model-compression efficient-deep-learning llm

UpdatedSep 7, 2025
Python

huawei-noah /Pretrained-Language-Model

Star3.2k

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

pretrained-models quantization knowledge-distillation model-compression large-scale-distributed

UpdatedJan 22, 2024
Python

Tencent /PocketFlow

Star2.9k

An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.

computer-vision deep-learning mobile-app automl model-compression

UpdatedMar 31, 2023
Python

FLHonker /Awesome-Knowledge-Distillation

Star2.6k

Awesome Knowledge-Distillation. 分类整理的知识蒸馏paper(2014-2021)。

deep-learning transfer-learning model-compression distillation kd knowldge-distillation

UpdatedMay 30, 2023

he-y /Awesome-Pruning

Star2.5k

A curated list of neural network pruning resources.

awesome-list pruning model-compression model-acceleration

UpdatedApr 4, 2024

Efficient-ML /Awesome-Model-Quantization

Star2.3k

A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.

awesome deep-learning quantization model-compression model-acceleration binary-network binarized-neural-networks lightweight-neural-network model-quantization efficient-deep-learning

UpdatedMar 4, 2025

666DZY666 /micronet

Star2.3k

micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference)、Low-Bit(≤2b)/Ternary and Binary(TWN/BNN/XNOR-Net); post-training-quantization(PTQ), 8-bit(tensorrt); 2、 pruning: normal、reg…

pytorch pruning convolutional-networks quantization xnor-net tensorrt model-compression bnn neuromorphic-computing group-convolution onnx network-in-network tensorrt-int8-python dorefa twn network-slimming integer-arithmetic-only quantization-aware-training post-training-quantization batch-normalization-fuse

UpdatedMay 6, 2025
Python

haitongli /knowledge-distillation-pytorch

Star2k

A PyTorch implementation for exploring deep and shallow knowledge distillation (KD) experiments with flexibility

deep-neural-networks computer-vision pytorch knowledge-distillation cifar10 dark-knowledge model-compression

UpdatedMar 25, 2023
Python

AberHu /Knowledge-Distillation-Zoo

Star1.7k

Pytorch implementation of various Knowledge Distillation (KD) methods.

knowledge-distillation teacher-student knowledge-transfer model-compression distillation kd kd-methods

UpdatedNov 25, 2021
Python

tensorflow /model-optimization

Star1.6k

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.

machine-learning sparsity compression deep-learning tensorflow optimization keras ml pruning quantization model-compression quantized-training quantized-neural-networks quantized-networks

UpdatedDec 15, 2025
Python

microsoft /NeuronBlocks

Star1.5k

NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego

natural-language-processing deep-learning text-classification dnn pytorch artificial-intelligence question-answering knowledge-distillation sequence-labeling text-matching qna model-compression

UpdatedJul 22, 2023
Python

huawei-noah /Efficient-Computing

Star1.3k

Efficient computing methods developed by Huawei Noah's Ark Lab

pruning quantization knowledge-distillation model-compression self-supervised binary-neural-networks

UpdatedNov 5, 2024
Jupyter Notebook

ethanhe42 /channel-pruning

Star1.1k

Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)

deep-neural-networks acceleration image-classification image-recognition object-detection model-compression channel-pruning

UpdatedMay 2, 2024
Python

MingSun-Tse /Efficient-Deep-Learning

Star953

Collection of recent methods on (deep) neural network compression and acceleration.

deep-neural-networks deep-learning knowledge-distillation model-compression network-pruning efficient-deep-learning

UpdatedApr 4, 2025

horseee /DeepCache

Star949

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

model-compression efficient-inference diffusion-models stable-diffusion training-free

UpdatedJun 27, 2024
Python

alibaba /TinyNeuralNetwork

Star861

TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.

deep-neural-networks deep-learning pytorch pruning model-compression model-converter quantization-aware-training post-training-quantization

UpdatedAug 21, 2025
Python

guan-yuan /Awesome-AutoML-and-Lightweight-Models

Star856

A list of high-quality (newest) AutoML works and lightweight models including 1.) Neural Architecture Search, 2.) Lightweight Structures, 3.) Model Compression, Quantization and Acceleration, 4.) Hyperparameter Optimization, 5.) Automated Feature Engineering.

tensorflow pytorch hyperparameter-optimization awesome-list quantization nas automl model-compression neural-architecture-search meta-learning architecture-search quantized-training model-acceleration automated-feature-engineering quantized-neural-network

UpdatedJun 19, 2021

Improve this page

Add a description, image, and links to themodel-compression topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with themodel-compression topic, visit your repo's landing page and select "manage topics."

Learn more

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly