chester256/Model-Compression-PapersPublic

NotificationsYou must be signed in to change notification settings
Fork80
Star399

Papers for deep neural network compression and acceleration

399 stars 80 forks Branches Tags Activity

You must be signed in to change notification settings

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
README.md		README.md

Repository files navigation

Model-Compression-Papers

Papers for neural network compression and acceleration. Partly based onlink.

Survey

Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better, [arXiv '21]
Recent Advances in Efficient Computation of Deep Convolutional Neural Networks, [arxiv '18]
A Survey of Model Compression and Acceleration for Deep Neural Networks [arXiv '17]

Quantization

Pruning

Binarized Neural Network

Low-rank Approximation

Efficient and Accurate Approximations of Nonlinear Convolutional Networks [CVPR'15]
Accelerating Very Deep Convolutional Networks for Classification and Detection (Extended version of above one)
Convolutional neural networks with low-rank regularization [arXiv'15]
Exploiting Linear Structure Within Convolutional Networks for Efficient Evaluation [NIPS'14]
Compression of Deep Convolutional Neural Networks for Fast and Low Power Mobile Applications [ICLR'16]
High performance ultra-low-precision convolutions on mobile devices [NIPS'17]
Speeding up convolutional neural networks with low rank expansions
Coordinating Filters for Faster Deep Neural Networks [ICCV '17]

Knowledge Distillation

Dark knowledge
FitNets: Hints for Thin Deep Nets [ICLR '15]
Net2net: Accelerating learning via knowledge transfer [ICLR '16]
Distilling the Knowledge in a Neural Network [NIPS '15]
MobileID: Face Model Compression by Distilling Knowledge from Neurons [AAAI '16]
DarkRank: Accelerating Deep Metric Learning via Cross Sample Similarities Transfer [arXiv '17]
Deep Model Compression: Distilling Knowledge from Noisy Teachers [arXiv '16]
Paying More Attention to Attention: Improving the Performance of Convolutional Neural Networks via Attention Transfer [ICLR '17]
Like What You Like: Knowledge Distill via Neuron Selectivity Transfer [arXiv '17]
Learning Efficient Object Detection Models with Knowledge Distillation [NIPS '17]
Data-Free Knowledge Distillation For Deep Neural Networks [NIPS '17]
A Gift from Knowledge Distillation: Fast Optimization, Network Minimization and Transfer Learnin [CVPR '17]
Moonshine: Distilling with Cheap Convolutions [arXiv '17]
Model compression via distillation and quantization [ICLR '18]
Apprentice: Using Knowledge Distillation Techniques To Improve Low-Precision Network Accuracy [ICLR '18]

Miscellaneous

About

Papers for deep neural network compression and acceleration

Releases

No releases published

Packages

No packages published

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Model-Compression-Papers

Survey

Quantization

Pruning

Binarized Neural Network

Low-rank Approximation

Knowledge Distillation

Miscellaneous

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Contributors2

Movatterモバイル変換

chester256/Model-Compression-Papers

Folders and files

Latest commit

History

Repository files navigation

Model-Compression-Papers

Survey

Quantization

Pruning

Binarized Neural Network

Low-rank Approximation

Knowledge Distillation

Miscellaneous

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages0

Contributors2

Packages