Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
#

pruning

Here are 492 public repositories matching this topic...

deepsparse

A curated list of neural network pruning resources.

  • UpdatedApr 4, 2024

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

  • UpdatedApr 25, 2025
  • Python

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

  • UpdatedApr 24, 2025
  • Python

micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference)、Low-Bit(≤2b)/Ternary and Binary(TWN/BNN/XNOR-Net); post-training-quantization(PTQ), 8-bit(tensorrt); 2、 pruning: normal、reg…

  • UpdatedApr 16, 2025
  • Python
sparseml

PaddleSlim is an open-source library for deep model compression and architecture search.

  • UpdatedDec 4, 2024
  • Python
Large-Language-Model-Notebooks-Course

Practical course about Large Language Models.

  • UpdatedApr 22, 2025
  • Jupyter Notebook

OpenMMLab Model Compression Toolbox and Benchmark.

  • UpdatedJun 11, 2024
  • Python

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.

  • UpdatedFeb 10, 2025
  • Python
autorestic

Config driven, easy backup cli for restic.

  • UpdatedMar 31, 2025
  • Go

Efficient computing methods developed by Huawei Noah's Ark Lab

  • UpdatedNov 5, 2024
  • Jupyter Notebook

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

  • UpdatedOct 7, 2024
  • Python

PyTorch Implementation of [1611.06440] Pruning Convolutional Neural Networks for Resource Efficient Inference

  • UpdatedJul 12, 2019
  • Python

mobilev2-yolov5s剪枝、蒸馏,支持ncnn,tensorRT部署。ultra-light but better performence!

  • UpdatedSep 26, 2022
  • Jupyter Notebook

TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.

  • UpdatedMar 12, 2025
  • Python

Improve this page

Add a description, image, and links to thepruning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thepruning topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2025 Movatter.jp