quantization

Here are 773 public repositories matching this topic...

LLaMA-Factory

Chinese-LLaMA-Alpaca

Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)

  • Updated Apr 30, 2024
  • Python

Faster Whisper transcription with CTranslate2

  • Updated Apr 29, 2025
  • Python
Qbot

[🔥 updating ...] AI automated quantitative trading bot (fully local deployment). AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ qbot-mini: https://github.com/Charmve/iQuant

  • Updated May 5, 2025
  • Jupyter Notebook
bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

  • Updated May 9, 2025
  • Python

Lossy PNG compressor — pngquant command based on libimagequant library

  • Updated Jan 23, 2025
  • C

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

  • Updated Apr 11, 2025
  • Python
deepsparse

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

  • Updated Jan 22, 2024
  • Python
nlp-architect

A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks

  • Updated Nov 7, 2022
  • Python

🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy-to-use hardware optimization tools

  • Updated May 9, 2025
  • Python

Base pretrained models and datasets in PyTorch (MNIST, SVHN, CIFAR10, CIFAR100, STL10, AlexNet, VGG16, VGG19, ResNet, Inception, SqueezeNet)

  • Updated Nov 22, 2022
  • Python

Build, customize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our Discord community: https://discord.gg/TgHXuSJEk6

  • Updated Sep 23, 2024
  • Python

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

  • Updated May 9, 2025
  • Python

Run Mixtral-8x7B models in Colab or consumer desktops

  • Updated Apr 8, 2024
  • Python

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

  • Updated May 10, 2025
  • Python

micronet, a model compression and deployment library. Compression: 1. quantization: quantization-aware training (QAT), high-bit (>2b) (DoReFa / Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference), low-bit (≤2b) / ternary and binary (TWN/BNN/XNOR-Net); post-training quantization (PTQ), 8-bit (TensorRT); 2. pruning: normal, reg…

  • Updated May 6, 2025
  • Python

A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research and is continuously being improved. PRs adding works (papers, repositories) that the repo has missed are welcome.

  • Updated Mar 4, 2025
