#

quantization

Here are 999 public repositories matching this topic...

LLaMA-Factory

Faster Whisper transcription with CTranslate2

  • Updated Nov 19, 2025
  • Python
Chinese-LLaMA-Alpaca

Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)

  • Updated Jul 15, 2025
  • Python
Qbot

[🔥updating ...] AI automated quantitative trading bot (fully local deployment). AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ :news: qbot-mini: https://github.com/Charmve/iQuant

  • Updated Jul 6, 2025
  • Jupyter Notebook
bitsandbytes

Accessible large language models via k-bit quantization for PyTorch.

  • Updated Dec 12, 2025
  • Python

Lossy PNG compressor — pngquant command based on libimagequant library

  • Updated Jul 7, 2025
  • C

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

  • Updated Apr 11, 2025
  • Python

[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

  • Updated Nov 17, 2025
  • Python

🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools

  • Updated Dec 17, 2025
  • Python
deepsparse

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

  • Updated Jan 22, 2024
  • Python
nlp-architect

A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks

  • Updated Nov 7, 2022
  • Python

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

  • Updated Dec 11, 2025
  • Cuda

Base pretrained models and datasets in pytorch (MNIST, SVHN, CIFAR10, CIFAR100, STL10, AlexNet, VGG16, VGG19, ResNet, Inception, SqueezeNet)

  • Updated Nov 22, 2022
  • Python

Build, personalize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6

  • Updated Dec 2, 2025
  • Python

ComfyUI Plugin of Nunchaku

  • Updated Nov 8, 2025
  • Python

PyTorch native quantization and sparsity for training and inference

  • Updated Dec 17, 2025
  • Python

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime

  • Updated Dec 17, 2025
  • Python

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

  • Updated Dec 17, 2025
  • Python
