Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

int8-quantization

Here are 18 public repositories matching this topic...

CVT, a Computer Vision Toolkit.

  • UpdatedAug 24, 2022
  • C

Winner solution of mobile AI (CVPRW 2021).

  • UpdatedMay 14, 2022
  • Python

将端上模型部署过程中,常见的问题以及解决办法记录并汇总,希望能给其他人带来一点帮助。

  • UpdatedAug 17, 2022
  • Python

VB.NET api wrapper for llm-inference chatllm.cpp

  • UpdatedNov 26, 2024
  • Visual Basic .NET

Corrects your grammar in 5 languages directly in your browser. Powered by an open-source AI model.

  • UpdatedJul 12, 2025
  • JavaScript

TinyML project. This system monitors your room or surrounding with an onboard microphone of Arduino nano BLE sense. Still Under Developement

  • UpdatedOct 18, 2021
  • Jupyter Notebook
zoneburst

High-performance LLM inference platform with vLLM continuous batching achieving 12.3K+ req/sec, 42ms P50/178ms P99 latency, INT8/INT4 quantization (70% memory savings), tensor parallelism across 4 GPUs, and comprehensive monitoring serving 1500+ concurrent users.

  • UpdatedOct 3, 2025
  • Python

Post-Training quantization perfomed on the model trained with CLIC dataset.

  • UpdatedSep 1, 2025
  • Jupyter Notebook

Translation API using Meta's NLLB-200 model with 200+ languages

  • UpdatedDec 5, 2025
  • Python

Yandex LLM Scaling Week 2025

  • UpdatedDec 8, 2025
  • Jupyter Notebook

Improve this page

Add a description, image, and links to theint8-quantization topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with theint8-quantization topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2025 Movatter.jp