Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings
vllm-project

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

PinnedLoading

  1. vllmvllmPublic

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 69.7k 13.3k

  2. llm-compressorllm-compressorPublic

    Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

    Python 2.7k 387

  3. recipesrecipesPublic

    Common recipes to run vLLM

    Jupyter Notebook 368 139

  4. speculatorsspeculatorsPublic

    A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM

    Python 225 37

  5. semantic-routersemantic-routerPublic

    System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

    Go 3.2k 529

Repositories

Loading
Type
Select type
Language
Select language
Sort
Select order
Showing 10 of 33 repositories

[8]ページ先頭

©2009-2026 Movatter.jp