Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings
vllm-project

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

PinnedLoading

  1. vllmvllmPublic

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 65.7k 12k

  2. llm-compressorllm-compressorPublic

    Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

    Python 2.4k 320

  3. recipesrecipesPublic

    Common recipes to run vLLM

    Jupyter Notebook 281 102

  4. speculatorsspeculatorsPublic

    A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM

    Python 162 21

  5. semantic-routersemantic-routerPublic

    Intelligent Router for Mixture-of-Models

    Go 2.5k 322

Repositories

Loading
Type
Select type
Language
Select language
Sort
Select order
Showing 10 of 30 repositories

[8]ページ先頭

©2009-2025 Movatter.jp