Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings
vllm-project

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

PinnedLoading

  1. vllmvllmPublic

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 70.8k 13.6k

  2. llm-compressorllm-compressorPublic

    Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

    Python 2.8k 401

  3. recipesrecipesPublic

    Common recipes to run vLLM

    Jupyter Notebook 447 149

  4. speculatorsspeculatorsPublic

    A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM

    Python 243 40

  5. semantic-routersemantic-routerPublic

    System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

    Go 3.2k 536

Repositories

Loading
Type
Select type
Language
Select language
Sort
Select order
Showing 10 of 34 repositories
  • tpu-inference Public

    TPU inference for vLLM, with unified JAX and PyTorch support.

    vllm-project/tpu-inference’s past year of commit activity
    Python 239Apache-2.0 105 47(1 issue needs help) 137 UpdatedFeb 20, 2026
  • vllm Public

    A high-throughput and memory-efficient inference and serving engine for LLMs

    vllm-project/vllm’s past year of commit activity
    Python 70,799Apache-2.0 13,571 1,697(45 issues need help) 1,746 UpdatedFeb 20, 2026
  • llm-compressor Public

    Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

    vllm-project/llm-compressor’s past year of commit activity
    Python 2,763Apache-2.0 401 73(15 issues need help) 39 UpdatedFeb 20, 2026
  • vllm-gaudi Public

    Community maintained hardware plugin for vLLM on Intel Gaudi

    vllm-project/vllm-gaudi’s past year of commit activity
    Python 26Apache-2.0 108 1 50 UpdatedFeb 20, 2026
  • semantic-router Public

    System Level Intelligent Router for Mixture-of-Models at Cloud, Data Center and Edge

    vllm-project/semantic-router’s past year of commit activity
    Go 3,208Apache-2.0 536 108(22 issues need help) 56 UpdatedFeb 20, 2026
  • perf-dashboard Public

    Performance dashboard for vLLM

    vllm-project/perf-dashboard’s past year of commit activity
    Python 10 2 0 UpdatedFeb 20, 2026
  • vllm-metal Public

    Community maintained hardware plugin for vLLM on Apple Silicon

    vllm-project/vllm-metal’s past year of commit activity
    Python 479Apache-2.0 46 8(2 issues need help) 6 UpdatedFeb 20, 2026
  • aibrix Public

    Cost-efficient and pluggable Infrastructure components for GenAI inference

    vllm-project/aibrix’s past year of commit activity
    Go 4,631Apache-2.0 526 258(19 issues need help) 24 UpdatedFeb 20, 2026
  • vllm-daily Public

    vLLM Daily Summarization of Merged PRs

    vllm-project/vllm-daily’s past year of commit activity
    41 3 0 0 UpdatedFeb 20, 2026
  • bart-plugin Public

    vLLM Model plugin for the encoder-decoder BART model

    vllm-project/bart-plugin’s past year of commit activity
    Python 8Apache-2.0 1 0 2 UpdatedFeb 20, 2026

Sponsors

  • @Flink-ddd
  • @excoffierleonard
  • @adampattersonct
  • @hhayan
  • @jinbum-kim
  • @HeartReset
  • @nwthomas
  • @G-Research-OSS
  • @kevATin
  • @yankay
  • @brickfrog
  • Private Sponsor
  • Private Sponsor

[8]ページ先頭

©2009-2026 Movatter.jp