llm-d

llm-d enables high performance distributed inference in production on Kubernetes

llm-d is a well-lit path for serving large language models at scale with the fastest time-to-value and competitive performance per dollar. Built on vLLM, Kubernetes, and Inference Gateway, llm-d provides modular solutions for distributed inference with features like KV-cache aware routing and disaggregated serving.

Key Resources

🤝 How to Contribute

Join the Community

  1. 💬 Slack: Join our development discussions at llm-d.slack.com
  2. 📧 Google Group: Subscribe to llm-d-contributors for architecture docs and meeting invites
  3. 🗓️ Weekly Standup: Wednesdays at 12:30 ET - Public Calendar

Contributing Code

  1. Read Guidelines: Review our Code of Conduct and contribution process
  2. Sign Commits: All commits require DCO sign-off (git commit -s)
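
As a minimal sketch of what the sign-off step looks like in practice, the script below creates a throwaway repository and makes one signed-off commit. The name, email address, file, and commit message are placeholders chosen for illustration, not values required by llm-d.

```shell
#!/bin/sh
# Sketch: make one signed-off commit in a throwaway repository.
set -eu

repo=$(mktemp -d)          # scratch directory, not a real llm-d repo
cd "$repo"
git init -q

# Placeholder identity (assumption); the sign-off trailer is built from these values.
git config user.name "Example Contributor"
git config user.email "contributor@example.com"

echo "hello" > README.md
git add README.md

# -s (--signoff) appends a "Signed-off-by:" trailer to the commit message.
git commit -q -s -m "docs: add README"

# Print the full commit message, including the trailer.
msg=$(git log -1 --format=%B)
printf '%s\n' "$msg"
```

If the most recent commit was made without the trailer, `git commit --amend --no-edit -s` adds it without changing the message.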

Ways to Contribute

  • 🐛 Bug fixes and small features - Submit PRs directly to component repos
  • 🚀 New features with APIs - Require project proposals
  • 📚 Documentation - Help improve guides and examples
  • 🧪 Testing & Benchmarking - Contribute to our test coverage
  • 💡 Experimental features - Start in llm-d-incubation org

License: Apache 2.0

Pinned

  1. llm-d (Public)

    Achieve state of the art inference performance with modern accelerators on Kubernetes

    Shell · 2.1k stars · 251 forks

  2. llm-d-inference-scheduler (Public)

    Inference scheduler for llm-d

    Go · 107 stars · 102 forks

  3. llm-d-kv-cache-manager (Public)

    Distributed KV cache coordinator

    Go · 89 stars · 59 forks

  4. llm-d-benchmark (Public)

    llm-d benchmark scripts and tooling

    Jupyter Notebook · 33 stars · 39 forks

  5. llm-d-routing-sidecar (Public)

    Incubating P/D sidecar for llm-d

    Go · 16 stars · 28 forks
