Movatterモバイル変換

Skip to content

#

llm-routing

Here are 8 public repositories matching this topic...

Language:All

Filter by language

All8 Python4 Go1 HTML1 Jupyter Notebook1 Rust1

katanemo /archgw

The data plane for agents. Arch is a models-native proxy server that handles the plumbing work in AI: agent routing & hand off, guardrails, zero-code logs and traces, unified access to LLMs from OpenAI, Anthropic, Ollama, etc. Build agents faster, and scale them reliably.

proxy routing gateway prompt proxy-server openai envoy envoyproxy llms generative-ai llmops llm-inference llm-proxy ai-gateway llm-gateway llm-routing ai-gateway-support

UpdatedNov 28, 2025
Rust

junchenzhi /Awesome-LLM-Ensemble

A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"

moe ensemble ensemble-learning routing-algorithm multi-agent-systems ensemble-prediction ensemble-models ensemble-machine-learning large-language-models llms llm-agents llm-routing llm-ensemble multi-llms

UpdatedNov 21, 2025
HTML

thushan /olla

High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model discovery across local and remote inference backends.

golang ai amd proxy intel self-hosted nvidia mlx llamacpp llama-cpp vllm llm-inference local-ai ollama llm-proxy lmstudio sglang llm-router llm-routing self-hosted-ai

UpdatedNov 23, 2025
Go

RouteWorks /RouterArena

RouterArena: An open framework for evaluating LLM routers with standardized datasets, metrics, an automated framework, and a live leaderboard.

arena routing multi-agent multi-agent-systems router-benchmark llm llm-router llm-routing router-evaluation router-leaderboard

UpdatedNov 28, 2025
Python

sebastianpinedaar /llumux

Compose, train and test fast LLM routers

model-selection automl-pipeline large-language-models llms reward-model llm-training llm-inference llm-evaluation llm-pipeline llm-routing

UpdatedOct 8, 2025
Python

laminair /mess-plus

NeurIPS 2025 paper "MESS+: Dynamically Learned Inference-Time LLM Routing in Model Zoos with Service Level Guarantees"

llm-routing sla-management lyapunov-optimization cost-aware-routing sla-guarantees

UpdatedSep 22, 2025
Jupyter Notebook

v4ler11 /llm-portal

Unified interface server for various LLM providers with OpenAI API format

fastapi llms litellm llm-routing

UpdatedJun 9, 2025
Python

danindiana /copilot-bridge

Hybrid AI routing: LOCAL Ollama + CLOUD GitHub Copilot

python machine-learning ai prometheus performance-optimization cost-optimization gpu-optimization meta-reasoning smart-routing dual-gpu github-copilot llm local-llm ollama ai-proxy llm-routing

UpdatedOct 19, 2025
Python

Improve this page

Add a description, image, and links to thellm-routing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thellm-routing topic, visit your repo's landing page and select "manage topics."

[8]ページ先頭

©2009-2025 Movatter.jp