llm-routing
Here are 8 public repositories matching this topic...
The data plane for agents. Arch is a models-native proxy server that handles the plumbing work in AI: agent routing & hand off, guardrails, zero-code logs and traces, unified access to LLMs from OpenAI, Anthropic, Ollama, etc. Build agents faster, and scale them reliably.
- Updated
Nov 28, 2025 - Rust
A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"
- Updated
Nov 21, 2025 - HTML
High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model discovery across local and remote inference backends.
- Updated
Nov 23, 2025 - Go
RouterArena: An open framework for evaluating LLM routers with standardized datasets, metrics, an automated framework, and a live leaderboard.
- Updated
Nov 28, 2025 - Python
Compose, train and test fast LLM routers
- Updated
Oct 8, 2025 - Python
NeurIPS 2025 paper "MESS+: Dynamically Learned Inference-Time LLM Routing in Model Zoos with Service Level Guarantees"
- Updated
Sep 22, 2025 - Jupyter Notebook
Unified interface server for various LLM providers with OpenAI API format
- Updated
Jun 9, 2025 - Python
Hybrid AI routing: LOCAL Ollama + CLOUD GitHub Copilot
- Updated
Oct 19, 2025 - Python
Improve this page
Add a description, image, and links to thellm-routing topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thellm-routing topic, visit your repo's landing page and select "manage topics."