llm-routing
Here are 10 public repositories matching this topic...
Delivery infrastructure for agents. Arch is a models-native proxy and data plane for agents that handles plumbing work in AI - like agent routing and orchestration, guardrails, zero-code logs and traces, and unified access to LLMs (OpenAI, Anthropic, Ollama, etc.). Build agents faster and deliver them reliably to prod.
- Updated
Dec 18, 2025 - Rust
A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"
- Updated
Dec 15, 2025 - HTML
High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model discovery across local and remote inference backends.
- Updated
Dec 15, 2025 - Go
RouterArena: An open framework for evaluating LLM routers with standardized datasets, metrics, an automated framework, and a live leaderboard.
- Updated
Dec 8, 2025 - Python
A neural multi-armed bandit framework for routing prompts to the most suitable LLM in a multi-agent system.
- Updated
Dec 10, 2025 - Python
Compose, train and test fast LLM routers
- Updated
Oct 8, 2025 - Python
NeurIPS 2025 paper "MESS+: Dynamically Learned Inference-Time LLM Routing in Model Zoos with Service Level Guarantees"
- Updated
Sep 22, 2025 - Jupyter Notebook
Unified interface server for various LLM providers with OpenAI API format
- Updated
Jun 9, 2025 - Python
Hybrid AI routing: LOCAL Ollama + CLOUD GitHub Copilot
- Updated
Oct 19, 2025 - Python
Enrutador inteligente de LLMs para el IndesIAhack 2025 que selecciona el mejor modelo por consulta para minimizar coste y consumo energético manteniendo la calidad.
- Updated
Dec 6, 2025 - Python
Improve this page
Add a description, image, and links to thellm-routing topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thellm-routing topic, visit your repo's landing page and select "manage topics."