llm-routing

Here are 153 public repositories matching this topic...

katanemo / plano

Plano is an AI-native proxy and data plane for agentic apps — with built-in orchestration, safety, observability, and smart LLM routing so you stay focused on your agents core logic.

proxy routing gateway prompt proxy-server openai envoy envoyproxy llms generative-ai llmops llm-inference llm-proxy ai-gateway llm-gateway llm-routing ai-gateway-support

Updated Jun 17, 2026
Rust

mnfst / awesome-free-llm-apis

Star

List of Permanent Free LLM API (API Keys)

awesome router gemini openai awesome-list ai-agents llm anthropic ollama llm-router llm-routing openclaw openclaw-plugin

Updated Jun 16, 2026
JavaScript

junchenzhi / Awesome-LLM-Ensemble

Star

A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"

multi-agent moe ensemble ensemble-learning routing-algorithm multi-agent-systems ensemble-prediction ensemble-models ensemble-machine-learning ensemble-methods large-language-models llms llm-agents llm-routing llm-collaboration llm-ensemble multi-llms

Updated Jun 15, 2026
HTML

thushan / olla

Sponsor

Star

High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model discovery across local and remote inference backends.

Updated Jun 18, 2026
Go

RouteWorks / RouterArena

Star

RouterArena: An open framework for evaluating LLM routers with standardized datasets, metrics, an automated framework, and a live leaderboard.

arena routing multi-agent multi-agent-systems router-benchmark llm llm-router llm-routing router-evaluation router-leaderboard

Updated Jun 18, 2026
Python

Self-hosted LLM gateway that routes requests across AI providers (OpenAI, Anthropic, Gemini, Mistral, Ollama) using intelligent multi-policy scoring — including an LLM-native routing policy. Drop-in compatible: just swap the base URL. No database required, built-in cost tracking, budget enforcement and multi-tenant isolation.

multi-tenant self-hosted anthropic openai-proxy llm-proxy cost-tracking ai-gateway llm-gateway llm-routing ai-router budget-enforcement

Updated Jun 14, 2026
TypeScript

BingoWon / keyaos

Star

Edge-native AI API gateway — cost-optimized routing across providers, multi-protocol support, built on Cloudflare Workers.

react typescript api-proxy edge-computing multimodal cloudflare-workers openai-api anthropic ai-gateway llm-routing

Updated May 21, 2026
TypeScript

qualixar / qualixar-os

Star

Qualixar OS: The Universal OS for AI Agents. Claw-compatible. 12 topologies, Forge AI team designer, 24-tab dashboard, skill marketplace. PAPER: https://arxiv.org/abs/2604.06392

typescript mcp multi-agent ai-agents agent-framework llm-routing agent-orchestration agent-reliability agent-os qualixar judge-pipeline

Updated May 25, 2026
TypeScript

kalibr-ai / kalibr-sdk-python

Star

Stop overpaying to run your agents. Kalibr routes every request to lower-cost model and tool paths without degrading performance.

Updated Jun 3, 2026
Python

open-world-project / model-router

Star

Automatic cost-aware model routing plugin for Hermes Agent

python plugin ai-agent openrouter llm-routing hermes-agent

Updated May 10, 2026
Python

skrashevich / botmux

Star

Web-based command center for managing Telegram bots — multi-bot dashboard, reverse proxy, inter-bot routing, protocol bridges, and LLM-powered smart routing

bot docker golang telegram dashboard sqlite proxy webhook self-hosted admin-panel botapi bot-management longpolling slack-bridge llm-routing webupdates

Updated Jun 14, 2026
Go

Hyperion-HQ / Hyperion

Star

Ultra-low-latency LLM gateway with microsecond caching, dynamic routing, budgets, analytics, and forecasting.

Updated Apr 2, 2026
Go

ankitvirdi4 / awesome-llm-cost

Star

Tools, libraries, papers, and patterns for reducing the cost of running large language models in production.

awesome gemini openai awesome-list quantization finops cost-engineering llm prompt-caching anthropic llm-observability llm-cost llm-routing llm-caching ai-cost

Updated Jun 5, 2026

deltawi / deltallm

Star

Route, manage, and analyze your LLM requests across multiple providers with a unified API interface

kubernetes api-gateway mcp self-hosted multi-llm llm-proxy ai-gateway ai-infrastructure llm-gateway llm-routing model-context-protocol openai-compatible

Updated Jun 19, 2026
Python

orvi2014 / Baar-Core

Star

Budget-Aware Agentic Routing (BAAR) — Intelligent LLM model selection with a zero-call financial kill-switch. Save 90% on costs without losing accuracy.

python-library openai budget-management ai-safety cost-optimization langchain agentic-ai llm-routing

Updated May 20, 2026
Python

magent4aci / openJiSi

Star

[ICML2026] The official code of "Beyond Gemini-3-Pro: Revisiting LLM Routing and Aggregation at Scale"

routing multi-agent aggregation llm llm-routing llm-ensemble icml-2026

Updated Jun 3, 2026
Python

hussi9 / skill-router

Star

Skill + Agent + Model + Thinking depth — auto-routed before any tool fires. One SKILL.md for Claude Code. 90% routing accuracy, per-step model enforcement, 30%+ savings on multi-step chains.

opensource developer-tools ai-agents prompt-engineering anthropic claude-cli llm-routing agent-orchestration claude-code claude-skills

Updated Jun 11, 2026
Python

Das-rebel / a3m-router

Star

RouterArena #1 among known public baselines: 96.77% accuracy, $0.0768/1K, 1.0000 robustness. OpenAI-compatible LLM router across 47+ providers.

Updated Jun 19, 2026
TypeScript

rohansx / nvidia-litellm-router

Sponsor

Star

Free LLM router - latency-based routing across 31 NVIDIA NIM models with automatic failover.

nvidia llama free-api litellm deepseek nvidia-nim llm-routing openai-compatible model-router

Updated Mar 28, 2026
Python

ZhangYiqun018 / MTRouter

Star

[ACL 2026] Official implementation of MTRouter, a cost-aware multi-turn LLM routing framework accepted to ACL 2026 Main Conference.

benchmarking multi-turn hle llm-agents llm-routing cost-aware-routing scienceworld acl-2026

Updated Jun 16, 2026
Python

Improve this page

Add a description, image, and links to the llm-routing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the llm-routing topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llm-routing

Here are 153 public repositories matching this topic...

katanemo / plano

mnfst / awesome-free-llm-apis

junchenzhi / Awesome-LLM-Ensemble

thushan / olla

RouteWorks / RouterArena

Inebrio / Routerly

BingoWon / keyaos

qualixar / qualixar-os

kalibr-ai / kalibr-sdk-python

open-world-project / model-router

skrashevich / botmux

Hyperion-HQ / Hyperion

ankitvirdi4 / awesome-llm-cost

deltawi / deltallm

orvi2014 / Baar-Core

magent4aci / openJiSi

hussi9 / skill-router

Das-rebel / a3m-router

rohansx / nvidia-litellm-router

ZhangYiqun018 / MTRouter

Improve this page

Add this topic to your repo