# prompt-routing

Here are 21 public repositories matching this topic...

NadirRouter / NadirClaw

Open-source LLM router & AI cost optimizer. Routes simple prompts to cheap/local models, complex ones to premium — automatically. Drop-in OpenAI-compatible proxy for Claude Code, Codex, Cursor, OpenClaw. Saves 40-70% on AI API costs. Self-hosted, no middleman.

Updated Jun 8, 2026
Python

batish52 / codecontext

Reduce your LLM costs by 40-70% automatically. Routes prompts locally, compacts context, tracks real savings.

python gateway code-search openai developer-tools cost-optimization rag llm anthropic prompt-routing

Updated Apr 27, 2026
Python

Izzetee / PiPiMink

Route every prompt to the best LLM — for you specifically.

go router ai openai llm anthropic ollama model-routing prompt-routing

Updated Jun 4, 2026
Go

sidrat2612 / routesmith

Host-aware model routing for coding agents. Python library + MCP server for Claude Code, Codex, Gemini CLI, Copilot, Cursor, and Aider.

Updated May 9, 2026
Python

Uo1428 / llm-switchboard

Blazing-fast, zero-cost local LLM router. Classify and route prompts to specialized AI models with <1ms latency using heuristic rules.

router openai autorouter ai-chatbot llm openrouter llm-router ai-router prompt-routing clawrouter openclaw-router

Updated Feb 25, 2026
TypeScript

Moguifeng-9119 / aperture

Intelligent multi-model LLM routing gateway — route each request to the optimal model, save 40-70% on API costs. Single binary, OpenAI-compatible.

golang router ai api-gateway devtools gateway self-hosted openai glm minimax hacktoberfest kimi moonshot llm llmops qwen deepseek zhipu prompt-routing

Updated Jun 4, 2026
Go

pranabjyotinath1999 / llm-switchboard

Route prompts to local large language models instantly to cut costs and reduce latency with smart, zero-overhead classification under 1 millisecond.

router openai autorouter ai-chatbot llm openrouter llm-router ai-router prompt-routing clawrouter openclaw-router

Updated Jun 9, 2026
TypeScript

jayaram-07 / ecoprompt

Energy-efficient AI prompt routing system — sends simple prompts to lightweight models and reserves larger models for complex reasoning to cut compute, latency, and energy.

react gemini student-project final-year-project energy-efficiency capstone-project inference-optimization fastapi groq green-ai btech-project llm llm-routing model-routing prompt-routing ai-cost-optimization

Updated Jun 3, 2026
Python

MCamner / atlas-one

Local prompt routing studio for structured AI workflows and execution handoff.

javascript java github-pages workflow ai architecture decision-making developer-tools local-first prompt-engineering prompt-engine prompt-library prompt-routing

Updated Jun 8, 2026
JavaScript

Lling0000 / proofroute

CLI-first OpenAI-compatible LLM router/proxy for coding agents that picks the cheapest fast-enough model and proves speed, savings, and prompt-free privacy.

Updated Jun 2, 2026
JavaScript

TribalHouse / claude-context-governor

The control plane for Claude Code. Run a dozen MCP servers and a hundred skills. Pay context for only the ones doing work right now.

mcp developer-tools opus haiku ai-agents sonnet ai-tools anthropic llm-tools context-management mcp-server claude-code token-optimization mcp-router mcp-gateway subagents claude-code-plugin prompt-routing mcp-aggregator

Updated May 17, 2026
JavaScript

iamadhitya1 / llm-router

python router ai openai cost-optimization groq llm anthropic prompt-routing

Updated May 31, 2026
Python

Devatva24 / LLM-Router-MCP

MCP server that intelligently routes prompts across Claude, Gemini, and GPT-4o based on task type — minimising token cost without sacrificing quality.

nodejs typescript mcp gemini developer-tools claude ai-tools llm anthropic gpt-4o ai-router prompt-routing

Updated Apr 19, 2026
JavaScript

rohith-nandan-6 / LLM-Cascade-Router

Intelligent LLM router that dynamically routes prompts between local Ollama (Qwen) and cloud models (Gemini) using complexity scoring, semantic caching, and cost-aware decisioning.

Updated May 15, 2026
Python

net9876 / ai-model-router-lab

Intelligent LLM request router for Azure OpenAI — automatically routes prompts to GPT-4o-mini or GPT-4o based on complexity, task type, environment, and budget. Cuts AI API costs by 60-80% with zero impact on application quality.

python azure cost-optimization fastapi ai-engineering azure-openai llm llmops gpt-4o gpt-4o-mini model-routing prompt-routing

Updated Jun 6, 2026
Python

milkoor / prompt-routing-skill

Prompt posture router for coding tasks; pairs with causetrace

skills opencode codex claude-code prompt-routing causetrace

Updated May 30, 2026
Shell

XidaoApi / local-llm-router

Route prompts between local and cloud LLMs based on task complexity. Use local models (Ollama) for simple tasks, cloud APIs for complex ones. Save 80%+ on AI costs.

Updated May 14, 2026
Python

ssthil / llmroute

Define your LLM providers once, then let llmroute dynamically route every prompt to the right model — a smart CLI switchboard for multi-provider AI workflows.

cli golang gemini openai ai-tools model-switching llm anthropic llm-router prompt-routing multi-provider-llm

Updated Jun 8, 2026
Go

hermes-labs-ai / claude-router

claude-router is a local prompt router that picks the right Claude model tier and prepends the right scaffold using local embeddings before you call the API. A deterministic routing layer for eval, research, content, and review prompts that helps teams stop overspending on Sonnet and Opus when Haiku plus structure is enough.

embeddings developer-tools claude cost-optimization llm prompt-engineering anthropic ollama llm-ops llm-cost llm-routing local-embeddings model-routing prompt-routing

Updated May 31, 2026
Python

XidaoApi / mcp-cost-analyzer

Analyze MCP tool schema overhead, token usage, and per-turn cost for AI agents. Estimate context bloat, tool-calling overhead, and routing savings for OpenAI-compatible backends.

python mcp developer-tools ai-agents cost-optimization developer-productivity llm agentic-ai tool-calling model-context-protocol openai-compatible token-optimization token-counting prompt-routing

Updated May 21, 2026
Python
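Many of the descriptions in this listing share one pattern: score a prompt's complexity with cheap heuristic rules, then send simple prompts to a low-cost or local model and complex ones to a premium model. The following is a minimal sketch of that idea, not the implementation of any repository listed here. The model names, keyword list, word-count cutoff, and threshold are all hypothetical values chosen for illustration.

```python
import re
from dataclasses import dataclass

# Hypothetical model names; none come from the repositories in this listing.
CHEAP_MODEL = "local-small"
PREMIUM_MODEL = "cloud-large"

# Illustrative keywords that hint at multi-step reasoning or code work.
COMPLEX_HINTS = ("refactor", "prove", "debug", "architecture", "optimize")

# Matches common code keywords as whole words.
CODE_PATTERN = re.compile(r"\b(def|class|function)\b")


@dataclass
class Route:
    model: str
    score: int


def complexity_score(prompt: str) -> int:
    """Return a crude complexity score built from three heuristic rules."""
    text = prompt.lower()
    score = 0
    if len(text.split()) > 60:  # assumed cutoff: long prompts need more capacity
        score += 1
    if CODE_PATTERN.search(text):  # the prompt appears to contain or discuss code
        score += 1
    score += sum(1 for hint in COMPLEX_HINTS if hint in text)
    return score


def route(prompt: str, threshold: int = 2) -> Route:
    """Pick the premium model once the score reaches the (assumed) threshold."""
    score = complexity_score(prompt)
    model = PREMIUM_MODEL if score >= threshold else CHEAP_MODEL
    return Route(model=model, score=score)


if __name__ == "__main__":
    print(route("What is the capital of France?"))
    print(route("Debug and refactor this function: def f(x): return x"))
```

Rules like these run locally with no model call, which is where sub-millisecond classification claims plausibly come from. How much cost is actually saved depends on the traffic mix and on how often the cheap model's answers are good enough, so any threshold would need tuning against real prompts.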
