# Router LLM

**Source:** https://promtable.com/glossary/router-llm

> A router LLM is a small fast language model whose only job is to classify or rewrite an incoming request — deciding which downstream model, agent, or tool should handle it.

---
A router LLM is a small fast language model whose only job is to classify or rewrite an incoming request — deciding which downstream model, agent, or tool should handle it.

Router LLMs (typically GPT-4o-mini, Claude Haiku, Gemini Flash, or even a small open-weight model) handle the lightweight "what is this" step before the heavyweight "answer this" step runs. They classify intent, rewrite queries for retrieval, decide which expert agent to call, or pick the cheapest model that will meet quality. In 2026 router LLMs are the production default for cost-sensitive multi-skill apps because they cut frontier-model calls by 50-90% with minimal quality loss when the routing layer is evaluated and tuned.

## When to use

- Cost-sensitive production with diverse query types.
- Multi-agent stacks with skill routing.

## Common mistakes

- Skipping evals on the router — routing errors are invisible without explicit monitoring.
- Using a router for tasks the frontier model could handle directly in fewer calls.

## Related terms

- [model-router](https://promtable.com/glossary/model-router)
- [semantic-routing](https://promtable.com/glossary/semantic-routing)
- [tool-router](https://promtable.com/glossary/tool-router)
- [agent](https://promtable.com/glossary/agent)

*Last updated: 2026-06-01*
---

Original page: https://promtable.com/glossary/router-llm
Maintained by Promtable (https://promtable.com). Content: CC BY 4.0. Cite as "Promtable — https://promtable.com/glossary/router-llm".
Contact: info@vibecodingturkey.com.