technique

Hybrid search (retrieval)

Hybrid search combines dense vector retrieval with sparse keyword (BM25) retrieval, then fuses the two ranked lists — the production retrieval default for RAG in 2026.

Pure vector search misses exact-match queries (product SKUs, error codes, named entities). Pure keyword search misses semantic intent. Hybrid combines them — typically retrieve top 50-100 from each, fuse with reciprocal rank fusion (RRF) or weighted score, then re-rank the top 20-30 with a cross-encoder. Empirically beats both single approaches on production corpora. By 2026 hybrid retrieval is the default in serious RAG stacks (Cohere Rerank pipeline, Voyage AI Rerank, Pinecone hybrid, Weaviate hybrid, Vespa).

When to use hybrid search (retrieval)

Common mistakes

FAQ

What is hybrid search (retrieval)?

Hybrid search combines dense vector retrieval with sparse keyword (BM25) retrieval, then fuses the two ranked lists — the production retrieval default for RAG in 2026.

When should I use hybrid search (retrieval)?

Any production RAG over diverse corpora. Mixed query types (some keyword-heavy, some semantic).

What are the most common mistakes with hybrid search (retrieval)?

Fusing scores directly across normalisation regimes — use RRF or rank-based fusion. Skipping the re-ranker — first-stage fusion is noisy.

Last updated: 2026-06-01. Raw markdown: https://promtable.com/glossary/hybrid-search.md.