



Best Matching 25 ranking function for information retrieval that ranks documents based on query term frequency with length normalization. Core component of hybrid search RAG systems combining keyword and semantic search.
Loading more......
BM25 (Best Matching 25) is a probabilistic ranking function used in information retrieval that has become a cornerstone of hybrid search systems in RAG applications. It ranks documents based on query term frequency with sophisticated normalization.
BM25 scoring formula considers:
Hybrid search addresses limitations of single-strategy retrieval by combining:
Run dense and BM25 queries in parallel, fuse ranked lists via RRF (Reciprocal Rank Fusion), then apply cross-encoder or ColBERT re-ranking over the fused top-k (typically k=50-200).
Combining BM25 full-text search with vector search significantly improves nDCG gains over pure vector search.
Most modern vector databases support BM25: