• Home
  • Categories
  • Tags
  • Pricing
  • Submit
    Built with
    Ever Works
    Ever Works

    Connect with us

    Stay Updated

    Get the latest updates and exclusive content delivered to your inbox.

    Product

    • Categories
    • Tags
    • Pricing
    • Help

    Clients

    • Sign In
    • Register
    • Forgot password?

    Company

    • About Us
    • Admin
    • Sitemap

    Resources

    • Blog
    • Submit
    • API Documentation
    All product names, logos, and brands are the property of their respective owners. All company, product, and service names used in this repository, related repositories, and associated websites are for identification purposes only. The use of these names, logos, and brands does not imply endorsement, affiliation, or sponsorship. This directory may include content generated by artificial intelligence.
    Copyright © 2025 Awesome Vector Databases. All rights reserved.·Terms of Service·Privacy Policy·Cookies
    Decorative pattern
    Decorative pattern
    1. Home
    2. Concepts & Definitions
    3. Hybrid Search (BM25 + Vector)

    Hybrid Search (BM25 + Vector)

    A search approach combining traditional keyword-based BM25 ranking with modern vector similarity search. By leveraging both lexical matching and semantic understanding, hybrid search provides superior retrieval quality through techniques like reciprocal rank fusion (RRF) to merge results from both methods.

    Overview

    Hybrid search combines the strengths of traditional keyword-based search (typically BM25) with modern semantic vector search. This approach captures both exact keyword matches and semantic similarities, providing more robust retrieval than either method alone.

    How It Works

    Dual Retrieval

    1. BM25 (Keyword Search): Finds documents with exact or partial keyword matches
    2. Vector Search: Finds semantically similar documents using embeddings
    3. Fusion: Combines results using algorithms like RRF (Reciprocal Rank Fusion)

    Key Advantages

    • Best of Both Worlds: Captures exact matches AND semantic meaning
    • Handles Terminology: BM25 catches specific terms, jargon, IDs
    • Semantic Understanding: Vector search finds conceptually similar content
    • Improved Recall: Less likely to miss relevant documents
    • Robustness: Works well across diverse query types

    Fusion Methods

    Reciprocal Rank Fusion (RRF)

    Most common approach:

    • Score = 1/(rank + k) for each result
    • Combines scores from both methods
    • Parameter k typically set to 60

    Linear Combination

    • Weighted sum of BM25 and vector scores
    • Requires score normalization
    • Weights can be tuned per application

    Use Cases

    • Enterprise Search: Mix of keywords and concepts
    • E-commerce: Product names + semantic features
    • RAG Systems: Better document retrieval for LLMs
    • Legal/Medical: Specific terms + conceptual queries
    • Customer Support: Ticket routing and KB search

    Implementation Examples

    Major platforms supporting hybrid search:

    • Weaviate (native hybrid search with alpha parameter)
    • Elasticsearch (hybrid scoring)
    • OpenSearch (with RRF)
    • Qdrant (hybrid search mode)
    • Pinecone (sparse-dense hybrid)

    Best Practices

    • Start with equal weights, then tune based on data
    • Use RRF for simplicity and robustness
    • Consider query type (keyword-heavy vs conceptual)
    • Test with representative queries
    • Monitor both BM25 and vector contributions

    Pricing

    Implementation varies by vector database platform.

    Surveys

    Loading more......

    Information

    Websiteweaviate.io
    PublishedMar 22, 2026

    Categories

    1 Item
    Concepts & Definitions

    Tags

    3 Items
    #hybrid-search#BM25#semantic-search

    Similar Products

    6 result(s)

    Hybrid Search with Reciprocal Rank Fusion

    Search technique combining BM25 lexical search and semantic vector search using Reciprocal Rank Fusion (RRF) to merge results, balancing precision of keyword matching with contextual understanding of neural embeddings.

    embedded-vector-db

    Lightweight npm package providing an embedded vector database for Node.js applications. Offers vector similarity search with HNSW, BM25 full-text search, hybrid search using weighted fusion or Reciprocal Rank Fusion (RRF), multi-namespace support, CRUD operations, metadata filtering, concurrency safety, and persistent storage to disk. Designed for RAG pipelines and semantic search use cases.

    Algolia AI Search

    AI-powered search platform that evolved from a traditional keyword search engine to include vector embeddings and semantic retrieval. Offers federated indexing and developer-friendly APIs for teams already using Algolia, adding semantic search without managing a separate vector database.

    Blockify

    AI search and vector database platform that provides unified vector search with semantic understanding, hybrid search capabilities, and developer-friendly APIs for building intelligent search applications.

    ParadeDB

    PostgreSQL extension enabling fast full-text, faceted, and hybrid search over Postgres tables using the BM25 algorithm. Built on Tantivy for production-ready search with ACID guarantees and transactional consistency.

    OpenSearch Vector Search

    OpenSearch Vector Search is the vector similarity search and AI search capability within the OpenSearch engine, supporting vector indices, ingestion of embedding data, and search methods including raw vector search, semantic search, hybrid search, multimodal search, and neural sparse search. It enables building RAG and conversational search applications using either user-provided embeddings or embeddings generated automatically by OpenSearch.