• Home
  • Categories
  • Tags
  • Pricing
  • Submit
    Decorative pattern
    1. Home
    2. Concepts & Definitions
    3. BM42

    BM42

    Experimental sparse embedding approach combining exact keyword search with transformer intelligence, integrating sparse and dense vector searches for improved RAG results, developed by Qdrant.

    🌐Visit Website

    About this tool

    Overview

    BM42 is a new sparse embedding approach that combines the benefits of exact keyword search with the intelligence of transformers. It was developed by Qdrant as a search algorithm combining vector and standard BM25 keyword search methods to get better RAG results.

    Key Features

    Hybrid Search Approach

    At the core of BM42's innovation is its hybrid search capability, which seamlessly integrates both sparse and dense vector searches:

    • Sparse vector handles exact term matching
    • Dense vectors handle semantic relevance and deep meaning

    Technical Innovation

    As a sparse search technique, it retains the inverse document frequency (IDF) aspect of BM25, equipping BM42 with the core ability to capture rare and out-of-vocabulary terms. The key innovation lies in how it defines token-level relevance within documents.

    Transformer Integration

    BM42 reverses the tokenization process after getting the attention vectors, and the attention weights of subwords can be summed to get the attention weight of the word.

    Important Considerations

    Experimental Status: Recent evaluations have raised questions about the validity of BM42, and future developments may address these concerns. BM42 does not outperform BM25 implementation of other vendors and should be considered as an experimental approach which requires further research and development before it can be used in production.

    Implementation

    Starting from Qdrant v1.10.0, BM42 can be used in Qdrant via FastEmbed inference.

    Use Cases

    • Research and experimentation with hybrid search
    • Development of new sparse retrieval methods
    • Evaluation of sparse-dense search combinations

    Pricing

    Free to use as part of Qdrant.

    Surveys

    Loading more......

    Information

    Websiteqdrant.tech
    PublishedMar 13, 2026

    Categories

    1 Item
    Concepts & Definitions

    Tags

    3 Items
    #Sparse#Hybrid Search#Experimental

    Similar Products

    6 result(s)
    pinecone-sparse-english-v0
    Featured

    Learned sparse embedding model built on DeepImpact architecture, outperforming BM25 by up to 44% on TREC benchmarks for high-precision keyword search and hybrid retrieval.

    Cascading Retrieval
    Featured

    Advanced retrieval approach combining dense vectors, sparse vectors, and reranking in a multi-stage pipeline, achieving up to 48% better performance than single-method retrieval.

    Hybrid Search with Reciprocal Rank Fusion

    Search technique combining BM25 lexical search and semantic vector search using Reciprocal Rank Fusion (RRF) to merge results, balancing precision of keyword matching with contextual understanding of neural embeddings.

    Reciprocal Rank Fusion (RRF)

    Hybrid search algorithm combining results from multiple ranking systems by computing reciprocal ranks, commonly used to merge dense vector search with sparse keyword search for improved retrieval.

    Reciprocal Rank Fusion

    Method for combining ranked lists from multiple retrieval systems in hybrid search. Standard technique in RAG pipelines for fusing BM25 and dense vector results before reranking, creating diverse high-confidence candidate sets.

    Filtered Vector Search

    Combining vector similarity search with metadata filtering. Enables queries like find similar documents published after 2023 in category Technology.

    Decorative pattern
    Built with
    Ever Works
    Ever Works

    Connect with us

    Stay Updated

    Get the latest updates and exclusive content delivered to your inbox.

    Product

    • Categories
    • Tags
    • Pricing
    • Help

    Clients

    • Sign In
    • Register
    • Forgot password?

    Company

    • About Us
    • Admin
    • Sitemap

    Resources

    • Blog
    • Submit
    • API Documentation
    All product names, logos, and brands are the property of their respective owners. All company, product, and service names used in this repository, related repositories, and associated websites are for identification purposes only. The use of these names, logos, and brands does not imply endorsement, affiliation, or sponsorship. This directory may include content generated by artificial intelligence.
    Copyright © 2025 Awesome Vector Databases. All rights reserved.·Terms of Service·Privacy Policy·Cookies