• Home
  • Categories
  • Tags
  • Pricing
  • Submit
    Decorative pattern
    1. Home
    2. Sdks & Libraries
    3. Dense Passage Retrieval (DPR)

    Dense Passage Retrieval (DPR)

    Set of tools and models from Meta AI Research for open domain Q&A using dense representations, outperforming BM25 by 9%-19% in passage retrieval accuracy with a dual-encoder BERT framework.

    🌐Visit Website

    About this tool

    Overview

    Dense Passage Retriever (DPR) is a set of tools and models for open domain Q&A task developed by Facebook AI Research (now Meta AI). The research shows that retrieval can be practically implemented using dense representations alone, where embeddings are learned from a small number of questions and passages.

    Architecture

    DPR uses:

    • Dense encoder EP(·) which maps any text passage to a d-dimensional real-valued vector
    • Different encoder EQ(·) that maps the input question to a d-dimensional vector
    • Two independent BERT networks (base, uncased)
    • FAISS for efficient inference-time encoding and indexing

    Performance

    When evaluated on a wide range of open-domain QA datasets, the dense retriever outperforms a strong Lucene-BM25 system by 9%-19% absolute in terms of top-20 passage retrieval accuracy.

    Retrieval Process

    At run-time, DPR applies the question encoder to map the input question to a d-dimensional vector, and retrieves k passages of which vectors are the closest to the question vector using efficient similarity search.

    Resources

    • GitHub Repository: github.com/facebookresearch/DPR
    • Models: Pre-trained models available on Hugging Face
    • Paper: Available on arXiv (2004.04906)

    Pricing

    Free and open-source.

    Surveys

    Loading more......

    Information

    Websitegithub.com
    PublishedMar 13, 2026

    Categories

    1 Item
    Sdks & Libraries

    Tags

    3 Items
    #Retrieval#Open Source#Nlp

    Similar Products

    6 result(s)
    spaCy

    spaCy is an industrial-strength NLP library in Python that provides advanced tools for generating word, sentence, and document embeddings. These embeddings are commonly stored and searched in vector databases for NLP and semantic search applications.

    ModernBERT Embed

    Open-source embedding model from Nomic AI based on ModernBERT-base with 149M parameters. Supports 8192 token sequences and Matryoshka Representation Learning for 3x memory reduction.

    HNSWlib
    Featured

    Header-only C++/Python library for fast approximate nearest neighbor search implementing the HNSW algorithm. Used by Spotify and others, offers 10x speed increase over Annoy. This is an OSS library.

    NVIDIA cuVS
    Featured

    GPU-accelerated vector search and clustering library from NVIDIA RAPIDS. Provides 8-12x faster index building and queries with multiple language support (C, C++, Python, Rust). This is an OSS library.

    AutoTokenizer (Hugging Face Transformers)
    Featured

    A utility class from the Hugging Face Transformers library that automatically loads the correct tokenizer for a given pre-trained model. It is crucial for consistent text preprocessing and tokenization, a vital step before generating embeddings for vector database storage.

    SentenceTransformer
    Featured

    A Python library for generating high-quality sentence, text, and image embeddings. It simplifies the process of converting text into dense vector representations, which are fundamental for similarity search and storage in vector databases.

    Decorative pattern
    Built with
    Ever Works
    Ever Works

    Connect with us

    Stay Updated

    Get the latest updates and exclusive content delivered to your inbox.

    Product

    • Categories
    • Tags
    • Pricing
    • Help

    Clients

    • Sign In
    • Register
    • Forgot password?

    Company

    • About Us
    • Admin
    • Sitemap

    Resources

    • Blog
    • Submit
    • API Documentation
    All product names, logos, and brands are the property of their respective owners. All company, product, and service names used in this repository, related repositories, and associated websites are for identification purposes only. The use of these names, logos, and brands does not imply endorsement, affiliation, or sponsorship. This directory may include content generated by artificial intelligence.
    Copyright © 2025 Awesome Vector Databases. All rights reserved.·Terms of Service·Privacy Policy·Cookies