• Home
  • Categories
  • Tags
  • Pricing
  • Submit
    Decorative pattern
    1. Home
    2. Research Papers & Surveys
    3. DET-LSH

    DET-LSH

    DET-LSH is a locality-sensitive hashing scheme that introduces a dynamic encoding tree structure to accelerate approximate nearest neighbor (ANN) search in high-dimensional spaces. While it is a research algorithm rather than a production database, it directly targets the core operation behind vector databases—efficient ANN search over vector embeddings—and is relevant for designing or optimizing vector indexing components within vector database systems.

    🌐Visit Website

    About this tool

    DET-LSH

    Category: Research Papers & Surveys
    Brand: arXiv
    Source: Paper PDF
    Code & Artifacts: GitHub – WeiJiuQi/DET-LSH

    Overview

    DET-LSH is a locality-sensitive hashing (LSH) scheme designed for approximate nearest neighbor (ANN) search in high-dimensional Euclidean spaces. It introduces a Dynamic Encoding Tree (DE-Tree) structure to accelerate both indexing and querying, with a focus on improving indexing efficiency—an aspect often underemphasized in traditional LSH-based methods.

    Key Concepts

    • Approximate Nearest Neighbor (ANN) Search: Targets efficient retrieval of points whose distances to a query are within a factor (c) of the true nearest neighbor distance in high-dimensional spaces.
    • Locality-Sensitive Hashing (LSH): A hashing framework that increases collision probability for nearby points to enable sublinear-time ANN search.

    Features

    Dynamic Encoding Tree (DE-Tree)

    • Encoding-based index tree designed specifically for ANN tasks.
    • Focuses on improving indexing efficiency rather than only query-time optimizations.
    • Avoids directly partitioning the raw multi-dimensional space, mitigating performance degradation as dimensionality increases.
    • Supports efficient range queries based on Euclidean distance.

    DET-LSH Scheme

    • Builds multiple independent DE-Tree indexes over the data.
    • Employs a novel query strategy that:
      • Performs range queries across multiple DE-Trees.
      • Reduces the probability of missing exact nearest neighbor points, thereby improving recall and overall query accuracy.
    • Designed as a general LSH-based framework that can be used to inspire or implement vector indexing components in vector databases.

    Theoretical Properties

    • Provides probabilistic guarantees on query accuracy, in line with classical LSH theory.
    • Formally analyzed within the approximate nearest neighbor ((c)-ANN) framework for Euclidean distance.

    Performance Characteristics

    • Targets high-dimensional Euclidean spaces where the curse of dimensionality makes exact NN search impractical.
    • Experimental results on real-world datasets (as reported in the paper):
      • Up to 6× speedup in indexing time compared to state-of-the-art LSH-based methods.
      • Up to 2× speedup in query time.
      • Higher query accuracy than competing LSH-based approaches under comparable settings.

    Use Cases

    • Designing or optimizing vector indexing components in vector databases and search systems.
    • High-dimensional ANN search applications in:
      • Databases and data management
      • Information retrieval
      • Data mining
      • Machine learning and embedding-based similarity search

    Publication & Licensing

    • Venue: PVLDB, Vol. 17, No. 9 (2024), pp. 2241–2254.
    • DOI: 10.14778/3665844.3665854
    • License: Creative Commons BY-NC-ND 4.0 (non-commercial, no derivatives for the paper text).

    Pricing

    • Not applicable. DET-LSH is a research algorithm with an academic publication and open-source artifacts rather than a commercial product or service with pricing plans.
    Surveys

    Loading more......

    Information

    Websitearxiv.org
    PublishedDec 25, 2025

    Categories

    1 Item
    Research Papers & Surveys

    Tags

    3 Items
    #Ann#Hashing#High Dimensional

    Similar Products

    6 result(s)
    Efficient Locality Sensitive Hashing

    This work by Jingfan Meng is a comprehensive research thesis on efficient locality-sensitive hashing (LSH), covering algorithmic solutions, core primitives, and applications for approximate nearest neighbor search. It is relevant to vector databases because LSH-based indexing is a foundational technique for scalable similarity search over high-dimensional vectors, informing the design of vector indexes, retrieval engines, and similarity search modules in modern vector database systems.

    Li, Wen, et al. "Approximate nearest neighbor search on high dimensional data—experiments, analyses, and improvement."

    An influential paper analyzing and improving approximate nearest neighbor search methods for high-dimensional data, highly relevant for developing and understanding vector databases.

    ANN Library

    A C++ library for approximate nearest neighbor searching in arbitrarily high dimensions, developed by David Mount and Sunil Arya at the University of Maryland. Provides data structures and algorithms for both exact and approximate nearest neighbor searching.

    Locality Sensitive Hashing (LSH)

    Algorithmic technique for approximate nearest neighbor search in high-dimensional spaces using hash functions to map similar items to the same buckets with high probability.

    RaBitQ

    RaBitQ is an open-source library implementing the "Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor Search" method, providing vector quantization and compression techniques designed to improve efficiency and accuracy of ANN search engines and vector databases operating in high-dimensional spaces.

    vsag

    vsag is an Alibaba open-source library implementing efficient vector search algorithms, including approximate nearest neighbor search for high-dimensional vectors.

    Decorative pattern
    Built with
    Ever Works
    Ever Works

    Connect with us

    Stay Updated

    Get the latest updates and exclusive content delivered to your inbox.

    Product

    • Categories
    • Tags
    • Pricing
    • Help

    Clients

    • Sign In
    • Register
    • Forgot password?

    Company

    • About Us
    • Admin
    • Sitemap

    Resources

    • Blog
    • Submit
    • API Documentation
    All product names, logos, and brands are the property of their respective owners. All company, product, and service names used in this repository, related repositories, and associated websites are for identification purposes only. The use of these names, logos, and brands does not imply endorsement, affiliation, or sponsorship. This directory may include content generated by artificial intelligence.
    Copyright © 2025 Awesome Vector Databases. All rights reserved.·Terms of Service·Privacy Policy·Cookies