• Home
  • Categories
  • Tags
  • Pricing
  • Submit
    Decorative pattern
    1. Home
    2. Concepts & Definitions
    3. Locality Sensitive Hashing (LSH)

    Locality Sensitive Hashing (LSH)

    Algorithmic technique for approximate nearest neighbor search in high-dimensional spaces using hash functions to map similar items to the same buckets with high probability.

    🌐Visit Website

    About this tool

    Overview

    Locality-Sensitive Hashing (LSH) is an algorithmic technique that hashes similar input items into the same "buckets" with high probability (the number of buckets being much smaller than the universe of possible input items).

    How LSH Works

    LSH uses hash functions that map similar items to the same hash bucket with high probability, unlike traditional hash functions that try to avoid collisions. This property makes it particularly useful for approximate nearest neighbor search in high-dimensional spaces.

    Implementation Libraries

    Python Libraries

    1. FALCONN - Based on newer NIPS'15 paper ideas (2015-2016)
    2. E2LSH - Older version of LSH available as alpha-version
    3. LSHash by Kay Zhu - Fast hash calculation for large amounts of high dimensional data through numpy arrays
    4. SparseLSH - Emphasis on large, highly-dimensional datasets with sparse matrices
    5. mattilyra/LSH - Uses MurmurHash v3 library with Cython for performance

    Java Library

    java-LSH - Implements LSH as described in Leskovec, Rajaraman & Ullman (2014), "Mining of Massive Datasets"

    C++ Library

    LSHKIT - A C++ Locality Sensitive Hashing Library

    Production Alternatives

    For production use, libraries built for similarity search like FAISS or managed solutions like Pinecone are recommended instead of basic LSH implementations.

    Use Cases

    • Approximate nearest neighbor search
    • Duplicate detection
    • Image similarity search
    • Document deduplication
    • Recommendation systems

    Pricing

    Various open-source implementations available.

    Surveys

    Loading more......

    Information

    Websitewww.mit.edu
    PublishedMar 13, 2026

    Categories

    1 Item
    Concepts & Definitions

    Tags

    3 Items
    #Hashing#Ann#Algorithm

    Similar Products

    6 result(s)
    HNSW (Hierarchical Navigable Small World)

    Graph-based algorithm for approximate nearest neighbor search that maintains multi-layer graph structures for efficient vector similarity search with logarithmic complexity, widely used in modern vector databases.

    IVF

    Inverted File Index vector search algorithm that partitions high-dimensional vectors into clusters using k-means, enabling efficient nearest neighbor search by restricting searches to relevant clusters and dramatically reducing search space.

    Approximate Nearest Neighbors (ANN)

    Family of algorithms trading perfect accuracy for speed in high-dimensional similarity search. Enables sub-linear query time with 90%+ recall on billion-scale datasets.

    IVF (Inverted File Index)

    Clustering-based approximate nearest neighbor algorithm that partitions vector space into Voronoi cells. Fast search through coarse-to-fine strategy, often combined with Product Quantization (IVF-PQ).

    IVF-FLAT Index

    Inverted File Index with flat vectors using K-means clustering to partition high-dimensional space into regions, enhancing search efficiency by narrowing search area through neighbor partitions.

    NSW (Navigable Small World)

    Graph-based algorithm for approximate nearest neighbor search where vertices represent vectors and edges are constructed heuristically. Foundation for HNSW with (poly/)logarithmic search complexity using greedy routing.

    Decorative pattern
    Built with
    Ever Works
    Ever Works

    Connect with us

    Stay Updated

    Get the latest updates and exclusive content delivered to your inbox.

    Product

    • Categories
    • Tags
    • Pricing
    • Help

    Clients

    • Sign In
    • Register
    • Forgot password?

    Company

    • About Us
    • Admin
    • Sitemap

    Resources

    • Blog
    • Submit
    • API Documentation
    All product names, logos, and brands are the property of their respective owners. All company, product, and service names used in this repository, related repositories, and associated websites are for identification purposes only. The use of these names, logos, and brands does not imply endorsement, affiliation, or sponsorship. This directory may include content generated by artificial intelligence.
    Copyright © 2025 Awesome Vector Databases. All rights reserved.·Terms of Service·Privacy Policy·Cookies