• Home
  • Categories
  • Tags
  • Pricing
  • Submit
    Decorative pattern
    1. Home
    2. Research Papers & Surveys
    3. MCGI

    MCGI

    Manifold-Consistent Graph Indexing for billion-scale disk-resident vector search. Leverages Local Intrinsic Dimensionality to achieve 5.8x throughput improvement over DiskANN on high-dimensional datasets.

    🌐Visit Website

    About this tool

    Overview

    MCGI (Manifold-Consistent Graph Indexing) is a geometry-aware and disk-resident indexing method that leverages Local Intrinsic Dimensionality (LID) to dynamically adapt search strategies to the data's intrinsic geometry. Published in January 2026.

    Core Problem

    Graph-based Approximate Nearest Neighbor (ANN) search often suffers from performance degradation in high-dimensional spaces due to the "Euclidean-Geodesic mismatch," where greedy routing diverges from the underlying data manifold.

    Key Innovation

    Unlike standard algorithms that treat dimensions uniformly, MCGI modulates its beam search budget based on in situ geometric analysis, eliminating dependency on static hyperparameters.

    Performance Results

    High-dimensional datasets (GIST1M, 960 dimensions)

    • Achieves 5.8× higher throughput at 95% recall compared to state-of-the-art DiskANN
    • Reaches 375 QPS through manifold-aware routing that minimizes strictly necessary disk reads

    Billion-scale datasets (SIFT1B)

    • Reduces high-recall query latency by 3× on billion-scale datasets
    • Maintains performance parity on standard lower-dimensional datasets

    Theoretical Foundation

    Theoretical analysis confirms that MCGI enables improved approximation guarantees by preserving manifold-consistent topological connectivity.

    Research Impact

    MCGI addresses the critical challenge of maintaining search quality in high-dimensional spaces while operating efficiently from disk storage, making it suitable for billion-scale vector search applications.

    Surveys

    Loading more......

    Information

    Websitearxiv.org
    PublishedMar 8, 2026

    Categories

    1 Item
    Research Papers & Surveys

    Tags

    3 Items
    #Ann#Research#Disk Based

    Similar Products

    6 result(s)
    BatANN

    Distributed disk-based approximate nearest neighbor system achieving near-linear throughput scaling. Delivers 6.21-6.49x throughput improvement over scatter-gather baseline with sub-6ms latency on 10 servers.

    SPFresh

    Incremental in-place update system for billion-scale vector search from Microsoft Research. Maintains 2.41x lower P99.9 latency than baselines while supporting efficient vector updates with minimal resource overhead.

    BANG

    BANG is a billion-scale approximate nearest neighbor search system optimized for single GPU execution, enabling high-performance vector search in vector database environments at massive scale.

    Cagra

    Cagra provides highly parallel graph construction and approximate nearest neighbor search for GPUs, supporting large-scale vector database operations and efficient similarity search.

    LANNS: a web-scale approximate nearest neighbor lookup system

    A scalable system for approximate nearest neighbor search at web-scale, relevant for implementing and understanding vector database infrastructure for high-dimensional data.

    Efficient and Robust Approximate Nearest Neighbor Search Using Hierarchical Navigable Small World Graphs

    This paper introduces the HNSW algorithm, which is widely adopted in vector databases and search engines for its efficient and robust performance on high-dimensional data. HNSW is foundational in powering modern vector search systems.

    Decorative pattern
    Built with
    Ever Works
    Ever Works

    Connect with us

    Stay Updated

    Get the latest updates and exclusive content delivered to your inbox.

    Product

    • Categories
    • Tags
    • Pricing
    • Help

    Clients

    • Sign In
    • Register
    • Forgot password?

    Company

    • About Us
    • Admin
    • Sitemap

    Resources

    • Blog
    • Submit
    • API Documentation
    All product names, logos, and brands are the property of their respective owners. All company, product, and service names used in this repository, related repositories, and associated websites are for identification purposes only. The use of these names, logos, and brands does not imply endorsement, affiliation, or sponsorship. This directory may include content generated by artificial intelligence.
    Copyright © 2025 Awesome Vector Databases. All rights reserved.·Terms of Service·Privacy Policy·Cookies