    RUMMY

    A GPU-accelerated vector query processing system that supports large vector datasets beyond GPU memory. RUMMY uses reordered pipelining to overlap data transmission with GPU computation, achieving up to 135× better performance than IVF-GPU with CUDA unified memory.

    About this tool

    Overview

    RUMMY is the first GPU-accelerated vector query processing system that achieves high performance and supports large vector datasets beyond GPU memory. Developed by researchers from Peking University, the system was presented at USENIX NSDI '24.

    Key Features

    • Reordered Pipelining: Exploits the characteristics of vector query processing to pipeline data transmission from host memory to GPU memory with query processing on the GPU
    • Cluster-Based Retrofitting: Eliminates redundant data transmission across queries in a batch
    • Dynamic Kernel Padding: Uses cluster balancing to maximize spatial and temporal GPU utilization during computation
    • Query-Aware Optimization: Reorders and groups queries to optimally overlap transmission and computation
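
    As a rough illustration of how grouping by cluster can remove redundant transfers within a batch, the sketch below counts host-to-GPU cluster transfers in two cases: each query fetches its own clusters, or queries are grouped so that each cluster is sent once. It assumes an IVF-style index in which every query probes a handful of clusters. The function names and probe lists are hypothetical, and this is not RUMMY's actual implementation.

```python
from collections import defaultdict
from typing import Dict, List, Tuple


def group_queries_by_cluster(probe_lists: List[List[int]]) -> Dict[int, List[int]]:
    """Map each cluster id to the ids of the queries in the batch that probe it."""
    cluster_to_queries: Dict[int, List[int]] = defaultdict(list)
    for query_id, clusters in enumerate(probe_lists):
        for cluster_id in set(clusters):  # ignore duplicate probes within a query
            cluster_to_queries[cluster_id].append(query_id)
    return dict(cluster_to_queries)


def transfer_counts(probe_lists: List[List[int]]) -> Tuple[int, int]:
    """Return (transfers when processing query by query, transfers when grouped by cluster)."""
    per_query = sum(len(set(clusters)) for clusters in probe_lists)
    per_cluster = len(group_queries_by_cluster(probe_lists))
    return per_query, per_cluster


# Hypothetical batch: three queries, each probing three clusters.
batch = [[0, 2, 5], [2, 5, 7], [0, 5, 9]]
naive, grouped = transfer_counts(batch)
print(naive, grouped)  # prints "9 5"
```

    In this toy batch, cluster 5 is probed by all three queries, so sending it once and running all three queries against it saves two transfers on that cluster alone.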

    Performance

    • Outperforms IVF-GPU with CUDA unified memory by up to 135×
    • Achieves up to 23.1× better performance than CPU-based solutions running on 64 vCPUs
    • Up to 37.7× more cost-effective than CPU implementations

    Use Cases

    • Billion-scale vector similarity search
    • Maximum inner product search (MIPS)
    • Large-scale semantic search applications
    • GPU-accelerated RAG systems

    Technical Architecture

    RUMMY addresses the challenge of processing vector queries on datasets that exceed GPU memory capacity. The core innovation is a reordered pipelining technique that combines three ideas to achieve high performance with limited GPU memory: cluster-based retrofitting, dynamic kernel padding with cluster balancing, and query-aware reordering and grouping.
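
    To make the benefit of overlapping transmission with computation concrete, the sketch below models each cluster as a two-stage job (transfer, then compute) and compares total time when the stages run back to back, when they are pipelined, and when the jobs are reordered before pipelining. The reordering uses Johnson's rule, a classic two-stage scheduling heuristic substituted here purely for illustration; it is not RUMMY's reordering policy. The timings are made-up numbers, and the model assumes GPU memory can buffer every transferred cluster, which a real system with limited GPU memory cannot.

```python
from typing import List, Tuple

# (transfer_ms, compute_ms) for one cluster; all values are hypothetical.
Job = Tuple[float, float]


def sequential_makespan(jobs: List[Job]) -> float:
    """Total time when each transfer must finish before its compute, with no overlap."""
    return sum(transfer + compute for transfer, compute in jobs)


def pipelined_makespan(jobs: List[Job]) -> float:
    """Total time when the next transfer overlaps with the current compute."""
    transfer_done = 0.0
    compute_done = 0.0
    for transfer, compute in jobs:
        transfer_done += transfer
        # Compute starts once the data has arrived and the GPU is free.
        compute_done = max(compute_done, transfer_done) + compute
    return compute_done


def johnson_order(jobs: List[Job]) -> List[Job]:
    """Order jobs by Johnson's rule for a two-stage pipeline."""
    front = sorted((j for j in jobs if j[0] <= j[1]), key=lambda j: j[0])
    back = sorted((j for j in jobs if j[0] > j[1]), key=lambda j: j[1], reverse=True)
    return front + back


jobs = [(4.0, 1.0), (1.0, 3.0), (3.0, 2.0), (2.0, 4.0)]
print(sequential_makespan(jobs))                # prints 20.0
print(pipelined_makespan(jobs))                 # prints 14.0
print(pipelined_makespan(johnson_order(jobs)))  # prints 11.0
```

    In this toy setting, pipelining alone cuts the total from 20 ms to 14 ms, and reordering brings it to 11 ms by starting with clusters that transfer quickly, so the GPU has work while slower transfers are in flight.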

    Availability

    RUMMY is open-source and available on GitHub at pkusys/Rummy.

    Information

    Website: github.com
    Published: Mar 20, 2026

    Categories

    • Vector Database Engines

    Tags

    • GPU Acceleration
    • High Performance
    • Scalable

    Similar Products

    FusionANNS

    An efficient CPU/GPU cooperative processing architecture for billion-scale approximate nearest neighbor search. FusionANNS achieves up to 13.1× higher QPS than SPANN and can serve billion-vector datasets at over 12,000 QPS while maintaining 15 ms latency using only one entry-level GPU.

    KDB

    KDB is a high-performance vector database supporting billion-scale vector search, with features aimed at enterprises needing large-scale vector storage and retrieval.

    Milvus

    Milvus is a mature, open-source vector database maintained by Zilliz, supporting large-scale similarity search with multiple indexing strategies and GPU acceleration. It includes variants such as Milvus Lite (lightweight version), Milvus Standalone (single-machine deployment), and Milvus Distributed (Kubernetes-based deployment for large scale).

    Breaking the Storage-Compute Bottleneck in Billion-Scale ANNS

    A 2025 research paper presenting a GPU-driven asynchronous I/O framework for billion-scale approximate nearest neighbor search. The system addresses the fundamental bottleneck of data movement between storage and compute in large-scale vector search.

    PilotANN

    PilotANN is a memory-bounded, GPU-accelerated framework for large-scale vector search, designed to improve the performance and efficiency of approximate nearest neighbor (ANN) search workloads in vector databases and vector search systems.

    BANG

    BANG is a billion-scale approximate nearest neighbor search system optimized for execution on a single GPU, enabling high-performance vector search at massive scale.
