

Multi-Vector Retrieval Algorithm that reduces multi-vector similarity search to single-vector similarity search via Fixed Dimensional Encodings. Achieves 10% improved recall with 90% lower latency compared to existing approaches.
Loading more......
MUVERA (MUlti-VEctor Retrieval Algorithm) is a retrieval mechanism which reduces multi-vector similarity search to single-vector similarity search. MUVERA's innovation is to take whole groups of multi-vectors and compress them into a single, easier-to-handle vector called a Fixed Dimensional Encoding (FDE).
MUVERA asymmetrically generates Fixed Dimensional Encodings (FDEs) of queries and documents, which are vectors whose inner product approximates multi-vector similarity. FDEs give high-quality ε-approximations, thus providing the first single-vector proxy for multi-vector similarity with theoretical guarantees.
A key advantage of MUVERA is that the FDE transformation is data-oblivious, meaning it doesn't depend on the specific dataset. This makes it:
MUVERA provides theoretical guarantees for approximation quality, making it the first single-vector proxy for multi-vector similarity with such guarantees.
The research was presented at NeurIPS 2024 and has been implemented in production systems. Weaviate added MUVERA support in version 1.31, demonstrating its practical value for real-world vector search applications.
Free research from Google, with open implementations available.