
Product Quantization (PQ)
Vector compression technique that splits high-dimensional vectors into subvectors and quantizes each independently, achieving significant memory reduction while enabling approximate similarity search.
About this tool
Overview
Product Quantization (PQ) is a vector compression technique that splits high-dimensional vectors into subvectors and quantizes each subvector independently. This achieves significant memory reduction (often 32x or more) while enabling approximate similarity search.
How Product Quantization Works
Compression Process
- Split: Divide each d-dimensional vector into m subvectors
- Learn Codebooks: Train a codebook (lookup table) for each subvector using k-means
- Quantize: Replace each subvector with its nearest codebook entry's index
- Store: Store only the compact codes instead of full vectors
Search Process
- Quantize query vector using same splitting
- Pre-compute distances between query subvectors and all codebook entries
- Approximate full vector distances using lookup table
- Return top-k results
Memory Reduction
Typical compression:
- Original: 768 dimensions × 4 bytes = 3,072 bytes per vector
- PQ (m=96, k=256): 96 bytes per vector
- Compression ratio: ~32x
Variants
IVF-PQ
Combines Inverted File clustering with Product Quantization for both speed and compression.
OPQ (Optimized Product Quantization)
Applies a learned rotation before quantization to reduce quantization error.
Additive Quantization
Uses sum of multiple codebook entries for better accuracy.
Trade-offs
Advantages:
- Significant memory reduction
- Faster similarity computation
- Enables larger datasets in memory
Disadvantages:
- Loss of accuracy (quantization error)
- Requires training phase
- Not suitable for exact search
Configuration Parameters
- m: Number of subvectors (segments)
- nbits: Bits per code (determines codebook size: k=2^nbits)
Use Cases
- Large-scale vector search (billions of vectors)
- Memory-constrained environments
- When some accuracy loss is acceptable
- Reducing infrastructure costs
Pricing
Implemented in open-source libraries (FAISS, ScaNN, etc.)
Loading more......
Information
Categories
Tags
Similar Products
6 result(s)