PQ (Product Quantization)

Product Quantization is a compression and indexing technique for vector search that splits vectors into subspaces and quantizes each part separately, allowing vector databases to store large-scale embeddings compactly while supporting efficient ANN search.

🌐Visit Website

About this tool

title: "PQ (Product Quantization)" slug: "pq-product-quantization" brand: "faiss" brand_logo_url: "https://faiss.ai/images/faiss_logo_black.svg" category: "concepts-definitions" featured: false tags:

quantization
ann
vector-compression images:
"https://faiss.ai/images/pq.png" source_url: "https://cybergarden.au/blog/5-powerful-vector-database-tools-2025"

Overview

PQ (Product Quantization) is a vector compression and indexing technique commonly used in vector databases and similarity search libraries (such as Faiss). It enables efficient approximate nearest neighbor (ANN) search by splitting high-dimensional vectors into multiple subspaces and quantizing each subspace separately. This allows large-scale embedding collections to be stored compactly while still supporting fast semantic search.

Key Idea

Instead of storing full-precision vectors, Product Quantization:

Divides each original vector into smaller, disjoint sub-vectors (subspaces).
Learns a separate codebook (set of centroids) for each subspace.
Represents each sub-vector by the index of its nearest centroid in the corresponding codebook.

The resulting compressed representation significantly reduces memory usage while preserving enough structure for efficient ANN search.

Features

Vector splitting into subspaces
High-dimensional vectors are partitioned into multiple lower-dimensional blocks, enabling independent quantization of each part.
Subspace quantization (codebooks)
Each subspace has its own learned codebook of centroids; sub-vectors are approximated by the nearest centroid index.
Compact embedding storage
Full-precision floats are replaced with short integer codes, reducing RAM and disk footprint for large collections of embeddings.
Efficient approximate nearest neighbor (ANN) search
Distance computations are performed in the compressed space using precomputed lookup tables, enabling fast similarity search over millions or billions of vectors.
Speed–accuracy trade-off control
The number of subspaces, centroids per subspace, and code size can be tuned to balance search accuracy against memory usage and query latency.
Compatibility with vector indexes
Often combined with other ANN indexing techniques (e.g., IVF, HNSW in systems like Faiss) to further improve search performance at scale.
Scalability to large datasets
Designed to support very large vector collections by minimizing per-vector storage, making it practical to keep massive embedding sets in memory or on fast storage.
Support for semantic search use cases
Well-suited for applications that rely on embedding similarity (e.g., text, image, or multimodal search) where exact nearest neighbor search would be too expensive.

Use Cases

Large-scale semantic search over text, images, or documents.
Vector databases powering LLM-based retrieval, RAG, and recommendation.
Any scenario requiring memory-efficient storage of high-dimensional embeddings with fast ANN queries.

Pricing

Product Quantization (PQ) is a technique/concept, not a standalone commercial product, so there is no pricing model associated with it directly. Pricing depends on the specific database or library (e.g., Faiss or a managed vector database) in which PQ is implemented.

Surveys

Loading more......

Information

Websitecybergarden.au

PublishedDec 31, 2025

Tags

3 Items

#Quantization #Ann #vector compression

Similar Products

6 result(s)

BBQ Binary Quantization

Elasticsearch and Lucene's implementation of RaBitQ algorithm for 1-bit vector quantization, renamed as BBQ. Provides 32x compression with asymptotically optimal error bounds, enabling efficient vector search at massive scale with minimal accuracy loss.

Locally-Adaptive Vector Quantization

Advanced quantization technique that applies per-vector normalization and scalar quantization, adapting the quantization bounds individually for each vector. Achieves four-fold reduction in vector size while maintaining search accuracy with 26-37% overall memory footprint reduction.

Anisotropic Vector Quantization

An advanced quantization technique introduced by Google's ScaNN that prioritizes preserving parallel components between vectors rather than minimizing overall distance. Optimized for Maximum Inner Product Search (MIPS) and significantly improves retrieval accuracy.

HNSW (Hierarchical Navigable Small World)

Graph-based algorithm for approximate nearest neighbor search that maintains multi-layer graph structures for efficient vector similarity search with logarithmic complexity, widely used in modern vector databases.

Binary Quantization

Extreme vector compression technique converting each dimension to a single bit (0 or 1), achieving 32x memory reduction and enabling ultra-fast Hamming distance calculations with acceptable accuracy trade-offs.

Locality Sensitive Hashing (LSH)

Algorithmic technique for approximate nearest neighbor search in high-dimensional spaces using hash functions to map similar items to the same buckets with high probability.

PQ (Product Quantization)

🌐Visit Website

About this tool

title: "PQ (Product Quantization)" slug: "pq-product-quantization" brand: "faiss" brand_logo_url: "https://faiss.ai/images/faiss_logo_black.svg" category: "concepts-definitions" featured: false tags:

quantization
ann
vector-compression images:
"https://faiss.ai/images/pq.png" source_url: "https://cybergarden.au/blog/5-powerful-vector-database-tools-2025"

Overview

Key Idea

Instead of storing full-precision vectors, Product Quantization:

Divides each original vector into smaller, disjoint sub-vectors (subspaces).
Learns a separate codebook (set of centroids) for each subspace.
Represents each sub-vector by the index of its nearest centroid in the corresponding codebook.

The resulting compressed representation significantly reduces memory usage while preserving enough structure for efficient ANN search.

Features

Vector splitting into subspaces
High-dimensional vectors are partitioned into multiple lower-dimensional blocks, enabling independent quantization of each part.
Subspace quantization (codebooks)
Each subspace has its own learned codebook of centroids; sub-vectors are approximated by the nearest centroid index.
Compact embedding storage
Full-precision floats are replaced with short integer codes, reducing RAM and disk footprint for large collections of embeddings.
Efficient approximate nearest neighbor (ANN) search
Distance computations are performed in the compressed space using precomputed lookup tables, enabling fast similarity search over millions or billions of vectors.
Speed–accuracy trade-off control
The number of subspaces, centroids per subspace, and code size can be tuned to balance search accuracy against memory usage and query latency.
Compatibility with vector indexes
Often combined with other ANN indexing techniques (e.g., IVF, HNSW in systems like Faiss) to further improve search performance at scale.
Scalability to large datasets
Designed to support very large vector collections by minimizing per-vector storage, making it practical to keep massive embedding sets in memory or on fast storage.
Support for semantic search use cases
Well-suited for applications that rely on embedding similarity (e.g., text, image, or multimodal search) where exact nearest neighbor search would be too expensive.

Use Cases

Large-scale semantic search over text, images, or documents.
Vector databases powering LLM-based retrieval, RAG, and recommendation.
Any scenario requiring memory-efficient storage of high-dimensional embeddings with fast ANN queries.

Pricing

Surveys

Loading more......

Information

Websitecybergarden.au

PublishedDec 31, 2025

PQ (Product Quantization)

About this tool

Overview

Key Idea

Features

Use Cases

Pricing

Information

Categories

Tags

Similar Products

Connect with us

Stay Updated

Product

Clients

Company

Resources

PQ (Product Quantization)

About this tool

Overview

Key Idea

Features

Use Cases

Pricing

Information

Categories

Tags

Similar Products

Connect with us

Stay Updated

Product

Clients

Company

Resources