

Multi-vector retrieval architecture where queries and documents are represented by multiple vectors enabling fine-grained matching and improved retrieval quality through late interaction scoring.
ColBERT (Contextualized Late Interaction over BERT) represents documents and queries as collections of vectors (one per token), enabling fine-grained matching through late interaction.
Traditional Dense Retrieval:
ColBERT:
Concept: Defer interaction between query and document vectors until search time.
Process:
Formula:
Score(Q, D) = Σ max(Q_i · D_j) for all query tokens i
Fine-Grained Matching:
Improved Quality:
Interpretable:
Pros:
Cons:
Best For:
Not Ideal For:
RAGatouille:
ColBERTv2:
Jina ColBERT:
Native Support:
Workarounds:
Compression:
Indexing:
Hybrid:
Storage:
Query Speed:
Quality:
Yes, if:
No, if:
Loading more......