

Late interaction architecture for efficient and effective passage search. Encodes queries and documents independently using BERT, then computes token-level similarity via the MaxSim operator for strong generalization.
ColBERT introduces a late interaction architecture that independently encodes the query and the document using BERT and then employs a cheap yet powerful interaction step that models their fine-grained similarity.
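Late interaction operates at the token level: each query token embedding is compared against every document token embedding, the maximum similarity per query token is kept (MaxSim), and these maxima are summed to produce the document's relevance score.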
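A minimal sketch of token-level MaxSim scoring, in plain Python. The function names and the toy 2-dimensional vectors are illustrative assumptions, not ColBERT's actual code; in the real system the vectors come from BERT and are typically normalized, so the dot product acts as cosine similarity.

```python
from typing import List

Vector = List[float]


def dot(a: Vector, b: Vector) -> float:
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def maxsim_score(query_vecs: List[Vector], doc_vecs: List[Vector]) -> float:
    """Sum, over query tokens, of the best similarity to any document token."""
    if not doc_vecs:
        return 0.0  # assumption: an empty document scores zero
    return sum(max(dot(q, d) for d in doc_vecs) for q in query_vecs)


# Toy embeddings (hypothetical): two query tokens, two candidate documents.
query = [[1.0, 0.0], [0.0, 1.0]]
doc_a = [[1.0, 0.0], [0.0, 1.0], [0.6, 0.8]]  # has an exact match for each query token
doc_b = [[0.6, 0.8]]                          # one token, partial match for both

score_a = maxsim_score(query, doc_a)
score_b = maxsim_score(query, doc_b)
print(score_a, score_b)
```

Because the query and document vectors are produced independently, document vectors can be computed and stored ahead of time; only this inexpensive scoring step runs at query time.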
The multi-vector approach stores one vector per token rather than one per document, so it requires significantly more data than single-vector methods.
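A rough back-of-the-envelope comparison of index sizes makes the storage gap concrete. Every number below (corpus size, tokens per passage, embedding dimension, bytes per value) is an illustrative assumption, not a measurement of any real ColBERT index, and the estimate ignores compression and index overhead.

```python
def index_bytes(num_docs: int, vectors_per_doc: int, dim: int, bytes_per_value: int) -> int:
    """Raw size of the stored embeddings, ignoring compression and metadata."""
    return num_docs * vectors_per_doc * dim * bytes_per_value


# Hypothetical corpus: 1M passages, ~100 tokens each, 128-dim float16 vectors.
NUM_DOCS = 1_000_000
TOKENS_PER_DOC = 100
DIM = 128
BYTES_PER_VALUE = 2  # float16

multi_vector = index_bytes(NUM_DOCS, TOKENS_PER_DOC, DIM, BYTES_PER_VALUE)
single_vector = index_bytes(NUM_DOCS, 1, DIM, BYTES_PER_VALUE)

print(f"multi-vector:  {multi_vector / 1e9:.1f} GB")
print(f"single-vector: {single_vector / 1e9:.3f} GB")
print(f"ratio: {multi_vector // single_vector}x")
```

With the dimension and precision held equal, the ratio is simply the average number of token vectors kept per document.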
Pioneered modern multi-vector retrieval methods. The First Workshop on Late Interaction and Multi Vector Retrieval is scheduled for ECIR 2026, reflecting growing research interest in this approach.
Open-source on GitHub with active research community.