



A visual document retrieval model based on Qwen2-VL-2B that generates ColBERT-style multi-vector representations, treating documents as images to capture layout, tables, charts, and visual elements without requiring OCR or text extraction.
Loading more......
ColQwen2 is a late-interaction visual retrieval model that extends the ColPali architecture using the Qwen2-VL backbone. It treats documents as images to capture not only text but also layout, tables, charts, and other visual elements.
Instead of relying on OCR and text extraction, ColQwen2:
Open-source under Apache 2.0 and MIT licenses. Free for both academic and commercial use.