• Home
  • Categories
  • Tags
  • Pricing
  • Submit
    Decorative pattern
    1. Home
    2. Machine Learning Models
    3. BGE-VL

    BGE-VL

    State-of-the-art multimodal embedding model from BAAI supporting text-to-image, image-to-text, and compositional visual search. Trained on the MegaPairs dataset with over 26 million retrieval triplets.

    🌐Visit Website

    About this tool

    Overview

    BGE-VL is a State-Of-The-Art multimodal embedding model that supports any visual search applications, including text-to-image, image-to-text, image&prompt-to-image, text-to-image&text, and more. Released by BAAI in March 2025.

    Model Variants

    Based on the MegaPairs dataset, the BAAI BGE team trained three multi-modal retrieval models:

    • BGE-VL-Base
    • BGE-VL-Large
    • BGE-VL-MLLM

    Performance

    • Achieved optimal performance in 36 multi-modal embedding evaluation tasks of the Massive Multimodal Embedding Benchmark (MMEB)
    • Excels in both zero-shot and supervised fine-tuning scenarios
    • BGE-VL-MLLM-S1 shows 8.1% improvement on CIRCO benchmark (mAP@5) over previous state-of-the-art
    • Sets new benchmark for compositional image retrieval tasks

    MegaPairs Dataset

    BAAI released MegaPairs, a massive synthetic dataset containing over 26 million multimodal retrieval instruction-tuning triplets that powers BGE-VL.

    License

    Released under MIT license - completely free for both academic and commercial use.

    Availability

    Models and documentation available on GitHub, Hugging Face, and the official BGE documentation site.

    Surveys

    Loading more......

    Information

    Websitegithub.com
    PublishedMar 8, 2026

    Categories

    1 Item
    Machine Learning Models

    Tags

    3 Items
    #Multimodal#Open Source#Visual Search

    Similar Products

    6 result(s)
    Jina Embeddings v4
    Featured

    Universal multimodal embedding model from Jina AI supporting text and images through unified pathway. Built on Qwen2.5-VL-3B-Instruct, outperforms proprietary models on visually rich document retrieval. This is a commercial API with free tier, though OSS weights available.

    DocArray

    An open-source library for creating, storing, and searching multimodal data and vector embeddings, supporting AI and ML workflows.

    Deep Lake

    Deep Lake is a vector database designed as a data lake for AI, capable of storing and managing vector embeddings, text, images, and videos. It utilizes a tensor format for efficient querying and integration with AI algorithms, making it suitable for similarity search and machine learning workflows. It is open-source and tailored for handling unstructured and multimodal data, with seamless integration with frameworks like PyTorch and TensorFlow.

    ColBERT
    Featured

    Late interaction architecture for efficient and effective passage search. Encodes queries and documents independently using BERT, then performs token-level similarity via maxsim operator for strong generalization.

    Qwen3 Embedding
    Featured

    Multilingual embedding model supporting over 100 languages and ranking #1 on MTEB multilingual leaderboard. Offers flexible model sizes from 0.6B to 8B parameters with user-defined instructions.

    Nomic Embed Text
    Featured

    First fully reproducible open-source text embedding model with 8,192 context length. v2 introduces Mixture-of-Experts architecture for multilingual embeddings. Outperforms OpenAI models on benchmarks. This is an OSS model under Apache 2.0 license.

    Decorative pattern
    Built with
    Ever Works
    Ever Works

    Connect with us

    Stay Updated

    Get the latest updates and exclusive content delivered to your inbox.

    Product

    • Categories
    • Tags
    • Pricing
    • Help

    Clients

    • Sign In
    • Register
    • Forgot password?

    Company

    • About Us
    • Admin
    • Sitemap

    Resources

    • Blog
    • Submit
    • API Documentation
    All product names, logos, and brands are the property of their respective owners. All company, product, and service names used in this repository, related repositories, and associated websites are for identification purposes only. The use of these names, logos, and brands does not imply endorsement, affiliation, or sponsorship. This directory may include content generated by artificial intelligence.
    Copyright © 2025 Awesome Vector Databases. All rights reserved.·Terms of Service·Privacy Policy·Cookies