    Jina-CLIP v2

    A 0.9B multimodal embedding model with multilingual support for 89 languages, 512x512 image resolution, and Matryoshka representations that enable dimensional flexibility from 1024 down to 64 dimensions while maintaining strong performance.

    About this tool

    Overview

    Jina-CLIP v2 is a multimodal embedding model that maps text and images into a shared embedding space. Compared with v1, it adds broader multilingual coverage and higher-resolution image processing.

    Architecture

    The model combines two specialized encoders:

    • Text Encoder: Jina XLM-RoBERTa (561M parameters)
    • Vision Encoder: EVA02-L14 (304M parameters)
    • Total Parameters: 865M
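
    As a quick sanity check on the figures above, the sketch below sums the two encoder sizes and shows how 865M rounds to the "0.9B" quoted in the summary. The dictionary and variable names are illustrative only and are not part of any Jina API.

```python
# Parameter counts in millions, taken from the architecture list above.
ENCODER_PARAMS_M = {
    "text (Jina XLM-RoBERTa)": 561,
    "vision (EVA02-L14)": 304,
}

total_m = sum(ENCODER_PARAMS_M.values())  # combined size in millions
total_b = total_m / 1000                  # same figure in billions

print(f"{total_m}M parameters, about {total_b:.1f}B")
```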

    Key Features

    • Multilingual Support: Supports 89 languages for text-image retrieval with up to 4% improvement over comparable models
    • High Resolution: Processes 512x512 images, a significant upgrade from v1's 224x224 resolution
    • Matryoshka Representations: Output embeddings can be truncated from 1024 down to 64 dimensions; a 75% reduction (to 256 dimensions) retains over 99% of full-dimension performance
    • State-of-the-Art Performance: Achieves 98.0% accuracy on Flickr30k image-to-text retrieval
    • Flexible Deployment: Available via Jina Embeddings API, AWS, Azure, and GCP

    Performance

    Even an aggressive 75% dimensional reduction (from 1024 to 256 dimensions) maintains over 99% of performance across text, image, and cross-modal tasks. The model also shows a 3% performance improvement over v1 in both text-image and text-text retrieval tasks.
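
    Matryoshka-style truncation is usually applied on the client side by keeping the leading components of the vector and re-normalizing. The following is a minimal sketch of that idea in plain Python. The `truncate_embedding` helper and the stand-in vector are hypothetical; real embeddings would come from the model or the API.

```python
import math

def truncate_embedding(vec, dim):
    """Keep the first `dim` components and L2-normalize the result."""
    if not 0 < dim <= len(vec):
        raise ValueError("dim must be between 1 and len(vec)")
    head = vec[:dim]
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        return list(head)  # nothing to normalize
    return [x / norm for x in head]

# Stand-in 1024-dimensional vector (made-up values, not real model output).
full = [math.sin(i + 1) for i in range(1024)]

# 1024 -> 256 is the 75% reduction discussed above.
small = truncate_embedding(full, 256)
print(len(small))
```

    Re-normalizing after truncation keeps cosine similarity and dot product interchangeable, which matters if the downstream vector index assumes unit-length vectors.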

    Use Cases

    • Cross-modal search (text-to-image, image-to-text)
    • Multilingual image retrieval
    • Visual question answering
    • Content-based recommendation systems
    • Multimodal RAG applications
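
    The cross-modal search use case reduces to ranking image embeddings by similarity to a text embedding in the same space. The sketch below illustrates this with cosine similarity over toy 3-dimensional vectors. The file names, vectors, and helper functions are invented for illustration; in practice both sides would be 1024-dimensional (or truncated) embeddings produced by the model.

```python
import math

def cosine(a, b):
    """Cosine similarity between two non-zero vectors of equal length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def rank_images(query_vec, image_vecs):
    """Return image ids sorted by descending similarity to the query."""
    scored = [(cosine(query_vec, vec), image_id)
              for image_id, vec in image_vecs.items()]
    scored.sort(reverse=True)
    return [image_id for _, image_id in scored]

# Toy image index (hypothetical embeddings).
image_index = {
    "beach.jpg": [0.9, 0.1, 0.0],
    "forest.jpg": [0.1, 0.9, 0.1],
    "city.jpg": [0.0, 0.2, 0.9],
}

# Toy embedding for a text query such as "sunny coastline".
text_query = [0.8, 0.2, 0.1]

ranking = rank_images(text_query, image_index)
print(ranking)
```

    Image-to-text retrieval works the same way with the roles swapped: embed the image as the query and rank stored text embeddings against it.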

    Pricing

    Available through Jina Embeddings API with commercial licensing. Also available on cloud marketplaces (AWS, Azure, GCP) with usage-based pricing.

    Information

    Website: jina.ai
    Published: Mar 20, 2026

    Categories

    Machine Learning Models

    Tags

    #Multimodal #Multilingual #Embedding Model

    Similar Products

    BGE-M3

    A versatile embedding model from BAAI that simultaneously supports dense retrieval, sparse retrieval, and multi-vector retrieval, with multilingual support for 100+ languages and multi-granularity processing from short sentences to 8192-token documents.

    EmbeddingGemma

    Google's 308M parameter multilingual text embedding model based on Gemma 3 that runs in less than 200MB RAM with quantization, generates embeddings in under 22ms on EdgeTPU, and ranks highest on MTEB for models under 500M parameters.

    Cohere Embed v4

    Multilingual, multimodal enterprise embedding model supporting over 100 programming languages and primary business languages with advanced quantization for cost optimization.

    BGE-VL

    State-of-the-art multimodal embedding model from BAAI supporting text-to-image, image-to-text, and compositional visual search. Trained on the MegaPairs dataset with over 26 million retrieval triplets.

    Qwen3 Embedding

    Multilingual embedding model supporting over 100 languages and ranking #1 on MTEB multilingual leaderboard. Offers flexible model sizes from 0.6B to 8B parameters with user-defined instructions.

    Jina Embeddings v4

    Universal multimodal embedding model from Jina AI supporting text and images through a unified pathway. Built on Qwen2.5-VL-3B-Instruct, it outperforms proprietary models on visually rich document retrieval. Offered as a commercial API with a free tier; open-source weights are also available.

    Copyright © 2025 Awesome Vector Databases. All rights reserved.