



Universal multimodal embedding model from Jina AI supporting text and images through unified pathway. Built on Qwen2.5-VL-3B-Instruct, outperforms proprietary models on visually rich document retrieval. This is a commercial API with free tier, though OSS weights available.
Loading more......
jina-embeddings-v4 is a 3.8B parameter model that embeds text and images through a unified pathway, supporting both dense and late-interaction retrieval. Particularly strong on visually rich document retrieval, outperforming proprietary models from Google, OpenAI, and Voyage AI.
New pricing model introduced May 6, 2025. Users with auto-recharge enabled before this date maintain old pricing. New pricing applies to new purchases or modifications.
Jina intentionally throttles API throughput for jina-embeddings-v4 to manage infrastructure costs. For production workloads requiring high throughput:
Payments processed through Stripe supporting:
v4 adds multimodal capabilities (text + images) with 1,536-dimensional vectors, while v3 was text-only with 1,024 dimensions. v3 offers higher API throughput for production text-only workloads.