
mxbai-embed-large
State-of-the-art large embedding model from Mixedbread AI, ranked first among models of similar size, with support for Matryoshka Representation Learning and binary quantization, trained on more than 700 million pairs.
About this tool
Overview
mxbai-embed-large is a state-of-the-art large embedding model from Mixedbread AI, part of the company's crispy sentence embedding family.
Performance
The model:
- Ranked first among embedding models of similar size
- Outperforms OpenAI's text-embedding-3-large
- Matches the performance of models 20x its size, such as echo-mistral-7b
- As of March 2024, achieves SOTA performance for BERT-large-sized models on the MTEB benchmark
- Was trained with no overlap with the MTEB data, indicating good generalization across domains, tasks, and text lengths
Training
The model was trained with:
- Contrastive training on over 700 million pairs
- Fine-tuning on over 30 million high-quality triplets using the AnglE loss
Key Features
Matryoshka Representation Learning
The model supports Matryoshka Representation Learning (MRL), which allows embeddings to be truncated to smaller dimensions without retraining.
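As a minimal sketch of what truncation looks like in practice, assuming the sentence-transformers library and the Hugging Face checkpoint mixedbread-ai/mxbai-embed-large-v1 with its 1024-dimensional output:

```python
# Minimal MRL-style truncation sketch, assuming sentence-transformers and the
# Hugging Face checkpoint "mixedbread-ai/mxbai-embed-large-v1" (1024-dim output).
import numpy as np
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("mixedbread-ai/mxbai-embed-large-v1")

# Full-size embedding.
embedding = model.encode("A delicious slice of bread", convert_to_numpy=True)

# Keep only the first 512 dimensions and re-normalize so cosine similarity
# still behaves as expected on the shorter vector.
truncated = embedding[:512]
truncated = truncated / np.linalg.norm(truncated)

print(embedding.shape, truncated.shape)  # (1024,) (512,)
```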
Binary Quantization
Supports binary quantization for reduced storage and faster similarity search.
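To illustrate the idea, binary quantization keeps only the sign of each dimension, shrinking a 1024-dimensional float32 vector from 4 KB to 128 bytes and allowing comparison by cheap Hamming distance. A hand-rolled NumPy sketch (recent sentence-transformers versions also ship a quantize_embeddings helper):

```python
# Hand-rolled binary quantization sketch: keep only the sign bit of each
# dimension and compare packed vectors by Hamming distance.
import numpy as np

def binarize(embeddings: np.ndarray) -> np.ndarray:
    """Pack the sign bits of float embeddings into uint8, 8 dims per byte."""
    return np.packbits(embeddings > 0, axis=-1)

def hamming_distance(a: np.ndarray, b: np.ndarray) -> np.ndarray:
    """Number of differing bits between packed binary vectors."""
    return np.unpackbits(a ^ b, axis=-1).sum(axis=-1)

# 1024-dim float32 embeddings shrink to 128 bytes each (32x smaller).
embs = np.random.randn(4, 1024).astype(np.float32)
packed = binarize(embs)
print(packed.shape)                       # (4, 128)
print(hamming_distance(packed[0], packed[1:]))
```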
Task-Specific Prompting
For retrieval tasks, prepend the prompt "Represent this sentence for searching relevant passages: " to the query. Documents are embedded without any prompt.
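A minimal sketch of this convention, again assuming sentence-transformers and the mixedbread-ai/mxbai-embed-large-v1 checkpoint:

```python
# Retrieval prompting sketch: the prompt is prepended to the query only;
# documents are encoded as-is.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("mixedbread-ai/mxbai-embed-large-v1")

prompt = "Represent this sentence for searching relevant passages: "
query = prompt + "What is the capital of France?"
docs = [
    "Paris is the capital and largest city of France.",
    "Bread dough needs time to rise before baking.",
]

query_emb = model.encode(query, convert_to_tensor=True)
doc_embs = model.encode(docs, convert_to_tensor=True)

scores = util.cos_sim(query_emb, doc_embs)
print(scores)  # the Paris passage should score higher
```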
Technical Specifications
- Suggested maximum sequence length: 512 tokens
- Supports tasks: retrieval, classification, clustering, reranking, and summarization
Availability
- Hugging Face
- Ollama (see the sketch after this list)
- Docker Hub
- Multiple integration platforms
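If you run the model locally through Ollama, a rough sketch using the official ollama Python client, assuming the model has already been pulled with `ollama pull mxbai-embed-large` (client API details may vary between versions):

```python
# Embedding via a local Ollama server with the official ollama Python client.
import ollama

response = ollama.embeddings(
    model="mxbai-embed-large",
    prompt="Represent this sentence for searching relevant passages: What is bread?",
)
print(len(response["embedding"]))  # 1024-dimensional vector
```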
Pricing
Free and open-source.
Information
Website: huggingface.co
Published: Mar 13, 2026