stella_en

A family of English text embedding models distilled from state-of-the-art embedding models using a novel multi-stage distillation framework. Stella models support multiple dimensions (512 to 8192) through Matryoshka Representation Learning, offering flexible embedding sizes for different use cases.

Visit Website

Overview

The stella_en model family represents a breakthrough in embedding model distillation, created by researcher dunzhang. These models are distilled from Alibaba's state-of-the-art GTE embedding models using an innovative multi-stage distillation framework.

Key Innovation: Multi-Stage Distillation

Introduced in the paper "Jasper and Stella: distillation of SOTA embedding models" (arXiv:2412.19048), the approach enables a smaller student embedding model to distill multiple larger teacher embedding models through three carefully designed losses.

Teacher Models

Stella models are distilled from:

Alibaba-NLP/gte-large-en-v1.5
Alibaba-NLP/gte-Qwen2-1.5B-instruct

This multi-teacher approach allows the student model to learn diverse strengths from different architectures.

Matryoshka Representation Learning (MRL)

Utilizes MRL to support multiple embedding dimensions:

512 dimensions: Compact, fast, lower storage
768, 1024 dimensions: Balanced performance and efficiency
2048, 4096 dimensions: Higher quality for demanding tasks
6144, 8192 dimensions: Maximum quality

Performance Note: The MTEB score at 1024d is only 0.001 lower than 8192d, making 1024d a sweet spot for most applications.

Model Variants

stella_en_1.5B_v5: 1.5 billion parameters, higher quality

stella_en_400M_v5: 400 million parameters, smaller and faster

Both variants support the full range of dimensions through MRL.

Simplified Prompting

Stella models simplify prompt usage by providing two prompts for most general tasks:

s2p (sentence-to-passage): For query-document retrieval
s2s (sentence-to-sentence): For similarity comparison

This reduces complexity compared to models requiring extensive prompt engineering.

Performance Benefits

Competitive Quality: Through distillation, achieves performance close to much larger teacher models

Flexible Sizing: MRL allows trading off quality vs. speed/storage based on application needs

Efficiency: Smaller models (400M) offer fast inference while maintaining good quality

Use Cases

High-throughput applications: Use 512 or 768 dimensions for speed
Balanced deployments: Use 1024 dimensions for optimal quality/efficiency
Quality-critical tasks: Use 4096 or 8192 dimensions
Resource-constrained environments: stella_en_400M_v5 with lower dimensions

Surveys

Loading more......

Information

Websitehuggingface.co

PublishedMar 20, 2026

Tags

4 Items

#embeddings #matryoshka #distillation #open-source

Similar Products

mxbai-embed-large

State-of-the-art large embedding model from Mixedbread AI, ranked first among similar-sized models, supporting Matryoshka Representation Learning and binary quantization with 700M+ training pairs.

000

Qwen3 Embedding

Multilingual embedding model supporting over 100 languages and ranking #1 on MTEB multilingual leaderboard. Offers flexible model sizes from 0.6B to 8B parameters with user-defined instructions.

000

sqlite-vec

sqlite-vec is a Rust-based SQLite extension library for vector similarity search using diskANN indexes on embeddings, enabling lightweight ANN without separate databases. Features HNSW-like graphs, quantization support, and hybrid full-text+vector queries in embedded SQLite environments. Perfect for prototyping and on-device apps; extremely lightweight compared to Milvus, more persistent than pure hnswlib.

000

ClickHouse

ClickHouse is a columnar OLAP database with vector indexes (ANN via AMM, brute-force), supporting SQL queries over vectors + structured data at petabyte scale. Excels in aggregations with vectors. For analytics workloads with embeddings; faster ingestion than Postgres pgvector for big data.

000

txtai

Open-source embeddings database for semantic search, workflows, and AI applications with vector storage and retrieval capabilities.

000

Pixeltable

Pixeltable is an open-source database featuring automatic incremental embedding indexing for efficient vector search. It supports Apache License 2.0 and is designed for handling embeddings in AI applications.

000

Overview

Key Innovation: Multi-Stage Distillation

Teacher Models

Stella models are distilled from:

Alibaba-NLP/gte-large-en-v1.5
Alibaba-NLP/gte-Qwen2-1.5B-instruct

This multi-teacher approach allows the student model to learn diverse strengths from different architectures.

Matryoshka Representation Learning (MRL)

Utilizes MRL to support multiple embedding dimensions:

512 dimensions: Compact, fast, lower storage
768, 1024 dimensions: Balanced performance and efficiency
2048, 4096 dimensions: Higher quality for demanding tasks
6144, 8192 dimensions: Maximum quality

Performance Note: The MTEB score at 1024d is only 0.001 lower than 8192d, making 1024d a sweet spot for most applications.

Model Variants

stella_en_1.5B_v5: 1.5 billion parameters, higher quality

stella_en_400M_v5: 400 million parameters, smaller and faster

Both variants support the full range of dimensions through MRL.

Simplified Prompting

Stella models simplify prompt usage by providing two prompts for most general tasks:

s2p (sentence-to-passage): For query-document retrieval
s2s (sentence-to-sentence): For similarity comparison

This reduces complexity compared to models requiring extensive prompt engineering.

Performance Benefits

Competitive Quality: Through distillation, achieves performance close to much larger teacher models

Flexible Sizing: MRL allows trading off quality vs. speed/storage based on application needs

Efficiency: Smaller models (400M) offer fast inference while maintaining good quality

Use Cases

High-throughput applications: Use 512 or 768 dimensions for speed
Balanced deployments: Use 1024 dimensions for optimal quality/efficiency
Quality-critical tasks: Use 4096 or 8192 dimensions
Resource-constrained environments: stella_en_400M_v5 with lower dimensions

stella_en

Overview

Key Innovation: Multi-Stage Distillation

Teacher Models

Matryoshka Representation Learning (MRL)

Model Variants

Simplified Prompting

Performance Benefits

Use Cases

Information

Categories

Tags

Similar Products

Connect with us

Stay Updated

Product

Clients

Company

Resources

stella_en

Overview

Key Innovation: Multi-Stage Distillation

Teacher Models

Matryoshka Representation Learning (MRL)

Model Variants

Simplified Prompting

Performance Benefits

Use Cases

Information

Categories

Tags

Similar Products

Technical Details

Availability

Research