• Home
  • Categories
  • Tags
  • Pricing
  • Submit
    Decorative pattern
    1. Home
    2. Machine Learning Models
    3. E5-Mistral-7B-Instruct

    E5-Mistral-7B-Instruct

    Open-source embeddings model from Microsoft initialized from Mistral-7B-v0.1, achieving state-of-the-art BEIR score of 56.9 for English text embedding and retrieval tasks with 4096-dimensional vectors.

    🌐Visit Website

    About this tool

    Overview

    E5-Mistral is an open-source embeddings model developed by Microsoft, released under the MIT license. This E5 embedding model by Microsoft is initialized from Mistral-7B-v0.1 and fine-tuned on a mixture of multilingual datasets.

    Technical Specifications

    • Layers: 32 layers
    • Embedding Size: 4096 dimensions
    • Performance: Achieves a BEIR score of 56.9
    • Model Size: 14GB (the biggest on the MTEB leaderboard but also top performing)

    Key Features

    Instruction-Based Customization

    The task definition should be a one-sentence instruction that describes the task. This is a way to customize text embeddings for different scenarios through natural language instructions.

    High-Quality Representations

    Built with PyTorch, it generates high-quality vector representations useful for:

    • Semantic search
    • Information retrieval
    • Clustering tasks
    • Text similarity

    Language Support

    Since Mistral-7B-v0.1 is mainly trained on English data, it's recommended to use this model for English only.

    Requirements

    For e5-mistral-7b-instruct, it would require transformers>=4.34 to load Mistral model.

    Microsoft Integration

    The model is available on:

    • Microsoft's AI Model Catalog as second-state-e5-mistral-7b-instruct-embedding-gguf
    • Part of Microsoft's UniLM project
    • Hugging Face model hub

    Use Cases

    • Enterprise semantic search
    • Document retrieval systems
    • Question answering pipelines
    • Embedding-based classification

    Pricing

    Free and open-source under MIT license.

    Surveys

    Loading more......

    Information

    Websitehuggingface.co
    PublishedMar 13, 2026

    Categories

    1 Item
    Machine Learning Models

    Tags

    3 Items
    #Embeddings#Open Source#Instruction Based

    Similar Products

    6 result(s)
    gte-Qwen2-1.5B-instruct

    A state-of-the-art multilingual text embedding model from Alibaba's GTE (General Text Embedding) series, built on the Qwen2-1.5B LLM. The model supports up to 8192 tokens and incorporates bidirectional attention mechanisms for enhanced contextual understanding across diverse domains.

    INSTRUCTOR

    A task-specific text embedding model that generates customized embeddings based on natural language instructions. INSTRUCTOR achieves state-of-the-art performance on 70 diverse embedding tasks by allowing users to specify the task objective and domain.

    Qwen3 Embedding
    Featured

    Multilingual embedding model supporting over 100 languages and ranking #1 on MTEB multilingual leaderboard. Offers flexible model sizes from 0.6B to 8B parameters with user-defined instructions.

    BGE-M3

    A versatile multilingual text embedding model from BAAI that supports 100+ languages and can handle inputs up to 8192 tokens. BGE-M3 is unique in supporting three retrieval methods simultaneously: dense retrieval, multi-vector retrieval, and sparse retrieval.

    gte-Qwen2-7B-instruct

    A large-scale multilingual text embedding model from Alibaba's GTE series with 7 billion parameters. Built on Qwen2-7B, it achieved a score of 70.24 on MTEB, outperforming NV-Embed-v1 and supporting 100+ languages with up to 8192 token context.

    stella_en

    A family of English text embedding models distilled from state-of-the-art embedding models using a novel multi-stage distillation framework. Stella models support multiple dimensions (512 to 8192) through Matryoshka Representation Learning, offering flexible embedding sizes for different use cases.

    Decorative pattern
    Built with
    Ever Works
    Ever Works

    Connect with us

    Stay Updated

    Get the latest updates and exclusive content delivered to your inbox.

    Product

    • Categories
    • Tags
    • Pricing
    • Help

    Clients

    • Sign In
    • Register
    • Forgot password?

    Company

    • About Us
    • Admin
    • Sitemap

    Resources

    • Blog
    • Submit
    • API Documentation
    All product names, logos, and brands are the property of their respective owners. All company, product, and service names used in this repository, related repositories, and associated websites are for identification purposes only. The use of these names, logos, and brands does not imply endorsement, affiliation, or sponsorship. This directory may include content generated by artificial intelligence.
    Copyright © 2025 Awesome Vector Databases. All rights reserved.·Terms of Service·Privacy Policy·Cookies