• Home
  • Categories
  • Tags
  • Pricing
  • Submit
    Decorative pattern
    1. Home
    2. Machine Learning Models
    3. NV-Embed

    NV-Embed

    NVIDIA's generalist embedding model achieving record 69.32 score on MTEB benchmark. Fine-tuned from Llama architecture with improved techniques for training LLMs as embedding models.

    🌐Visit Website

    About this tool

    Overview

    NV-Embed is a generalist embedding model that enhances the performance of decoder-only LLMs for embedding and retrieval tasks, with various architectural designs and training procedures.

    Performance

    Using only publicly available data, NV-Embed achieved:

    • Record-high score of 69.32 on MTEB (Massive Text Embedding Benchmark)
    • Ranked #1 on MTEB as of May 24, 2024
    • Evaluated across 56 tasks

    NVIDIA Llama-based Embedding Models (2026)

    NVIDIA has developed several embedding models based on Llama architecture:

    llama-text-embed-v2

    • Built on Llama 3.2 1B architecture
    • Optimized for high retrieval quality with low-latency inference

    llama-3.2-nv-embedqa-1b-v2

    • Dense text embedding model for fixed-length vector representations
    • Note: API will be deprecated on 05/18/2026

    llama-embed-nemotron-8b

    • Open-weights text embedding model
    • Achieves state-of-the-art performance on Multilingual MTEB leaderboard (October 21, 2025)
    • Based on meta-llama/Llama-3.1-8B
    • Fine-tuned version with bidirectional attention mechanism

    NV-Embed-v2 (October 2025)

    • Latest embedding model from NVIDIA
    • Fine-tuned from Llama-3.1-8B
    • Particularly powerful at understanding multilingual text
    • Continues NVIDIA's leadership in embedding model performance

    Key Innovations

    • Improved techniques for training LLMs as generalist embedding models
    • Decoder-only architecture optimization for embeddings
    • Bidirectional attention mechanisms
    • Multilingual capabilities

    Applications

    • Semantic search
    • Retrieval-augmented generation (RAG)
    • Multilingual information retrieval
    • Cross-lingual tasks
    • General-purpose text embedding

    Availability

    Available through:

    • NVIDIA NIM (NVIDIA Inference Microservices)
    • Hugging Face
    • Research paper on arXiv
    Surveys

    Loading more......

    Information

    Websitearxiv.org
    PublishedMar 8, 2026

    Categories

    1 Item
    Machine Learning Models

    Tags

    3 Items
    #Embeddings#Nvidia#Llm

    Similar Products

    6 result(s)
    Qwen3 Embedding
    Featured

    Multilingual embedding model supporting over 100 languages and ranking #1 on MTEB multilingual leaderboard. Offers flexible model sizes from 0.6B to 8B parameters with user-defined instructions.

    all-MiniLM-L6-v2
    Featured

    A compact and efficient pre-trained sentence embedding model, widely used for generating vector representations of text. It's a popular choice for applications requiring fast and accurate semantic search, often integrated with vector databases.

    GTE Embeddings

    General Text Embeddings from Alibaba DAMO Academy trained on large-scale relevance pairs. Available in three sizes (large, base, small) with GTE-v1.5 supporting 8192 context length.

    ModernBERT Embed

    Open-source embedding model from Nomic AI based on ModernBERT-base with 149M parameters. Supports 8192 token sequences and Matryoshka Representation Learning for 3x memory reduction.

    OpenAI’s text-embedding-ada-002

    A pre-trained model used for extracting embeddings from content like PDFs, videos, and transcripts, which are then stored in vector databases for faster search.

    Matryoshka Embeddings
    Featured

    Representation learning approach encoding information at multiple granularities, allowing embeddings to be truncated while maintaining performance. Enables 14x smaller sizes and 5x faster search.

    Decorative pattern
    Built with
    Ever Works
    Ever Works

    Connect with us

    Stay Updated

    Get the latest updates and exclusive content delivered to your inbox.

    Product

    • Categories
    • Tags
    • Pricing
    • Help

    Clients

    • Sign In
    • Register
    • Forgot password?

    Company

    • About Us
    • Admin
    • Sitemap

    Resources

    • Blog
    • Submit
    • API Documentation
    All product names, logos, and brands are the property of their respective owners. All company, product, and service names used in this repository, related repositories, and associated websites are for identification purposes only. The use of these names, logos, and brands does not imply endorsement, affiliation, or sponsorship. This directory may include content generated by artificial intelligence.
    Copyright © 2025 Awesome Vector Databases. All rights reserved.·Terms of Service·Privacy Policy·Cookies