    NV-Embed

    NVIDIA's generalist embedding model, which reached a then-record score of 69.32 on the MTEB benchmark. Fine-tuned from a decoder-only LLM (Mistral-7B) with improved techniques for training LLMs as embedding models.


    About this tool

    Overview

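    NV-Embed is a generalist embedding model that improves the performance of decoder-only LLMs on embedding and retrieval tasks. It combines architectural changes (a latent attention layer for pooling token outputs, and removal of the causal attention mask during contrastive training) with a two-stage contrastive instruction-tuning procedure.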
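
    Turning a decoder-only LLM into an embedding model means collapsing its per-token hidden states into a single fixed-length vector. The sketch below shows the simplest common way to do that, masked mean pooling. It is an illustrative baseline only, not NV-Embed's own pooling layer; the function name and the toy 3-dimensional values are hypothetical.

```python
from typing import List


def mean_pool(token_vectors: List[List[float]], attention_mask: List[int]) -> List[float]:
    """Average the vectors of non-padding tokens into one fixed-length vector."""
    if len(token_vectors) != len(attention_mask):
        raise ValueError("one mask entry is required per token vector")
    # Keep only the positions the mask marks as real tokens (1), not padding (0).
    kept = [vec for vec, keep in zip(token_vectors, attention_mask) if keep]
    if not kept:
        raise ValueError("attention mask selects no tokens")
    dim = len(kept[0])
    return [sum(vec[i] for vec in kept) / len(kept) for i in range(dim)]


# Hypothetical hidden states for four token positions; the last one is padding.
hidden_states = [
    [1.0, 0.0, 2.0],
    [3.0, 2.0, 0.0],
    [2.0, 4.0, 1.0],
    [9.0, 9.0, 9.0],
]
mask = [1, 1, 1, 0]

embedding = mean_pool(hidden_states, mask)
print(embedding)  # [2.0, 2.0, 1.0]
```

    The padding position is ignored, so the output length depends only on the hidden size, never on the input length.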

    Performance

    Using only publicly available data, NV-Embed achieved:

    • Record-high score of 69.32 on MTEB (Massive Text Embedding Benchmark)
    • Ranked #1 on MTEB as of May 24, 2024
    • Evaluated across 56 tasks

    NVIDIA Llama-based Embedding Models (2026)

    NVIDIA has developed several embedding models based on the Llama architecture:

    llama-text-embed-v2

    • Built on Llama 3.2 1B architecture
    • Optimized for high retrieval quality with low-latency inference

    llama-3.2-nv-embedqa-1b-v2

    • Dense text embedding model for fixed-length vector representations
    • Note: the API is scheduled for deprecation on May 18, 2026

    llama-embed-nemotron-8b

    • Open-weights text embedding model
    • State-of-the-art on the Multilingual MTEB leaderboard as of October 21, 2025
    • Based on meta-llama/Llama-3.1-8B
    • Fine-tuned with a bidirectional attention mechanism

    NV-Embed-v2 (August 2024)

    • Successor to the original NV-Embed
    • Fine-tuned from Mistral-7B-v0.1 rather than from a Llama model
    • Scored 72.31 on MTEB across 56 tasks, ranking #1 as of August 30, 2024

    Key Innovations

    • Improved techniques for training LLMs as generalist embedding models
    • Decoder-only architecture optimization for embeddings
    • Bidirectional attention mechanisms
    • Multilingual capabilities
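
    The difference between the causal attention of a decoder-only LLM and the bidirectional attention listed above comes down to which positions each token may look at. The following is a minimal sketch of the two masks as 0/1 matrices; the function names are hypothetical and this is not taken from any NVIDIA implementation.

```python
from typing import List


def causal_mask(n: int) -> List[List[int]]:
    """mask[i][j] == 1 when token i may attend to token j (only j <= i)."""
    return [[1 if j <= i else 0 for j in range(n)] for i in range(n)]


def bidirectional_mask(n: int) -> List[List[int]]:
    """Every token may attend to every other token, earlier or later."""
    return [[1] * n for _ in range(n)]


print("causal:")
for row in causal_mask(3):
    print(row)
# [1, 0, 0]
# [1, 1, 0]
# [1, 1, 1]

print("bidirectional:")
for row in bidirectional_mask(3):
    print(row)
# [1, 1, 1]
# [1, 1, 1]
# [1, 1, 1]
```

    Under the causal mask the first token never sees the rest of the sentence, which suits next-token generation; for embeddings, letting every token see the full input gives each position complete context.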

    Applications

    • Semantic search
    • Retrieval-augmented generation (RAG)
    • Multilingual information retrieval
    • Cross-lingual tasks
    • General-purpose text embedding
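
    Semantic search and the retrieval step of RAG both reduce to the same operation: embed the query, then rank stored document embeddings by similarity. The sketch below shows that ranking with cosine similarity. The document ids and 2-dimensional vectors are hypothetical stand-ins for real model output, and no embedding model is called.

```python
import math
from typing import Dict, List, Tuple


def cosine_similarity(a: List[float], b: List[float]) -> float:
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def top_k(query: List[float], corpus: Dict[str, List[float]], k: int) -> List[Tuple[str, float]]:
    """Return the k document ids most similar to the query, best first."""
    scored = [(cosine_similarity(query, vec), doc_id) for doc_id, vec in corpus.items()]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [(doc_id, score) for score, doc_id in scored[:k]]


# Hypothetical precomputed document embeddings.
corpus = {
    "doc_gpu": [1.0, 0.0],
    "doc_cooking": [0.0, 1.0],
    "doc_cuda": [0.8, 0.6],
}
query = [1.0, 0.0]

results = top_k(query, corpus, k=2)
print([doc_id for doc_id, _ in results])  # ['doc_gpu', 'doc_cuda']
```

    A brute-force scan like this is fine for small corpora; larger collections typically hand the same ranking to a vector database or approximate nearest-neighbor index.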

    Availability

    Available through:

    • NVIDIA NIM (NVIDIA Inference Microservices)
    • Hugging Face
    • Research paper on arXiv

    Information

    Website: arxiv.org
    Published: Mar 8, 2026

    Categories

    Machine Learning Models

    Tags

    #Embeddings #Nvidia #Llm

    Similar Products

    ColBERTv2

    Advanced multi-vector retrieval model creating token-level embeddings with late interaction mechanism, featuring denoised supervision and improved memory efficiency over original ColBERT.

    pinecone-sparse-english-v0

    Learned sparse embedding model built on DeepImpact architecture, outperforming BM25 by up to 44% on TREC benchmarks for high-precision keyword search and hybrid retrieval.

    voyage-3-large

    State-of-the-art general-purpose and multilingual embedding model from Voyage AI that ranks first across eight domains spanning 100 datasets, outperforming OpenAI and Cohere models by significant margins.

    Qwen3 Embedding

    Multilingual embedding model supporting over 100 languages and ranking #1 on MTEB multilingual leaderboard. Offers flexible model sizes from 0.6B to 8B parameters with user-defined instructions.

    all-MiniLM-L6-v2

    A compact and efficient pre-trained sentence embedding model, widely used for generating vector representations of text. It's a popular choice for applications requiring fast and accurate semantic search, often integrated with vector databases.

    Cohere Embed v4

    Multilingual, multimodal enterprise embedding model supporting over 100 programming languages and primary business languages with advanced quantization for cost optimization.

    All product names, logos, and brands are the property of their respective owners. All company, product, and service names used in this repository, related repositories, and associated websites are for identification purposes only. The use of these names, logos, and brands does not imply endorsement, affiliation, or sponsorship. This directory may include content generated by artificial intelligence.
    Copyright © 2025 Awesome Vector Databases. All rights reserved.