• Home
  • Categories
  • Tags
  • Pricing
  • Submit
    Decorative pattern
    1. Home
    2. Machine Learning Models
    3. Llama-Embed-Nemotron-8B

    Llama-Embed-Nemotron-8B

    Universal text embedding model from NVIDIA achieving state-of-the-art performance on MMTEB leaderboard, optimized for retrieval, reranking, semantic similarity, and classification with 4,096-dimensional embeddings.

    🌐Visit Website

    About this tool

    Overview

    llama-embed-nemotron-8b is a versatile text embedding model trained by NVIDIA and optimized for retrieval, reranking, semantic similarity, and classification use cases. It achieves state-of-the-art performance on the Multilingual Massive Text Embedding Benchmark (MMTEB) leaderboard as of October 21, 2025.

    Architecture & Vector Embeddings

    The model consists of:

    • 32 hidden layers
    • Embedding size of 4,096 dimensions
    • Global average pooling to compress token information into dense vectors
    • Initializes using the weights and architecture of Llama-3.1-8B model
    • Replaces causal attention mask with bi-directional attention

    Key Capabilities

    Multilingual and Cross-Lingual

    Robust capabilities for multilingual and cross-lingual text retrieval, designed to serve as a foundational component in text-based Retrieval-Augmented Generation (RAG) systems.

    Instruction-Tuned

    A universal, instruction-tuned text embedding model designed to generate specialized embeddings for a wide range of tasks, including retrieval, classification, and semantic textual similarity (STS).

    Training Data

    The complete dataset consists of 4.3 million samples from a diverse range of corpora:

    • Approximately 2.7 million non-synthetic samples from public sources
    • 1.6 million synthetic samples

    Performance

    Achieved 62% Top-1 accuracy, the highest among all tested embedding models in comparative benchmarks.

    Pricing

    Free to use under NVIDIA AI Foundation Models license.

    Surveys

    Loading more......

    Information

    Websitehuggingface.co
    PublishedMar 13, 2026

    Categories

    1 Item
    Machine Learning Models

    Tags

    3 Items
    #Embeddings#Multilingual#Nvidia

    Similar Products

    6 result(s)
    voyage-3-large
    Featured

    State-of-the-art general-purpose and multilingual embedding model from Voyage AI that ranks first across eight domains spanning 100 datasets, outperforming OpenAI and Cohere models by significant margins.

    NV-Embed
    Featured

    NVIDIA's generalist embedding model achieving record 69.32 score on MTEB benchmark. Fine-tuned from Llama architecture with improved techniques for training LLMs as embedding models.

    Qwen3 Embedding
    Featured

    Multilingual embedding model supporting over 100 languages and ranking #1 on MTEB multilingual leaderboard. Offers flexible model sizes from 0.6B to 8B parameters with user-defined instructions.

    nomic-embed-text-v2-moe

    Multilingual MoE text embedding model excelling at multilingual retrieval with SoTA performance compared to ~300M parameter models, supporting ~100 languages with Matryoshka Embeddings trained on 1.6B pairs.

    UForm

    Pocket-sized multimodal AI for content understanding across multilingual texts, images, and video. Up to 5x faster than OpenAI CLIP with quantization-aware embeddings and support for 20+ languages.

    Cohere Embed v4

    Multilingual, multimodal enterprise embedding model supporting over 100 programming languages and primary business languages with advanced quantization for cost optimization.

    Decorative pattern
    Built with
    Ever Works
    Ever Works

    Connect with us

    Stay Updated

    Get the latest updates and exclusive content delivered to your inbox.

    Product

    • Categories
    • Tags
    • Pricing
    • Help

    Clients

    • Sign In
    • Register
    • Forgot password?

    Company

    • About Us
    • Admin
    • Sitemap

    Resources

    • Blog
    • Submit
    • API Documentation
    All product names, logos, and brands are the property of their respective owners. All company, product, and service names used in this repository, related repositories, and associated websites are for identification purposes only. The use of these names, logos, and brands does not imply endorsement, affiliation, or sponsorship. This directory may include content generated by artificial intelligence.
    Copyright © 2025 Awesome Vector Databases. All rights reserved.·Terms of Service·Privacy Policy·Cookies