• Home
  • Categories
  • Tags
  • Pricing
  • Submit
    Decorative pattern
    1. Home
    2. Machine Learning Models
    3. MS MARCO Cross-Encoder

    MS MARCO Cross-Encoder

    Popular cross-encoder reranker models trained on MS MARCO dataset for semantic search, providing superior accuracy in re-ranking the top results from bi-encoder retrieval systems.

    🌐Visit Website

    About this tool

    Overview

    MS MARCO Cross-Encoder models are widely used reranker models trained on the MS MARCO dataset for semantic search applications. The most popular variant is cross-encoder/ms-marco-MiniLM-L-6-v2.

    How Cross-Encoders Work

    Cross-encoders calculate a similarity score given pairs of texts, processing both the query and document together through every transformer layer. This allows attention to model their interaction directly, providing superior performance compared to bi-encoder (Sentence Transformer) models.

    Architecture

    A cross-encoder takes a query–document pair as a single input sequence — [CLS] query [SEP] document [SEP] — and outputs a scalar relevance score.

    Two-Stage Retrieval Pipeline

    The recommended approach is:

    1. Stage 1 (Bi-encoder): Fast retrieval of top-20 candidates
    2. Stage 2 (Cross-encoder): Score each query–chunk pair jointly, return top-3 to the LLM

    This balances speed and accuracy, as cross-encoders are slower than sentence transformers but provide higher quality ranking.

    Performance

    Models trained on the MS MARCO dataset are very effective as rerankers for search systems, providing high accuracy and deep semantic understanding essential in reranking tasks.

    Output

    MS MARCO models return logits rather than normalized scores, though they can be configured to return scores between 0 and 1.

    Use Cases

    • Re-ranking search results
    • RAG pipeline optimization
    • Question-answering systems
    • Document retrieval
    • Semantic search

    Available Models

    • cross-encoder/ms-marco-TinyBERT-L-2-v2
    • cross-encoder/ms-marco-MiniLM-L-6-v2 (most popular)
    • cross-encoder/ms-marco-MiniLM-L-12-v2
    • Available on Hugging Face and Sentence Transformers

    Pricing

    Free and open-source models available through Sentence Transformers.

    Surveys

    Loading more......

    Information

    Websitewww.sbert.net
    PublishedMar 14, 2026

    Categories

    1 Item
    Machine Learning Models

    Tags

    3 Items
    #Reranker#Cross Encoder#Search

    Similar Products

    6 result(s)
    Reranking Models

    Cross-encoder models that rerank initial retrieval results for improved relevance. More accurate than bi-encoders but slower, typically applied to top-k candidates.

    Cohere's re-ranker
    Featured

    A re-ranking tool provided by Cohere, which can be integrated into LLM applications via frameworks like LangChain to improve the relevance and order of retrieved documents from search systems, including those utilizing vector databases.

    Approximate Nearest Neighbors (ANN)

    Family of algorithms trading perfect accuracy for speed in high-dimensional similarity search. Enables sub-linear query time with 90%+ recall on billion-scale datasets.

    Hybrid Search

    Search approach combining keyword-based (BM25) and semantic (vector) search for best of both worlds. Uses fusion techniques like RRF to merge results.

    Semantic Search

    Search technique understanding meaning and context rather than exact keyword matching. Uses vector embeddings to find semantically similar content even with different wording.

    Vector Similarity Search

    Finding nearest vectors in high-dimensional space based on distance or similarity metrics. Core operation of vector databases enabling semantic search, recommendations, and RAG.

    Decorative pattern
    Built with
    Ever Works
    Ever Works

    Connect with us

    Stay Updated

    Get the latest updates and exclusive content delivered to your inbox.

    Product

    • Categories
    • Tags
    • Pricing
    • Help

    Clients

    • Sign In
    • Register
    • Forgot password?

    Company

    • About Us
    • Admin
    • Sitemap

    Resources

    • Blog
    • Submit
    • API Documentation
    All product names, logos, and brands are the property of their respective owners. All company, product, and service names used in this repository, related repositories, and associated websites are for identification purposes only. The use of these names, logos, and brands does not imply endorsement, affiliation, or sponsorship. This directory may include content generated by artificial intelligence.
    Copyright © 2025 Awesome Vector Databases. All rights reserved.·Terms of Service·Privacy Policy·Cookies