Copied


NVIDIA's NV-Embed Model Achieves Top Spot on MTEB Leaderboard

Luisa Crawford   Jun 11, 2024 07:25 0 Min Read


NVIDIA's latest embedding model, NV-Embed, has set a new record for embedding accuracy with a score of 69.32 on the Massive Text Embedding Benchmark (MTEB), which encompasses 56 diverse embedding tasks, according to NVIDIA Technical Blog.

Understanding the Metrics for Embedding Models

Embedding models are evaluated using several metrics, with Normalized Discounted Cumulative Gain (NDCG) and Recall being the most significant. NDCG is a rank-aware metric that measures the relevance and order of retrieved information, while Recall is a rank-agnostic metric that measures the percentage of relevant results retrieved. Most benchmarks report NDCG@10, but for enterprise-grade retrieval-augmented generation (RAG) pipelines, Recall@5 is often recommended.

What are MTEB and BEIR?

The MTEB benchmark covers 56 tasks, including retrieval, classification, re-ranking, clustering, and summarization. BEIR focuses on retrieval tasks and adds complexity with different question types and domains, such as fact-checking and biomedical questions. MTEB is largely a superset of BEIR, making it a comprehensive benchmark for evaluating embedding models.

NV-Embed's performance on MTEB has been exceptional, achieving an NDCG@10 score of 69.32, the highest among all models tested. This performance is attributed to several key improvements in the model's architecture and training process.

Key Improvements in NV-Embed

  • Latent Attention Layer: This new layer simplifies the process of combining the mathematical representation (embeddings) of a series of words, improving the model's efficiency and accuracy.
  • Two-Stage Learning Process: The first stage uses in-batch negative and hard negative pairs for contrastive learning, while the second stage blends data from non-retrieval tasks for further training, enhancing the model's robustness.

These advancements make NV-Embed a powerful tool for enterprise retrieval workloads, although its effectiveness depends on the nature and domain of the data being used.

Prototyping with NV-Embed

NV-Embed is available through NVIDIA's API catalog, allowing organizations to integrate this high-performing model into their data processing pipelines. Additionally, the NVIDIA NeMo Retriever collection of microservices enables seamless connection of custom models to diverse business data, delivering highly accurate responses.

For further details, visit the NVIDIA Technical Blog.


Read More
The Hong Kong Monetary Authority has issued a warning about a fraudulent website posing as OCBC Bank (Hong Kong) Limited, urging public vigilance.
BitMEX has changed the Mark Method for NILUSDTH25 and REDUSDTZ25 to Fair Price marking, effective March 25, 2025, enhancing price accuracy.
BitMEX introduces NILUSDT perpetual swaps, offering traders up to 50x leverage. This new listing enhances trading options on the platform.
Bitcoin remains vulnerable to downward pressure due to tight liquidity conditions and weak investor sentiment, with ETF outflows and cautious market behavior persisting.
Vodafone implements AI-driven solutions using LangChain and LangGraph to optimize data operations and improve performance metrics monitoring and information retrieval across its data centers.
BitMEX announces the introduction of NILUSDT perpetual swap listing, offering traders up to 50x leverage. The NIL token will be available for trading starting March 25, 2024.
Cronos (CRO) Labs has appointed Mirko Zhao as its new leader, succeeding Ken Timsit. Zhao aims to enhance the blockchain’s growth and community engagement.
Cronos (CRO) Labs announces Mirko Zhao as the new Head of Product and Engineering, succeeding Ken Timsit, to lead the blockchain ecosystem's innovative growth.