
gte-Qwen2-7B-instruct
A large-scale multilingual text embedding model from Alibaba's GTE series with 7 billion parameters. Built on Qwen2-7B, it achieved a score of 70.24 on MTEB, outperforming NV-Embed-v1 and supporting 100+ languages with up to 8192 token context.
About this tool
Overview
gte-Qwen2-7B-instruct is the flagship model in Alibaba's GTE-Qwen2 series, featuring 7 billion parameters and achieving state-of-the-art performance on multilingual embedding benchmarks.
Performance
MTEB Benchmark: 70.24 score
Outperforms:
- NV-Embed-v1: 69.32
- gte-Qwen1.5-7B-instruct: 67.34
Technical Features
- 7 Billion Parameters: Larger model size enables richer representations
- Bidirectional Attention: Enhanced contextual understanding
- 8192 Token Context: Process long documents
- 100+ Languages: Comprehensive multilingual support
- Advanced Training: Weakly supervised and supervised data
Use Cases
- Enterprise multilingual search
- Long-document embedding
- High-quality RAG systems
- Cross-lingual retrieval
- Academic and research applications
Availability
Hugging Face: Alibaba-NLP/gte-Qwen2-7B-instruct
Commercial API: Alibaba Cloud text-embedding-v3
Surveys
Loading more......
