



Microsoft's state-of-the-art multilingual text embedding model supporting 100 languages with 1024-dimensional embeddings, trained on 1 billion multilingual text pairs for robust cross-lingual retrieval.
Loading more......
The multilingual-e5-large model is a sophisticated embedding model developed at Microsoft, supporting 100 languages from xlm-roberta. It's designed for robust text representation across diverse languages and tasks.
The training procedure adheres to the English E5 model recipe:
The E5 family includes:
Free and open-source model available on Hugging Face.