

Ultra-lite and super-fast Python reranking library based on SoTA cross-encoders and LLMs, running on CPU with the tiniest reranking model in the world at ~4MB with no PyTorch dependency.
Loading more......
FlashRank is an ultra-lite and super-fast Python library to add re-ranking to your existing search & retrieval pipelines. It is based on SoTA LLMs and cross-encoders, created by Prithiviraj Damodaran.
Supports SoTA Listwise and Pairwise reranking:
FlashRank integrates with various frameworks including:
Free and open-source, available on GitHub and PyPI.