



Python toolkit for efficient RAG research providing 36 pre-processed benchmark datasets and 23 state-of-the-art RAG algorithms in a unified, modular framework for reproduction and development.
FlashRAG is a Python toolkit for the reproduction and development of Retrieval Augmented Generation (RAG) research. FlashRAG is an efficient and modular open-source toolkit designed to assist researchers in reproducing and comparing existing RAG methods and developing their own algorithms within a unified framework.
The toolkit includes:
Multimodal RAG support has been added, including MLLMs like Llava, Qwen, InternVL, and various multimodal retrievers with Clip architecture.
A new paradigm that combines reasoning ability and retrieval, representing a significant advancement in RAG systems for complex reasoning tasks.
RAG methods are categorized into four types based on their inference paths:
The technical paper "FlashRAG: A Python Toolkit for Efficient RAG Research" was accepted to the Resource Track of the 2025 ACM Web Conference (WWW 2025).
Free and open-source.
Loading more......