



BenchmarkQED standardizes QPS/latency/accuracy evaluations for RAG pipelines including vector DB retrieval on diverse datasets. Features comparable methodologies for fair benchmarking of full RAG stacks. Essential for selecting production vector DBs in RAG; emphasizes retrieval fairness unlike ANN-Benchmarks indexing focus or VectorDBBench system-level throughput tests.
Loading more......
BenchmarkQED is an open benchmarking framework for Retrieval-Augmented Generation (RAG) systems designed to push the community toward fairer, comparably measured retrieval evaluation methods.
Free and open-source.