



Benchmark dataset designed for evaluating multimodal retrieval systems in the medical domain. Tests retrieval performance on medical literature tasks involving both text and visual information, providing standardized evaluation for multimodal RAG systems.
Loading more......
Benchmark dataset designed for evaluating multimodal retrieval systems in the medical domain. Tests retrieval performance on medical literature tasks involving both text and visual information, providing standardized evaluation for multimodal RAG systems.