Skip to content

Text Datasets

Text-based benchmarks for retrieval and RAG evaluation.

Available Datasets

Dataset Description Generation GT
BEIR Heterogeneous information retrieval No
MTEB Massive Text Embedding Benchmark No
RAGBench RAG evaluation benchmark Yes
MrTyDi Multilingual retrieval No
BRIGHT Reasoning-intensive retrieval No