When to use Graphs in RAG: A Comprehensive Benchmark and Analysis for Graph Retrieval-Augmented Generation
If you find this benchmark helpful, please cite our paper:
```bibtex
@article{xiang2025use,
  title={When to use Graphs in RAG: A Comprehensive Analysis for Graph Retrieval-Augmented Generation},
  author={Xiang, Zhishang and Wu, Chuanjie and Zhang, Qinggang and Chen, Shengyuan and Hong, Zijin and Huang, Xiao and Su, Jinsong},
  journal={arXiv preprint arXiv:2506.05690},
  year={2025}
}
```

This repository is for the GraphRAG-Bench project, a comprehensive benchmark for evaluating Graph Retrieval-Augmented Generation models.
- [2025-10-27] We release LinearRAG, a relation-free method for efficient GraphRAG.
- [2025-08-24] We support DIGIMON for flexible benchmarking across GraphRAG models.
- [2025-05-25] We release GraphRAG-Bench, the benchmark for evaluating GraphRAG models.
- [2025-05-14] We release the GraphRAG-Bench dataset.
- [2025-01-21] We release the GraphRAG survey.
- Introduces the Graph Retrieval-Augmented Generation (GraphRAG) concept
- Compares the traditional RAG and GraphRAG approaches
- Explains the research objective: identify scenarios where GraphRAG outperforms traditional RAG
- Visual comparison diagram of RAG vs. GraphRAG
More Details
Graph retrieval-augmented generation (GraphRAG) has emerged as a powerful paradigm for enhancing large language models (LLMs) with external knowledge. It leverages graphs to model the hierarchical structure between specific concepts, enabling more coherent and effective knowledge retrieval for accurate reasoning. Despite its conceptual promise, recent studies report that GraphRAG frequently underperforms vanilla RAG on many real-world tasks. This raises a critical question: Is GraphRAG really effective, and in which scenarios do graph structures provide measurable benefits for RAG systems? To address this, we propose GraphRAG-Bench, a comprehensive benchmark designed to evaluate GraphRAG models on both hierarchical knowledge retrieval and deep contextual reasoning. GraphRAG-Bench features a comprehensive dataset with tasks of increasing difficulty, covering fact retrieval, complex reasoning, contextual summarization, and creative generation, and a systematic evaluation across the entire pipeline, from graph construction and knowledge retrieval to final generation. Leveraging this novel benchmark, we systematically investigate the conditions under which GraphRAG surpasses traditional RAG and the underlying reasons for its success, offering guidelines for its practical application.

Two domain-specific leaderboards with comprehensive metrics:
1. GraphRAG-Bench (Novel)
- Evaluates models on literary/fictional content
2. GraphRAG-Bench (Medical)
- Evaluates models on medical/healthcare content
Evaluation Dimensions:
- Fact Retrieval (Accuracy, ROUGE-L)
- Complex Reasoning (Accuracy, ROUGE-L)
- Contextual Summarization (Accuracy, Coverage)
- Creative Generation (Accuracy, Factual Score, Coverage)
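
As a rough illustration of how two of these dimensions can be scored, the sketch below computes exact-match accuracy and ROUGE-L using the `rouge-score` package. It is only a minimal example under assumed inputs (`preds`, `refs`), not the standardized evaluation code shipped in the Evaluation folder.

```python
# Minimal sketch of accuracy and ROUGE-L scoring (illustrative only;
# the scripts in the Evaluation folder are the reference implementation).
# Requires: pip install rouge-score
from rouge_score import rouge_scorer


def exact_match_accuracy(predictions, references):
    """Fraction of predictions matching the reference after simple normalization."""
    normalize = lambda s: " ".join(s.lower().split())
    hits = sum(normalize(p) == normalize(r) for p, r in zip(predictions, references))
    return hits / len(references)


def mean_rouge_l(predictions, references):
    """Average ROUGE-L F1 between each prediction and its reference."""
    scorer = rouge_scorer.RougeScorer(["rougeL"], use_stemmer=True)
    scores = [
        scorer.score(r, p)["rougeL"].fmeasure
        for p, r in zip(predictions, references)
    ]
    return sum(scores) / len(scores)


if __name__ == "__main__":
    preds = ["Normandy", "It is located in Normandy."]
    refs = ["Normandy", "Mont St. Michel is located in Normandy."]
    print(f"Accuracy: {exact_match_accuracy(preds, refs):.2f}")
    print(f"ROUGE-L:  {mean_rouge_l(preds, refs):.2f}")
```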
- Level 1: Fact Retrieval. Example: "Which region of France is Mont St. Michel located?"
- Level 2: Complex Reasoning. Example: "How did Hinze's agreement with Felicia relate to the perception of England's rulers?"
- Level 3: Contextual Summarization. Example: "What role does John Curgenven play as a Cornish boatman for visitors exploring this region?"
- Level 4: Creative Generation. Example: "Retell King Arthur's comparison to John Curgenven as a newspaper article."
First, install the necessary dependencies for GraphRAG-Bench.
```bash
pip install -r requirements.txt
```
To prevent dependency conflicts, we strongly recommend using separate Conda environments for each framework:
We use the installation of LightRAG as an example. For other frameworks, please refer to their respective installation instructions.
```bash
# Create and activate environment (example for LightRAG)
conda create -n lightrag python=3.10 -y
conda activate lightrag

# Install LightRAG
git clone https://github.com/HKUDS/LightRAG.git
cd LightRAG
pip install -e .
```
We provide detailed instructions on how to use GraphRAG-Bench to evaluate each framework.
Specifically, we introduce how to perform index construction and batch inference for each framework in the Examples folder, with instructions in the Examples README.
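
For orientation, the sketch below outlines what index construction and a single query look like with LightRAG's documented Python API (`LightRAG`, `QueryParam`). The corpus path, model function, and import paths are placeholders and vary across LightRAG releases; follow the Examples README for the exact benchmark scripts.

```python
# Minimal sketch of indexing and querying with LightRAG (API details may
# differ by version); the actual benchmark scripts live in the Examples folder.
from lightrag import LightRAG, QueryParam
from lightrag.llm import gpt_4o_mini_complete  # import path varies across LightRAG releases

rag = LightRAG(
    working_dir="./lightrag_index",       # where the graph and caches are stored
    llm_model_func=gpt_4o_mini_complete,  # LLM used for extraction and generation
)

# Index construction: insert the benchmark corpus documents (hypothetical path).
with open("corpus.txt", encoding="utf-8") as f:
    rag.insert(f.read())

# Inference: answer a benchmark question with graph-based retrieval.
answer = rag.query(
    "Which region of France is Mont St. Michel located?",
    param=QueryParam(mode="hybrid"),  # modes include naive, local, global, hybrid
)
print(answer)
```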
Note that the evaluation code is standardized across all frameworks to ensure fair comparison. Please refer to the Evaluation folder and the Evaluation README for detailed instructions on the evaluation.
Contributions to improve the benchmark website are welcome. Please contact the project team via GraphRAG@hotmail.com.
If you find this benchmark helpful, please cite our paper:
```bibtex
@article{xiang2025use,
  title={When to use Graphs in RAG: A Comprehensive Analysis for Graph Retrieval-Augmented Generation},
  author={Xiang, Zhishang and Wu, Chuanjie and Zhang, Qinggang and Chen, Shengyuan and Hong, Zijin and Huang, Xiao and Su, Jinsong},
  journal={arXiv preprint arXiv:2506.05690},
  year={2025}
}
```