LEGO-GraphRAG: Modularizing Graph-based Retrieval-Augmented Generation for Design Space Exploration

GraphRAG integrates (knowledge) graphs with large language models (LLMs) to improve reasoning accuracy and contextual relevance. Despite its promising applications and strong relevance to multiple research communities, such as databases and natural language processing, GraphRAG currently lacks modul...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Cao, Yukun, Gao, Zengyi, Li, Zhiyang, Xie, Xike, Zhou, Kevin, Xu, Jianliang
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Cao, Yukun
Gao, Zengyi
Li, Zhiyang
Xie, Xike
Zhou, Kevin
Xu, Jianliang
description GraphRAG integrates (knowledge) graphs with large language models (LLMs) to improve reasoning accuracy and contextual relevance. Despite its promising applications and strong relevance to multiple research communities, such as databases and natural language processing, GraphRAG currently lacks modular workflow analysis, systematic solution frameworks, and insightful empirical studies. To bridge these gaps, we propose LEGO-GraphRAG, a modular framework that enables: 1) fine-grained decomposition of the GraphRAG workflow, 2) systematic classification of existing techniques and implemented GraphRAG instances, and 3) creation of new GraphRAG instances. Our framework facilitates comprehensive empirical studies of GraphRAG on large-scale real-world graphs and diverse query sets, revealing insights into balancing reasoning quality, runtime efficiency, and token or GPU cost, that are essential for building advanced GraphRAG systems.
doi_str_mv 10.48550/arxiv.2411.05844
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2411_05844</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2411_05844</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2411_058443</originalsourceid><addsrcrecordid>eNqFjr0OgjAURrs4GPUBnOwLgKAlIW5EsQ4aE3QnV7hgk9I25Sfo06vo7vQl5zvDIWTuey4Lg8Bbgu1F566Y77teEDI2Jtkx5meHWzD3JOIbetJ5K8GKp1AlHbBzgxpzmmBjBXYgnagtK1TNm3FUaKERWtFCW7rDWpSKXgxkSOPeSP09p2RUgKxx9tsJWezj6_bgDDmpsaIC-0g_WemQtf5vvADDVkL-</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>LEGO-GraphRAG: Modularizing Graph-based Retrieval-Augmented Generation for Design Space Exploration</title><source>arXiv.org</source><creator>Cao, Yukun ; Gao, Zengyi ; Li, Zhiyang ; Xie, Xike ; Zhou, Kevin ; Xu, Jianliang</creator><creatorcontrib>Cao, Yukun ; Gao, Zengyi ; Li, Zhiyang ; Xie, Xike ; Zhou, Kevin ; Xu, Jianliang</creatorcontrib><description>GraphRAG integrates (knowledge) graphs with large language models (LLMs) to improve reasoning accuracy and contextual relevance. Despite its promising applications and strong relevance to multiple research communities, such as databases and natural language processing, GraphRAG currently lacks modular workflow analysis, systematic solution frameworks, and insightful empirical studies. To bridge these gaps, we propose LEGO-GraphRAG, a modular framework that enables: 1) fine-grained decomposition of the GraphRAG workflow, 2) systematic classification of existing techniques and implemented GraphRAG instances, and 3) creation of new GraphRAG instances. Our framework facilitates comprehensive empirical studies of GraphRAG on large-scale real-world graphs and diverse query sets, revealing insights into balancing reasoning quality, runtime efficiency, and token or GPU cost, that are essential for building advanced GraphRAG systems.</description><identifier>DOI: 10.48550/arxiv.2411.05844</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence ; Computer Science - Computation and Language ; Computer Science - Databases</subject><creationdate>2024-11</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2411.05844$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2411.05844$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Cao, Yukun</creatorcontrib><creatorcontrib>Gao, Zengyi</creatorcontrib><creatorcontrib>Li, Zhiyang</creatorcontrib><creatorcontrib>Xie, Xike</creatorcontrib><creatorcontrib>Zhou, Kevin</creatorcontrib><creatorcontrib>Xu, Jianliang</creatorcontrib><title>LEGO-GraphRAG: Modularizing Graph-based Retrieval-Augmented Generation for Design Space Exploration</title><description>GraphRAG integrates (knowledge) graphs with large language models (LLMs) to improve reasoning accuracy and contextual relevance. Despite its promising applications and strong relevance to multiple research communities, such as databases and natural language processing, GraphRAG currently lacks modular workflow analysis, systematic solution frameworks, and insightful empirical studies. To bridge these gaps, we propose LEGO-GraphRAG, a modular framework that enables: 1) fine-grained decomposition of the GraphRAG workflow, 2) systematic classification of existing techniques and implemented GraphRAG instances, and 3) creation of new GraphRAG instances. Our framework facilitates comprehensive empirical studies of GraphRAG on large-scale real-world graphs and diverse query sets, revealing insights into balancing reasoning quality, runtime efficiency, and token or GPU cost, that are essential for building advanced GraphRAG systems.</description><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Computation and Language</subject><subject>Computer Science - Databases</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNqFjr0OgjAURrs4GPUBnOwLgKAlIW5EsQ4aE3QnV7hgk9I25Sfo06vo7vQl5zvDIWTuey4Lg8Bbgu1F566Y77teEDI2Jtkx5meHWzD3JOIbetJ5K8GKp1AlHbBzgxpzmmBjBXYgnagtK1TNm3FUaKERWtFCW7rDWpSKXgxkSOPeSP09p2RUgKxx9tsJWezj6_bgDDmpsaIC-0g_WemQtf5vvADDVkL-</recordid><startdate>20241106</startdate><enddate>20241106</enddate><creator>Cao, Yukun</creator><creator>Gao, Zengyi</creator><creator>Li, Zhiyang</creator><creator>Xie, Xike</creator><creator>Zhou, Kevin</creator><creator>Xu, Jianliang</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20241106</creationdate><title>LEGO-GraphRAG: Modularizing Graph-based Retrieval-Augmented Generation for Design Space Exploration</title><author>Cao, Yukun ; Gao, Zengyi ; Li, Zhiyang ; Xie, Xike ; Zhou, Kevin ; Xu, Jianliang</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2411_058443</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Computation and Language</topic><topic>Computer Science - Databases</topic><toplevel>online_resources</toplevel><creatorcontrib>Cao, Yukun</creatorcontrib><creatorcontrib>Gao, Zengyi</creatorcontrib><creatorcontrib>Li, Zhiyang</creatorcontrib><creatorcontrib>Xie, Xike</creatorcontrib><creatorcontrib>Zhou, Kevin</creatorcontrib><creatorcontrib>Xu, Jianliang</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Cao, Yukun</au><au>Gao, Zengyi</au><au>Li, Zhiyang</au><au>Xie, Xike</au><au>Zhou, Kevin</au><au>Xu, Jianliang</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>LEGO-GraphRAG: Modularizing Graph-based Retrieval-Augmented Generation for Design Space Exploration</atitle><date>2024-11-06</date><risdate>2024</risdate><abstract>GraphRAG integrates (knowledge) graphs with large language models (LLMs) to improve reasoning accuracy and contextual relevance. Despite its promising applications and strong relevance to multiple research communities, such as databases and natural language processing, GraphRAG currently lacks modular workflow analysis, systematic solution frameworks, and insightful empirical studies. To bridge these gaps, we propose LEGO-GraphRAG, a modular framework that enables: 1) fine-grained decomposition of the GraphRAG workflow, 2) systematic classification of existing techniques and implemented GraphRAG instances, and 3) creation of new GraphRAG instances. Our framework facilitates comprehensive empirical studies of GraphRAG on large-scale real-world graphs and diverse query sets, revealing insights into balancing reasoning quality, runtime efficiency, and token or GPU cost, that are essential for building advanced GraphRAG systems.</abstract><doi>10.48550/arxiv.2411.05844</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2411.05844
ispartof
issn
language eng
recordid cdi_arxiv_primary_2411_05844
source arXiv.org
subjects Computer Science - Artificial Intelligence
Computer Science - Computation and Language
Computer Science - Databases
title LEGO-GraphRAG: Modularizing Graph-based Retrieval-Augmented Generation for Design Space Exploration
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-19T02%3A37%3A57IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=LEGO-GraphRAG:%20Modularizing%20Graph-based%20Retrieval-Augmented%20Generation%20for%20Design%20Space%20Exploration&rft.au=Cao,%20Yukun&rft.date=2024-11-06&rft_id=info:doi/10.48550/arxiv.2411.05844&rft_dat=%3Carxiv_GOX%3E2411_05844%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true