Graph RAG: Access to external data becomes much more accurate with Microsoft's approach

Microsoft researchers have developed a method called Graph RAG that significantly enhances how language models process external data.

Published in April, the approach allows for more meaningful answers to general queries across large databases compared to traditional methods that simply string together text snippets. According to Microsoft, in just a few months, Graph RAG has demonstrated clear advantages over traditional vector-based RAG approaches.

Retrieval augmented generation (RAG) is a common technique for answering user queries on large data sets that exceed the context window of a language model. While models such as Gemini 1.5 Pro can now process up to two million tokens at a time, RAG frameworks first retrieve relevant information from external sources to enrich the context of an original query. That way, it can still help improve answers.

Knowledge graph of entity nodes and relationship edges derived from a message dataset. Different colors represent different groups at the highest (left) and second-highest (right) levels. | Image: Microsoft

However, conventional vector-based RAG works best when answers are contained locally in text areas. It struggles with query-focused summaries spanning entire document collections.

Graph RAG differs by combining the modularity of knowledge graphs with the capabilities of language models. This makes it much more effective for questions such as "Which public figures are mentioned in various entertainment articles?"

The process begins by extracting chunks of text from source documents. A language model then identifies entities and relationships within each chunk. Entities in a knowledge graph can include people, companies, or places, for example.

To answer a user query, the system prepares grouped summaries and generates parallel intermediate answers for each chunk. These are then condensed into a final answer through query-focused summarization. Microsoft says this approach is more efficient than directly summarizing source texts, as each query requires fewer context tokens.

Graph RAG also excels at identifying higher-level topics in databases. Answers are more comprehensive than with vector RAG, while source information makes it easier to verify claims.

Recommendation

AI research

Scaling laws for precision: AI researcher sees "perfect storm" for the end of scale

A key advantage of knowledge graphs over vector databases is that humans can more easily explore and add new content. This combination of language models and knowledge graphs as external information sources could make AI answers more verifiable.

The researchers tested their approach on podcast transcripts and news articles. They generated questions using a language model and compared the performance of Graph RAG at different levels of summarization with traditional RAG. Benchmarks showed that all Graph RAG approaches outperformed traditional methods in terms of completeness and variety.

Microsoft pitted Graph RAG against Text Summaries (TS) and Vector RAG (VS) at different levels of group summarization and had an LLM evaluate them. The cells show the win rate of the method in the row versus the method in the column. | Image: Microsoft

Microsoft released Graph RAG as a Python implementation on GitHub and Azure in early July. LinkedIn's initial customer support tests are promising. Over a six-month period, response times to support requests dropped by an average of nearly 30%. Extensive independent benchmarking has yet to be done. Alternatives are available from companies such as neo4j.

The researchers acknowledge some limitations in their evaluation. They only tested Graph RAG on corpora of about one million tokens – theoretically small enough to fit in a single text prompt. However, language models still struggle with "lost in the middle" issues for long documents, which RAG systems have been shown to mitigate. Graph RAG and similar approaches might be able to combine the best of both worlds.

Join our community

Join the DECODER community on Discord, Reddit or Twitter - we can't wait to meet you.

Graph RAG: Access to external data becomes much more accurate with Microsoft's approach

Scaling laws for precision: AI researcher sees "perfect storm" for the end of scale

AI startup Prime Intellect trains first distributed LLM across three continents

AI researcher says that ChatGPT's "secret ingredient" may be holding back LLM capabilities

OpenAI slashes fine-tuning costs for GPT-4o mini with special limited-time offer

OpenAI launches GPT-5 as a unified system with adaptive reasoning for complex tasks

Google Deepmind's Genie 3 creates interactive 3D worlds that stay consistent for "multiple minutes"

Google upgrades Gemini with Deep Think and flags early warning risks

Graph RAG: Access to external data becomes much more accurate with Microsoft's approach

Share

Bank details