ABCs of KMArtificial intelligenceBrain powerFeatured Stories

Using GraphRAG to enhance LLM-based information retrieval supporting knowledge management (KM)

This article is part of an ongoing series looking at AI in KM, and KM in AI.

Since the release of OpenAI’s ChatGPT in 2022, artificial intelligence (AI) large language models (LLMs) have revolutionized information processing because of their exceptional capabilities to understand context and generate text that closely resembles human language. However, LLMs face limitations such as knowledge rigidity, reliance on static training data, and susceptibility to generating inaccurate or misleading outputs.

In a recent paper1 in the journal Electronics, authors Hisatoshi Naganawa, Enna Hirata, and Akira Yamada advise that these challenges highlight the need for hybrid approaches that combine LLMs with external databases to enhance reliability and contextual accuracy.

In response, they introduce a novel information management approach that uses graph retrieval-augmented generation (GraphRAG) to systematically organize and integrate literature related to the physical internet. Retrieval-augmented generation (RAG)2 improves LLM outputs by incorporating external information retrieval before generating responses. Unlike LLMs alone that rely on static training data, RAG pulls relevant text from databases, uploaded documents, or web sources.

As shown in Figure 1, GraphRAG has the potential to improve LLM outputs even further by also incorporating knowledge graphs, which were introduced in a previous RealKM Magazine article series. Naganawa and colleagues explore this potential using the case study of the physical internet.

Different methods using LLMs.
Figure 1. Different methods using LLMs (source: Naganawa et al., 2025).

The physical internet and its information challenges

The concept of the physical internet was first introduced3 as a transformative framework for sustainable logistics. By applying the design principles of the digital Internet, the physical internet aims to create an interoperable, collaborative and efficient freight transport system.

The core idea is to route modular containers through transit hubs, called physical internet hubs, to maximize consolidation opportunities and minimize inefficiencies in transportation networks. Over the past decade, the physical internet has received considerable attention from researchers and practitioners alike for its potential to address critical challenges such as greenhouse gas emissions and resource inefficiencies in global logistics systems.

The growing body of literature on the physical internet reflects its increasing importance. The growing academic interest in the the physical internet is can be seen in the increasing volume of peer-reviewed publications and systematic literature reviews over the past decade.

However, the synthesis of current knowledge in the physical internet field is severely hindered by fragmentation, methodological issues, and the rate at which new research is produced. Manual review articles on the physical internet have reviewed less than 700 articles, yet there are more than 2000 scientific papers already published.

In contrast, by building a comprehensive, semantically rich knowledge graph, GraphRAG enables information to be integrated and retrieved from thousands of diverse sources. This scalable, flexible framework for information organization fills a critical gap left by traditional review methods. GraphRAG excels at uncovering relationships among data points, retrieving highly relevant information, and generating contextually rich outputs. These capabilities make it particularly well-suited for addressing complex problems with interconnected data structures, such as those found in physical internet research.

Technologies similar to GraphRAG are HybridRAG and LightRAG. The differences in these technologies are summarized in Table 1.

Table 1. Comparison of GraphRAG, HybridRAG, and LightRAG (source: Naganawa et al., 2025).
Feature GraphRAG HybridRAG LightRAG
Retrieval Method Graph-based plus semantic search Combination of vector search and keyword search Pure vector-based retrieval
Structure Awareness Captures direct and indirect relationships via graph topology Limited. Sometimes captures superficial relevance Low. Relationships are not structurally modeled
Response Quality High contextual depth with relational grounding Moderate. May miss deeper or indirect context Fast but may include fragmented or shallow responses
Scalability Moderate. Depends on graph complexity High. Simple integration of vector and keyword methods Very high. Minimal processing overhead

Overall, hybridRAG combines vector search and keyword search, enabling it to obtain data with strong superficial relevance, but it is difficult to capture indirect relationships. In addition, LightRAG is basically a vector search based on simple similarity and does not consider relationships, so although it is fast in generation, there is a high possibility that fragmentary answers will be mixed in the generated results.

Therefore, Naganawa and colleagues consider GraphRAG to have the best potential for addressing the information challenges of the physical internet.

Building the knowledge graph

As shown in Figure 2, construction of the knowledge graph (KG) began with data from four key sources:

  1. Scientific literature, made up of 2537 relevant papers published between 2006 and 2024.
  2. Proceedings of the International Physical Internet Conference (IPIC).
  3. The European physical internet roadmap developed by the Alliance for Logistics Innovation through Collaboration in Europe (ALICE).
  4. The national physical internet roadmap prepared by Japan’s Ministry of Economy, Trade and Industry (METI).

The raw data collected from the four sources was aggregated into a unified dataset. To ensure data quality, noise such as stop words, URLs, punctuation, and other irrelevant character symbols were removed. The cleaned and standardized corpus served as the input for subsequent graph construction.

Data processing pipeline.
Figure 2. Data processing pipeline (source: Naganawa et al., 2025).

After preprocessing, the cleaned data was hosted in a Neo4j AuraDB graph database. Using GPT-4o mini, entity and relationship extraction was conducted to form nodes and edges, which together created a structured knowledge graph representing the physical internet domain. The knowledge graph is shown in Figure 3.

Knowledge graph.
Figure 3. Knowledge graph (source: Naganawa et al., 2025).

Performance evaluation

Four retrieval strategies were implemented and compared to evaluate their performance in generating insights. The four strategies are, I: GPT-only, II: Local Search, III: Global Search, and IV: Hybrid Search. The details are explained in Table 2. Hybrid search is a strategy that combines “details” (Local) and “context” (Global) to generate more accurate and comprehensive information.

Table 2. Outlines of searching methods applied in the model (source: Naganawa et al., 2025).
Method Description
GPT- only Normal LLM response generation (e.g., ChatGPT). GPT relies solely on the model’s prior knowledge, without database augmentation.
Local Search Search for nodes and edges in the graph that are semantically close to the query and retrieve directly related nodes and edges from the graph database. Local Search returns nodes and connected subgraphs that are directly relevant to the exact query.
Global Search Identify communities (clusters) in the graph and obtain a summary of the entire community to which the query relates. Global Search retrieves broader, community-level graph segments to provide contextual breadth.
Hybrid Search Integrates Local Search and Global Search to generate comprehensive responses with specific examples and background information. Hybrid Search combines node-level detail and global context, using reranking to achieve holistic relevance.

To preliminarily evaluate GraphRAG’s usefulness under different user needs, Naganawa and colleagues designed stakeholder-specific prompts for three groups: industry practitioners, policymakers, and academic researchers. Several parameters were employed to optimize GraphRAG’s performance, and metrics were used to quantitatively assess the quality of responses generated by each strategy.

The results are shown in Table 3, demonstrating GraphRAG’s potential as a powerful tool for organizing and synthesizing fragmented knowledge in complex domains like the physical internet.

Table 3. Comparison of GPT-only model (I) and GraphRAG models (II and IV) (source: Naganawa et al., 2025).
Feature GPT-Only (I) Local Search (II) Hybrid Search (IV)
Specificity Moderate High Very High
Novelty Commonplace ideas Emerging concepts Advanced, integrative
Context Low Node-level Node + community (local + global)
Actionability Basic guidance Operational insights Strategic + system-wide recommendations
Stakeholder alignment General Contextualized Deeply tailored

The tailored prompts also revealed unique insights for each stakeholder group. This enhances GraphRAG’s applicability in real-world decision-making contexts. Industry practitioners can identify new business opportunities; policymakers can design informed strategies; and academics can uncover research gaps. All of these can be performed in a unified platform.

Limitations of the research

In concluding their paper, Naganawa and colleagues alert that wile their study possesses certain strengths, it also has several limitations. These include that while GraphRAG tends to exhibit lower hallucination risk compared to standard LLM generation, this risk is not entirely eliminated. Even with source linking capabilities, the model may still generate inaccurate yet plausible descriptions, potentially undermining user trust in identified research gaps or findings.

To mitigate such risks, it is crucial to provide referable and transparent outputs, such as explicitly indicating which graph nodes or edges support each part of the response. This enables users to verify the basis of the model’s answers and more effectively evaluate the reliability of the presented findings.

Article source: Naganawa et al., 2025. CC BY 4.0.

Header image source: Buzz on Pixabay.

References:

  1. Naganawa, H., Hirata, E., & Yamada, A. (2025). Implementing a Knowledge Management System with GraphRAG: A Physical Internet Example. Electronics, 14(24), 4948.
  2. Wikipedia, CC BY-SA 4.0.
  3. Montreuil, B., Meller, R. D., & Ballot, E. (2010). Towards a Physical Internet: the impact on logistics facilities and material handling systems design and innovation. 11th IMHRC Proceedings (Milwaukee, Wisconsin. USA – 2010).

Bruce Boyes

Bruce Boyes is editor, lead writer, and a director of RealKM Magazine and winner of the International Knowledge Management Award 2025 (Individual Category). He is an experienced knowledge manager, environmental manager, project manager, communicator, and educator, and holds a Master of Environmental Management with Distinction and a Certificate of Technology (Electronics). His many career highlights include: establishing RealKM Magazine as an award-winning resource with more than 2,500 articles and 5 million reader views, leading the knowledge management (KM) community KM and Sustainable Development Goals (SDGs) initiative, using agile approaches to oversee the on time and under budget implementation of an award-winning $77.4 million recovery program for one of Australia's iconic river systems, leading a knowledge strategy process for Australia’s 56 natural resource management (NRM) regional organisations, pioneering collaborative learning and governance approaches to empower communities to sustainably manage landscapes and catchments in the face of complexity, being one of the first to join a new landmark aviation complexity initiative, initiating and teaching two new knowledge management subjects at Shanxi University in China, and writing numerous notable environmental strategies, reports, and other works.

Related Articles

Leave a Reply

Your email address will not be published. Required fields are marked *

Back to top button