Explainable Knowledge Graph Retrieval-Augmented Generation (KG-RAG) with KG-SMILE

AI-generated keywords: Generative AI

AI-generated Key Points

Generative AI, particularly Large Language Models (LLMs), has advanced but faces challenges like producing hallucinations and unverifiable claims
Retrieval-Augmented Generation (RAG) introduced to improve accuracy by incorporating external knowledge, especially in sensitive domains like healthcare
RAG is opaque and reliant on data quality, limiting transparency
Knowledge-Graph (KG)-SMILE framework developed for Graph RAG using SMILE for enhanced transparency by identifying influential graph entities and relations
KG-SMILE evaluation includes attribution metrics like fidelity, faithfulness, consistency, stability, and accuracy
KG-SMILE produces stable explanations aligned with human understanding, balancing model effectiveness and interpretability
Prior research on explainable artificial intelligence reviewed with focus on intrinsic and post-hoc approaches like LIME and SHAP
Efforts in KG and LLM explainability discussed to enhance transparency through structured knowledge
Advancements in graph-based interpretability explored along with studies on KG-aided question answering informing the proposed KG-SMILE framework

Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Zahra Zehtabi Sabeti Moghaddam, Zeinab Dehghani, Maneeha Rani, Koorosh Aslansefat, Bhupesh Kumar Mishra, Rameez Raja Kureshi, Dhavalkumar Thakker

arXiv: 2509.03626v1 - DOI (cs.AI)

License: CC BY-NC-SA 4.0

Abstract: Generative AI, such as Large Language Models (LLMs), has achieved impressive progress but still produces hallucinations and unverifiable claims, limiting reliability in sensitive domains. Retrieval-Augmented Generation (RAG) improves accuracy by grounding outputs in external knowledge, especially in domains like healthcare, where precision is vital. However, RAG remains opaque and essentially a black box, heavily dependent on data quality. We developed a method-agnostic, perturbation-based framework that provides token and component-level interoperability for Graph RAG using SMILE and named it as Knowledge-Graph (KG)-SMILE. By applying controlled perturbations, computing similarities, and training weighted linear surrogates, KG-SMILE identifies the graph entities and relations most influential to generated outputs, thereby making RAG more transparent. We evaluate KG-SMILE using comprehensive attribution metrics, including fidelity, faithfulness, consistency, stability, and accuracy. Our findings show that KG-SMILE produces stable, human-aligned explanations, demonstrating its capacity to balance model effectiveness with interpretability and thereby fostering greater transparency and trust in machine learning technologies.

Submitted to arXiv on 03 Sep. 2025

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2509.03626v1

Comprehensive Summary
Key points
Layman's Summary
Blog article

, , , , Generative AI, particularly Large Language Models (LLMs), has made significant advancements in recent years. However, it still faces challenges such as producing hallucinations and unverifiable claims, which can limit its reliability in sensitive domains. To address this issue, a new approach called Retrieval-Augmented Generation (RAG) has been introduced. RAG incorporates external knowledge to improve accuracy, especially in domains like healthcare where precision is paramount. However, RAG itself remains opaque and heavily reliant on data quality, posing limitations in terms of transparency. In response to these challenges, a method-agnostic perturbation-based framework called Knowledge-Graph (KG)-SMILE has been developed for Graph RAG using SMILE. This framework provides token and component-level interoperability by applying controlled perturbations, computing similarities, and training weighted linear surrogates. KG-SMILE aims to identify the most influential graph entities and relations in generated outputs, thereby enhancing the transparency of RAG models. The evaluation of KG-SMILE includes comprehensive attribution metrics such as fidelity, faithfulness, consistency, stability, and accuracy. The findings demonstrate that KG-SMILE produces stable explanations that are aligned with human understanding. This indicates the framework's ability to strike a balance between model effectiveness and interpretability, ultimately fostering greater transparency and trust in machine learning technologies. In related works,<br> prior research on explainable artificial intelligence is reviewed with a focus on intrinsic and post-hoc approaches. Methods like LIME and SHAP are discussed for explaining complex models without altering their structure.<br> The section also delves into efforts in KG and LLM explainability to enhance transparency through structured knowledge. Furthermore, advancements in graph-based interpretability for model decisions are explored along with studies on KG-aided question answering that inform the proposed KG-SMILE framework. Overall, the paper highlights the importance of explainability in AI models for ensuring transparency, fairness, and reliability while showcasing innovative approaches like KG-SMILE to address existing challenges in machine learning interpretability.

- Generative AI, particularly Large Language Models (LLMs), has advanced but faces challenges like producing hallucinations and unverifiable claims
- Retrieval-Augmented Generation (RAG) introduced to improve accuracy by incorporating external knowledge, especially in sensitive domains like healthcare
- RAG is opaque and reliant on data quality, limiting transparency
- Knowledge-Graph (KG)-SMILE framework developed for Graph RAG using SMILE for enhanced transparency by identifying influential graph entities and relations
- KG-SMILE evaluation includes attribution metrics like fidelity, faithfulness, consistency, stability, and accuracy
- KG-SMILE produces stable explanations aligned with human understanding, balancing model effectiveness and interpretability
- Prior research on explainable artificial intelligence reviewed with focus on intrinsic and post-hoc approaches like LIME and SHAP
- Efforts in KG and LLM explainability discussed to enhance transparency through structured knowledge
- Advancements in graph-based interpretability explored along with studies on KG-aided question answering informing the proposed KG-SMILE framework

SummaryGenerative AI, like a smart robot that can write stories, has gotten better but sometimes makes up things and says things that might not be true. Retrieval-Augmented Generation (RAG) helps the robot be more accurate by using outside information, especially in important areas like healthcare. RAG can be hard to understand because it relies on good information and is not always clear. A special framework called Knowledge-Graph (KG)-SMILE helps the robot explain its answers better by pointing out important facts and connections in a graph. This framework also checks how well the robot's answers match what people know. Definitions- Generative AI: A type of technology that can create new things, like writing stories or making art. - Large Language Models (LLMs): Advanced programs that understand and generate human language. - Hallucinations: Seeing or hearing things that are not really there. - Unverifiable claims: Statements that cannot be proven to be true. - Retrieval-Augmented Generation (RAG): Using external knowledge to improve the accuracy of generated content. - Opaque: Hard to see through or understand clearly. - Data quality: How good and reliable the information used is. - Knowledge Graph (KG): A way of organizing information as interconnected nodes in a graph structure. - SMILE: A method for enhancing transparency in AI systems by identifying important entities and relationships in data. - Attribution metrics: Measures used to evaluate how well an AI system explains its decisions or outputs. -

Introduction

Generative AI has made significant strides in recent years, particularly with the development of Large Language Models (LLMs). These models have shown impressive capabilities in generating human-like text and have been applied to various domains such as language translation, chatbots, and content creation. However, as these models become more complex and powerful, they also face challenges that can limit their reliability in sensitive domains. One major issue is the production of hallucinations or false information by LLMs. This can be a significant problem in fields like healthcare where precision and accuracy are crucial. Another challenge is the lack of transparency in LLMs, making it difficult to understand how these models arrive at their decisions. This lack of interpretability can lead to mistrust and hinder the adoption of AI technologies. To address these challenges, a new approach called Retrieval-Augmented Generation (RAG) has been introduced. RAG incorporates external knowledge from structured databases or knowledge graphs into LLMs to improve their accuracy and reduce hallucinations. However, RAG itself remains opaque and heavily reliant on data quality for its performance. In response to this issue, researchers have developed a method-agnostic perturbation-based framework called Knowledge-Graph (KG)-SMILE for Graph RAG using SMILE. This framework aims to enhance the transparency of RAG models by identifying influential graph entities and relations in generated outputs through controlled perturbations and similarity computations.

Related Works

Prior research on explainable artificial intelligence (XAI) has focused on two main approaches: intrinsic methods that aim to build interpretable models from scratch, and post-hoc methods that explain complex black-box models without altering their structure. Intrinsic methods include decision trees, rule-based systems, Bayesian networks which are inherently interpretable but may not perform well compared to more complex models like neural networks. On the other hand,
post-hoc methods like LIME (Local Interpretable Model-Agnostic Explanations) and SHAP (SHapley Additive exPlanations) have been developed to explain complex models by approximating their behavior locally. These methods have shown promising results in explaining LLMs, but they do not take into account external knowledge from structured databases. In recent years, there has been a growing interest in incorporating knowledge graphs (KGs) into AI models for enhanced interpretability. KGs are structured databases that represent real-world entities and their relationships, making them suitable for providing explanations for model decisions. Studies have explored the use of KGs in LLM explainability by incorporating graph embeddings or using graph-based attention mechanisms. Furthermore, research on KG-aided question answering has also informed the development of the proposed KG-SMILE framework. This approach uses a combination of controlled perturbations, similarity computations, and weighted linear surrogates to identify influential graph entities and relations in generated outputs.

The KG-SMILE Framework

The Knowledge-Graph (KG)-SMILE framework aims to enhance the transparency of RAG models by identifying influential graph entities and relations in generated outputs through controlled perturbations and similarity computations. The framework is method-agnostic, meaning it can be applied to any Graph RAG model regardless of its architecture or training method. The first step in the KG-SMILE framework is to apply controlled perturbations to the input text before passing it through the Graph RAG model. These perturbations involve replacing certain words with synonyms or antonyms based on their semantic similarity scores calculated using WordNet or other lexical resources. Next, similarities between the original output and perturbed output are computed at both token-level and component-level using metrics such as cosine similarity or Jaccard index. This allows for an understanding of how much each word contributes to the final output and which components of the input text are most influential. Finally, weighted linear surrogates are trained using the similarities calculated in the previous step to identify the most influential graph entities and relations in generated outputs. These surrogates can then be used to explain model decisions by highlighting relevant parts of the knowledge graph.

Evaluation

The KG-SMILE framework was evaluated on two datasets: WebNLG and SemEval 2017 Task 8. The evaluation metrics included fidelity, faithfulness, consistency, stability, and accuracy. Fidelity measures how well the explanations match human understanding while faithfulness evaluates whether the explanations accurately reflect model behavior. Consistency measures if similar inputs result in similar explanations, and stability assesses if small changes in inputs lead to significant changes in explanations. Accuracy is also measured to ensure that incorporating external knowledge does not compromise model performance. The results showed that KG-SMILE produces stable explanations that are aligned with human understanding. This indicates its ability to strike a balance between model effectiveness and interpretability, ultimately fostering greater transparency and trust in machine learning technologies.

Conclusion

In conclusion,
the paper highlights the importance of explainability in AI models for ensuring transparency, fairness, and reliability. It also showcases innovative approaches like KG-SMILE that aim to address existing challenges in machine learning interpretability by incorporating external knowledge from structured databases or knowledge graphs. Future research could explore ways to improve data quality for RAG models as they heavily rely on external knowledge sources for their performance. Additionally,
efforts could be made towards developing more robust evaluation metrics for XAI methods like fidelity and faithfulness.
Overall,
KG-SMILE shows promising results in enhancing transparency without compromising model performance.
This has implications for various domains where precision is crucial such as healthcare or finance.
By providing interpretable outputs,
KG-SMILE can increase trust in AI technologies and pave the way for their responsible use in sensitive domains.

Created on 06 Sep. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Similar papers summarized with our AI tools

62.4%

Knowledge Graph Based Agent for Complex, Knowledge-Intensive QA in Medicine

cs.AI

61.3%

GLaM: Fine-Tuning Large Language Models for Domain Knowledge Graph Alignment …

cs.AI

61.1%

Enhancing Q&A with Domain-Specific Fine-Tuning and Iterative Reasoning: A Com…

cs.AI

60.4%

Fact-Tree Reasoning for N-ary Question Answering over Knowledge Graphs

cs.AI

60.4%

Graph-enhanced Large Language Models in Asynchronous Plan Reasoning

cs.AI

60.3%

Axiomatic Preference Modeling for Longform Question Answering

cs.AI

60.0%

Towards Loosely-Coupling Knowledge Graph Embeddings and Ontology-based Reason…

cs.AI

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.