Research on the construction and application of retrieval enhanced generation (RAG) model based on knowledge graph

基于知识图谱的检索增强生成(RAG)模型构建与应用研究

阅读:1

Abstract

Generative pre-trained language models have demonstrated strong capabilities in natural language processing tasks, but they still suffer from "fact hallucination" and weak knowledge timeliness in open-domain question answering and text generation. To improve the accuracy and knowledge consistency of generated content, this paper proposes a Knowledge Graph-based Retrieval Enhanced Generation Model (KG-RAG), which integrates structured knowledge graphs into traditional RAG architectures to enhance the model's ability to understand semantic and inferential relationships. The model designs a dual-channel retrieval mechanism: on one hand, it uses Dense Passage Retrieval (DPR) for vectorized retrieval of unstructured texts; on the other hand, it employs graph neural networks (GNN) to structurally retrieve semantic paths within the knowledge graph, and through path attention mechanisms, it filters out the most relevant entity relationship chains to guide the knowledge injection module. Experimental results on the Natural Questions and PubMedQA datasets show that KG-RAG outperforms the original RAG model across multiple evaluation metrics. On the Natural Questions dataset, the ROUGE-L score of the KG-RAG model improves from 41.2 to 46.9, the BLEU score rises from 31.5 to 38.7, and the FactScore increases by 13.6%, significantly enhancing the knowledge consistency of the generated text. In the PubMedQA task, KG-RAG achieves an accuracy rate of 81% in medical question answering. 3%, an improvement of 6.8% points over RAG, demonstrates its advantage in knowledge reasoning within specialized fields. Furthermore, case studies show that KG-RAG can effectively integrate entity paths from knowledge graphs to generate more logical and factual answers in complex question-answering tasks. This method has broad application prospects in intelligent question-answering systems, multi-turn conversations, and educational Q&A scenarios. Future research will consider introducing dynamic knowledge update mechanisms and multimodal graph information to further enhance the capabilities and adaptability of KG-RAG in real-world tasks.

特别声明

1、本页面内容包含部分的内容是基于公开信息的合理引用;引用内容仅为补充信息,不代表本站立场。

2、若认为本页面引用内容涉及侵权,请及时与本站联系,我们将第一时间处理。

3、其他媒体/个人如需使用本页面原创内容,需注明“来源:[生知库]”并获得授权;使用引用内容的,需自行联系原作者获得许可。

4、投稿及合作请联系:info@biocloudy.com。