Affiliation:
1. The College of Artificial Intelligence, Tianjin University of Science and Technology, Tianjin 300457, China
Abstract
In recent years, the emergence of large-scale language models such as ChatGPT has posed significant challenges to research on knowledge graphs and knowledge-based reasoning, shifting the direction of work on knowledge reasoning. Two critical issues in knowledge reasoning research are the design of the reasoning model itself and the selection of paths. Most studies use an LSTM as the path encoder and memory module. However, when processing long sequences, LSTMs suffer from the long-term dependency problem: the memory cells decay as the number of time steps grows, so earlier inputs are gradually forgotten and performance on long sequences declines. In addition, as the data volume and network depth increase, there is a risk of vanishing gradients. This study improves and optimizes the LSTM model to effectively address the problems of exploding and vanishing gradients, employs an attention layer to alleviate the long-term dependency issue, and uses ConvR embeddings to guide path selection and action pruning in a reinforcement-learning inference model. The overall model achieves strong reasoning results.
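The abstract describes the architecture only at a high level. The following minimal PyTorch sketch illustrates how the named components could fit together: an LSTM path encoder, an attention layer over the encoded history, and a ConvR-style scorer used to prune candidate actions. All class names, dimensions, and the top-k pruning rule are illustrative assumptions, not the paper's implementation.

```python
# A minimal sketch (not the authors' released code) of the components named in
# the abstract. Layer sizes, the ConvR simplification, and the top-k pruning
# rule are assumptions for illustration only.
import torch
import torch.nn as nn
import torch.nn.functional as F


class ConvRScorer(nn.Module):
    """Simplified ConvR-style scoring: the relation embedding is reshaped into
    convolution filters applied over the (reshaped) head-entity embedding; the
    result is projected and compared with candidate tail embeddings."""

    def __init__(self, ent_dim=64, n_filters=8, k=3):
        super().__init__()
        self.n_filters, self.k = n_filters, k
        self.side = int(ent_dim ** 0.5)              # entity vector viewed as a 2-D map
        conv_out = n_filters * (self.side - k + 1) ** 2
        self.fc = nn.Linear(conv_out, ent_dim)

    def forward(self, head, rel, tails):
        # head: (ent_dim,), rel: (n_filters*k*k,), tails: (num_candidates, ent_dim)
        fmap = head.view(1, 1, self.side, self.side)
        kernels = rel.view(self.n_filters, 1, self.k, self.k)
        conv = F.relu(F.conv2d(fmap, kernels))       # relation-specific filters
        proj = self.fc(conv.flatten())               # (ent_dim,)
        return tails @ proj                          # one score per candidate tail


class PathReasoner(nn.Module):
    """LSTM path encoder plus attention over past hidden states (to ease
    long-range dependencies), producing a policy over ConvR-pruned actions."""

    def __init__(self, ent_dim=64, rel_dim=72, hidden=128):
        super().__init__()
        self.lstm = nn.LSTM(ent_dim + rel_dim, hidden, batch_first=True)
        self.attn = nn.MultiheadAttention(hidden, num_heads=4, batch_first=True)
        self.policy = nn.Linear(hidden + ent_dim, 1)

    def forward(self, path_steps, cand_entities, conv_scores, keep=5):
        # path_steps: (1, T, ent_dim+rel_dim)  embeddings of the path so far
        # cand_entities: (N, ent_dim)          embeddings of candidate next hops
        # conv_scores: (N,)                    ConvR scores used for action pruning
        hs, _ = self.lstm(path_steps)                # (1, T, hidden)
        query = hs[:, -1:, :]                        # current state attends
        ctx, _ = self.attn(query, hs, hs)            # over the whole history
        state = ctx.squeeze(0).squeeze(0)            # (hidden,)

        # Action pruning: keep only the top-k candidates by ConvR score.
        keep = min(keep, conv_scores.numel())
        _, idx = conv_scores.topk(keep)
        pruned = cand_entities[idx]                  # (keep, ent_dim)

        logits = self.policy(
            torch.cat([state.expand(keep, -1), pruned], dim=-1)
        ).squeeze(-1)
        return F.softmax(logits, dim=-1), idx        # policy over pruned actions


if __name__ == "__main__":
    torch.manual_seed(0)
    scorer, reasoner = ConvRScorer(), PathReasoner()
    head, rel = torch.randn(64), torch.randn(72)
    candidates = torch.randn(20, 64)
    scores = scorer(head, rel, candidates)
    probs, kept = reasoner(torch.randn(1, 6, 64 + 72), candidates, scores)
    print(probs.shape, kept.tolist())
```

In this sketch the ConvR scores serve only to shrink the action space before the policy network scores the remaining candidates; how the paper combines the two signals (e.g., as a reward shaping term or a hard filter) is not specified in the abstract.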
Funder
National Natural Science Foundation of China
Cited by
1 article.