RegRL-KG: Learning an L1 regularized reinforcement agent for keyphrase generation

Author:

Yao Yu,Yang Peng,Zhao Guangzhen,Leng Juncheng

Abstract

Keyphrase generation (KG) aims at condensing the content from the source text to the target concise phrases. Though many KG algorithms have been proposed, most of them are tailored into deep learning settings with various specially designed strategies and may fail in solving the bias exposure problem. Reinforcement Learning (RL), a class of control optimization techniques, are well suited to compensate for some of the limitations of deep learning methods. Nevertheless, RL methods typically suffer from four core difficulties in keyphrase generation: environment interaction and effective exploration, complex action control, reward design, and task-specific obstacle. To tackle this difficult but significant task, we present RegRL-KG, including actor-critic based-reinforcement learning control and L1 policy regularization under the first principle of minimizing the maximum likelihood estimation (MLE) criterion by a sequence-to-sequence (Seq2Seq) deep learnining model, for efficient keyphrase generation. The agent utilizes an actor-critic network to control the generated probability distribution and employs L1 policy regularization to solve the bias exposure problem. Extensive experiments show that our method brings improvement in terms of the evaluation metrics on five scientific article benchmark datasets.

Publisher

IOS Press

Subject

Artificial Intelligence,Computer Vision and Pattern Recognition,Theoretical Computer Science

Reference32 articles.

1. Neuron like adaptive elements that can solve difficult learning control problems;Barto;IEEE Transactions on Systems, Man, and Cybernetics,1983

2. F. Boudin, Y. Gallina and A. Aizawa, Keyphrase generation for scientific document retrieval, in: Proceedings of the Annual Meeting of the Association for Computational Linguistics, ACL 20’, Online, 2016, pp. 1118–1126.

3. K. Bousmalis, G. Trigeorgis, N. Silberman, D. Krishnan and D. Erhan, Domain Separation Networks, in: Proceedings of the Advances in Neural Information Processing Systems, NIPS 16’, Barcelona, Spain, 2016, pp. 343–351.

4. H.P. Chan, W. Chen, L. Wang and I. King, Neural keyphrase generation via reinforcement learning with adaptive rewards, in: Proceedings of the Annual Meeting of the Association for Computational Linguistics, ACL 19’, Florence, Italy, 2019, pp. 2163–2174.

5. W. Chen, H.P. Chan, P.J. Li, L.D. Bing and I. King, An integrated approach for keyphrase generation via exploring the power of retrieval and extraction, in: Proceedings of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 19’, Minneapolis, Minnesota, 2019, pp. 2846–2856.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3