Similarity Calculation via Passage-Level Event Connection Graph

Author:

Liu MingORCID,Chen Lei,Zheng Zihao

Abstract

Recently, many information processing applications appear on the web on the demand of user requirement. Since text is one of the most popular data formats across the web, how to measure text similarity becomes the key challenge to many web applications. Web text is often used to record events, especially for news. One text often mentions multiple events, while only the core event decides its main topic. This core event should take the important position when measuring text similarity. For this reason, this paper constructs a passage-level event connection graph to model the relations among events mentioned in one text. This graph is composed of many subgraphs formed by triggers and arguments extracted sentence by sentence. The subgraphs are connected via the overlapping arguments. In term of centrality measurement, the core event can be revealed from the graph and utilized to measure text similarity. Moreover, two improvements based on vector tunning are provided to better model the relations among events. One is to find the triggers which are semantically similar. By linking them in the event connection graph, the graph can cover the relations among events more comprehensively. The other is to apply graph embedding to integrate the global information carried by the entire event connection graph into the core event to let text similarity be partially guided by the full-text content. As shown by experimental results, after measuring text similarity from a passage-level event representation perspective, our calculation acquires superior results than unsupervised methods and even comparable results with some supervised neuron-based methods. In addition, our calculation is unsupervised and can be applied in many domains free from the preparation of training data.

Funder

The research in this article is supported by the National Key Research and Development Project

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Reference56 articles.

1. Ji, H., and Grishman, R. Refining event extraction through cross-document inference. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.

2. Baker, C., Fillmore, C., and Lowe, J. The berkeley framenet project. Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics.

3. Jacksi, K., Ibrahim, R., Zeebaree, S., Zebari, R., and Sadeeq, M. Clustering documents based on semantic similarity using HAC and K-mean algorithms. Proceedings of the 2020 International Conference on Advanced Science and Engineering.

4. Huang, X., Qi, J., Sun, Y., and Zhang, R. Mala: Cross-domain dialogue generation with action learning. Proceedings of the 34th AAAI Conference on Artificial Intelligence.

5. Kieu, B., Unanue, I., Pham, S., Phan, H., and Piccardi, M. Learning neural textual representations for citation recommendation. Proceedings of the 25th International Conference on Pattern Recognition.

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. English Speech Scoring System Based on Computer Neural Network;International Journal of Education and Humanities;2022-10-27

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3