Short Text Semantic Similarity Measurement Approach Based on Semantic Network-Reference-Cited by-同舟云学术

Short Text Semantic Similarity Measurement Approach Based on Semantic Network

Published:2022-12-05 Issue:6(Suppl.) Volume:19 Page:1581
ISSN:2411-7986
Container-title:Baghdad Science Journal
language:
Short-container-title:Baghdad Sci.J

Author:

Hameed Naamah Hussien^ORCID,Alimi Adel M.,Sadiq Ahmed T.^ORCID

Abstract

Estimating the semantic similarity between short texts plays an increasingly prominent role in many fields related to text mining and natural language processing applications, especially with the large increase in the volume of textual data that is produced daily. Traditional approaches for calculating the degree of similarity between two texts, based on the words they share, do not perform well with short texts because two similar texts may be written in different terms by employing synonyms. As a result, short texts should be semantically compared. In this paper, a semantic similarity measurement method between texts is presented which combines knowledge-based and corpus-based semantic information to build a semantic network that represents the relationship between the compared texts and extracts the degree of similarity between them. Representing a text as a semantic network is the best knowledge representation that comes close to the human mind's understanding of the texts, where the semantic network reflects the sentence's semantic, syntactical, and structural knowledge. The network representation is a visual representation of knowledge objects, their qualities, and their relationships. WordNet lexical database has been used as a knowledge-based source while the GloVe pre-trained word embedding vectors have been used as a corpus-based source. The proposed method was tested using three different datasets, DSCS, SICK, and MOHLER datasets. A good result has been obtained in terms of RMSE and MAE.

Publisher

College of Science for Women

Subject

General Physics and Astronomy,Agricultural and Biological Sciences (miscellaneous),General Biochemistry, Genetics and Molecular Biology,General Mathematics,General Chemistry,General Computer Science

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Research on Emotion Classification Based on Multi-modal Fusion;Baghdad Science Journal;2024-02-25