A Combined Weighting for the Feature-Based Method on Topological Parameters in Semantic Taxonomy Using Social Media

Published:2020-02-01 Issue:1 Volume:769 Page:012002
ISSN:1757-8981
Container-title:IOP Conference Series: Materials Science and Engineering
Short-container-title:IOP Conf. Ser.: Mater. Sci. Eng.

Author:

Muttaleb Hasan Ali,Mohd Noor Noorhuzaimi,Hussein Rassem Taha,Muttaleb Hasan Ahmed,Hammood Waleed A.

Abstract

Textual analysis has become one of the most important tasks because of the rapid increase in the number of texts that are continuously generated in several forms, such as posts and chats in social media, emails, articles, and news. Managing these texts requires efficient and effective methods that can handle the linguistic issues arising from the complexity of natural languages. In recent years, researchers have widely investigated the exploitation of semantic features from lexical sources to deal with the issues of “synonymy and ambiguity” in social media tasks such as document clustering. The main challenges of exploiting lexical knowledge sources such as WordNet 3.1 in these tasks are how to integrate the various types of semantic relations to capture additional semantic evidence, and how to address the high dimensionality of current semantic representation approaches. This paper proposes a combined weighting of features for a new semantic feature-based method that combines four elements: “Synonymy, Hypernym, non-taxonomy, and Glosses”. This research therefore proposes a new knowledge-based semantic representation approach for text mining that can handle both the linguistic issues and the high dimensionality issue. The proposed approach consists of two main components: a feature-based method for incorporating the relations in the lexical sources, and a topic-based reduction method to overcome the high dimensionality issue. The proposed approach will be evaluated using WordNet 3.1 in text clustering and text classification.

Publisher

IOP Publishing

Subject

General Medicine

Link

https://iopscience.iop.org/article/10.1088/1757-899X/769/1/012002/pdf

Reference: 29 articles.

1. Semantic frame identification with distributed word representations;Hermann;Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers),2014

2. Reducing explicit semantic representation vectors using Latent Dirichlet Allocation;Saif;Knowledge-Based Systems,2016

3. Taxonomy-based information content and wordnet-wiktionary-wikipedia glosses for semantic relatedness;Aouicha,2016

4. Derivation of “is a” taxonomy from Wikipedia Category Graph;Aouicha;Engineering Applications of Artificial Intelligence,2016

5. Restoring the missing features of the corrupted speech using linear interpolation methods;Rassem;AIP Conference Proceedings,2017

Cited by 1 article.

1. Determinants Influencing E- Payment Adoption Amidst COVID-19: A Conceptual Framework;2023 7th International Conference on New Media Studies (CONMEDIA);2023-12-06