A Semantic Framework for Extracting Taxonomic Relations from Text Corpus-Reference-Cited by-同舟云学术

A Semantic Framework for Extracting Taxonomic Relations from Text Corpus

Published:2019-05-01 Issue:3 Volume:17 Page:325-337
ISSN:2309-4524
Container-title:The International Arab Journal of Information Technology
language:en
Short-container-title:IAJIT

Author:

Hong Doan Phuoc Thi¹,Arch-int Ngamnij¹,Arch-int Somjit¹

Affiliation:

1. Department of Computer Science, Khon Kaen University, Thailand

Abstract

Nowadays, ontologies have been exploited in many current applications due to the abilities in representing knowledge and inferring new knowledge. However, the manual construction of ontologies is tedious and time-consuming. Therefore, the automated ontology construction from text has been investigated. The extraction of taxonomic relations between concepts is a crucial step in constructing domain ontologies. To obtain taxonomic relations from a text corpus, especially when the data is deficient, the approach of using the web as a source of collective knowledge (a.k.a web-based approach) is usually applied. The important challenge of this approach is how to collect relevant knowledge from a large amount of web pages. To overcome this issue, we propose a framework that combines Word Sense Disambiguation (WSD) and web approach to extract taxonomic relations from a domain-text corpus. This framework consists of two main stages: concept extraction and taxonomic-relation extraction. Concepts acquired from the concept-extraction stage are disambiguated through WSD module and passed to stage of extraction taxonomic relations afterward. To evaluate the efficiency of the proposed framework, we conduct experiments on datasets about two domains of tourism and sport. The obtained results show that the proposed method is efficient in corpora which are insufficient or have no training data. Besides, the proposed method outperforms the state of the art method in corpora having high WSD results.

Publisher

Zarqa University

Subject

General Computer Science

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Systematic Approach for Measuring Semantic Relatedness between Ontologies;Electronics;2023-03-15

2. Review on knowledge extraction from text and scope in agriculture domain;Artificial Intelligence Review;2022-09-29

3. Refined Search and Attribute-Based Encryption Over the Cloud Data;Proceedings of International Conference on Deep Learning, Computing and Intelligence;2022

4. Enriching Domain Concepts with Qualitative Attributes (A Text Mining based Approach);The International Arab Journal of Information Technology;2020-11-01

5. A Novel Statistic-Based Corpus Machine Processing Approach to Refine a Big Textual Data: An ESP Case of COVID-19 News Reports;Applied Sciences;2020-08-09