Data and Knowledge Organization for Natural Language Processing: Searching and Identifying Better Arrangements of Texts Based on Multimodal Information Architecture-Reference-Cited by-同舟云学术

Data and Knowledge Organization for Natural Language Processing: Searching and Identifying Better Arrangements of Texts Based on Multimodal Information Architecture

Published:2024-01 Issue:1 Volume:14 Page:
ISSN:2158-2440
Container-title:Sage Open
language:en
Short-container-title:Sage Open

Author:

Kuroki Júnior George Hideyuki¹^ORCID,Gottschalg-Duque Cláudio²

Affiliation:

1. University of Brasília, Distrito Federal, Brazil

2. Artificial Intelligence Excellence Center - CEIA/UFG

Abstract

Processing texts of multiple knowledge areas is a hard task. This article presents an Information Science contribution to natural language processing based on artificial neural networks through data arrangement. An extended concept of Information architecture was used, aggregating a multimodal view of organizing data. The Multimodal Information Architecture definition served as foundations for a five-step procedure to design, analyze and transform data used for artificial neural networks training and learning methods, complementing gaps identified by authors focused on Computer Science implementations. The proposal was validated with three datasets formed by texts coming from 16 knowledge areas. Results obtained through the usage of pre-processed data and raw data where compared. In each of the three datasets, the method identified arrangements which led to better and worst results, separating which corpus samples are more susceptible for knowledge extraction.

Publisher

SAGE Publications

Link

http://journals.sagepub.com/doi/pdf/10.1177/21582440231177042

Reference28 articles.

1. Deep Machine Learning - A New Frontier in Artificial Intelligence Research [Research Frontier]

2. Bahdanau D., Cho K., Bengio Y. (2014). Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv: 1409.0473.

3. The theory of dynamic programming

4. Modalities and Multimodalities

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Towards the use of Blockchain Technology in SEI, a Brazilian Electronic Document and Process Management Tool;Anais do II Colóquio em Blockchain e Web Descentralizada (CBlockchain 2024);2024-07-21