Knowledge-based Data Processing for Multilingual Natural Language Analysis

Author:

Jain Deepak Kumar (1, 2, 3), Eyre Yamila García-Martínez (4), Kumar Akshi (5), Gupta Brij B. (6), Kotecha Ketan (7, 8)

Affiliation:

1. Key Laboratory of Intelligent Control and Optimization for Industrial Equipment of Ministry of Education, Dalian University of Technology, Dalian, 116024, China.

2. School of Artificial Intelligence, Dalian, China.

3. Symbiosis Institute of Technology, Symbiosis International University, Pune, India

4. Universidad Internacional de La Rioja (UNIR), Avda. de la Paz, Logroño (La Rioja), Spain

5. Department of Computing and Mathematics, Manchester Metropolitan University, Manchester, United Kingdom

6. Department of Computer Engineering, National Institute of Technology, Kurukshetra, India

7. Symbiosis Centre for Applied Artificial Intelligence, Symbiosis International University, Pune, India

8. School of Mathematical Sciences, Sunway University, Malaysia

Abstract

Natural Language Processing (NLP) helps intelligent machines understand human language, enabling language-based human-computer communication. Recent gains in processing power, together with the availability of large volumes of linguistic data, have increased the demand for data-driven methods for automatic semantic analysis. This paper proposes a multilingual data processing approach that combines feature extraction with classification using deep learning architectures. The input text data is collected in various languages and preprocessed to remove missing and null values. Features are then extracted using Histogram Equalization based Global Local Entropy (HEGLE) and classified using a Kernel-based Radial Basis Function (Ker_Rad_BF). These architectures can be used to process natural language. We address the multilingual sentiment analysis problem by implementing these algorithms and comparing their precision factors to identify the best option. On the HASOC dataset, the proposed HEGLE_Ker_Rad_BF achieved an accuracy of 98%, a precision of 97%, a recall of 90.5%, an F1 score of 85%, an RMSE of 55.6%, and 44% in the loss curve analysis. On the TRAC dataset, it achieved an accuracy of 98%, a precision of 97%, a recall of 91%, an F1 score of 87%, and an RMSE of 55%.
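The abstract names three generic building blocks: removing missing and null records, a radial basis function kernel, and accuracy, precision, recall and F1 as evaluation metrics. The following is a minimal sketch of those blocks in plain Python, not the authors' implementation. The abstract does not specify HEGLE or Ker_Rad_BF in enough detail to reproduce them, so the function names (`drop_missing`, `rbf_kernel`, `classification_metrics`), the `gamma` parameter, and the binary-label setting are all assumptions made for illustration.

```python
import math


def drop_missing(records):
    """Keep (text, label) pairs whose text is a non-blank string and whose label is not None.

    Hypothetical helper: the abstract only says missing and null values are removed.
    """
    return [
        (text, label)
        for text, label in records
        if isinstance(text, str) and text.strip() and label is not None
    ]


def rbf_kernel(x, z, gamma=0.5):
    """Standard Gaussian RBF kernel: exp(-gamma * ||x - z||^2).

    gamma is an assumed hyperparameter; the abstract gives no value.
    """
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, z))
    return math.exp(-gamma * sq_dist)


def classification_metrics(y_true, y_pred, positive=1):
    """Accuracy, precision, recall and F1 for a binary labelling (assumed setting)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    correct = sum(1 for t, p in pairs if t == p)

    accuracy = correct / len(pairs) if pairs else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}
```

In a full pipeline, feature vectors produced by the extraction step would be passed to `rbf_kernel` inside a kernel-based classifier, and its predictions scored with `classification_metrics`. How the paper aggregates its metrics over classes is not stated in the abstract.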

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science


Cited by 4 articles.

1. Publishing, linking and translating news in multilingual communities: a mirror of cultural differences?;Proceedings of the 35th ACM Conference on Hypertext and Social Media;2024-09-10

2. A novel socio-pragmatic framework for sentiment analysis in Dravidian–English code-switched texts;Knowledge-Based Systems;2024-09

3. Foundations of AI Ethics;Advances in Computational Intelligence and Robotics;2024-08-30

4. Enhancing Business Intelligence Through AI-Driven Integration of Sustainability Metrics via ESG Factors;Advances in Finance, Accounting, and Economics;2024-04-19
