Author:
Polyakov Evrenii, Voskov Leonid, Abramov Pavel, Polyakov Sergey
Abstract
Introduction: Sentiment analysis is a complex problem whose solution essentially depends on the context, field of study and amount of text data. Analysis of publications shows that the authors often do not use the full range of possible data transformations and their combinations. Only a part of the transformations is used, limiting the ways to develop high-quality classification models. Purpose: Developing and exploring a generalized approach to building a model, which consists in sequentially passing through the stages of exploratory data analysis, obtaining a basic solution, vectorization, preprocessing, hyperparameter optimization, and modeling. Results: Comparative experiments conducted using a generalized approach for classical machine learning and deep learning algorithms in order to solve the problem of sentiment analysis of short text messages in natural language processing have demonstrated that the classification quality grows from one stage to another. For classical algorithms, such an increase in quality was insignificant, but for deep learning, it was 8% on average at each stage. Additional studies have shown that the use of automatic machine learning which uses classical classification algorithms is comparable in quality to manual model development; however, it takes much longer. The use of transfer learning has a small but positive effect on the classification quality. Practical relevance: The proposed sequential approach can significantly improve the quality of models under development in natural language processing problems.
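The staged approach named in the abstract (exploratory analysis, a basic solution, vectorization, preprocessing, hyperparameter optimization, modeling) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the paper: the toy messages, the function names, the bag-of-words representation, and the choice of a multinomial naive Bayes classifier as a stand-in for the classical algorithms the authors compare.

```python
import math
import re
from collections import Counter

# Toy labelled messages (hypothetical data, not from the paper): 1 = positive, 0 = negative.
TRAIN = [
    ("I love this phone", 1),
    ("Great service, really happy", 1),
    ("What a wonderful day", 1),
    ("I hate waiting in line", 0),
    ("Terrible support, very unhappy", 0),
    ("This is awful", 0),
]
VALID = [("Really love the service", 1), ("Awful and terrible", 0)]

# Exploratory stage: inspect the label balance before modeling.
label_distribution = Counter(label for _, label in TRAIN)


def majority_baseline(train):
    """Basic-solution stage: always predict the most frequent training label."""
    return Counter(label for _, label in train).most_common(1)[0][0]


def tokenize(text, lowercase=True):
    """Preprocessing and vectorization stages: bag-of-words token counts."""
    if lowercase:
        text = text.lower()
    return Counter(re.findall(r"[A-Za-z']+", text))


def fit_naive_bayes(train, alpha, lowercase=True):
    """Modeling stage: collect the counts used by multinomial naive Bayes."""
    word_counts = {0: Counter(), 1: Counter()}
    class_counts = Counter()
    for text, label in train:
        word_counts[label].update(tokenize(text, lowercase))
        class_counts[label] += 1
    vocab = set(word_counts[0]) | set(word_counts[1])
    return word_counts, class_counts, vocab, alpha, lowercase


def predict(model, text):
    """Return the label with the highest smoothed log-probability."""
    word_counts, class_counts, vocab, alpha, lowercase = model
    total_docs = sum(class_counts.values())
    scores = {}
    for label in class_counts:
        total_words = sum(word_counts[label].values())
        score = math.log(class_counts[label] / total_docs)
        for word, n in tokenize(text, lowercase).items():
            if word in vocab:  # words never seen in training are ignored
                p = (word_counts[label][word] + alpha) / (total_words + alpha * len(vocab))
                score += n * math.log(p)
        scores[label] = score
    return max(scores, key=scores.get)


def accuracy(predict_fn, data):
    return sum(predict_fn(text) == label for text, label in data) / len(data)


baseline_label = majority_baseline(TRAIN)
baseline_acc = accuracy(lambda _: baseline_label, VALID)

# Hyperparameter optimization stage: a small grid search over the smoothing constant.
best_alpha, best_acc = None, -1.0
for alpha in (0.1, 1.0, 10.0):
    model = fit_naive_bayes(TRAIN, alpha)
    acc = accuracy(lambda t: predict(model, t), VALID)
    if acc > best_acc:
        best_alpha, best_acc = alpha, acc
```

Comparing `baseline_acc` with `best_acc` mirrors, on a toy scale, the stage-by-stage quality comparison the abstract describes; the figures it produces say nothing about the results reported in the paper.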
Publisher
State University of Aerospace Instrumentation (SUAI)
Subject
Control and Optimization, Computer Science Applications, Human-Computer Interaction, Information Systems, Control and Systems Engineering, Software
Cited by
5 articles.
1. AI Incorporating NLP – to Boldly Go, Where No Algorithms Have Gone Before;Lecture Notes in Networks and Systems;2024
2. REVIEW OF METHODS FOR DETERMINING THE TONATION OF TEXTS IN NATURAL LANGUAGES;Bulletin of Shakarim University. Technical Sciences;2023-03-31
3. Knowledge Retrieval and Relation Mining from Tolkien’s History of Middle Earth;2022 IEEE 22nd International Symposium on Computational Intelligence and Informatics and 8th IEEE International Conference on Recent Achievements in Mechatronics, Automation, Computer Science and Robotics (CINTI-MACRo);2022-11-21
4. NLP based Model for Classification of Complaints: Autonomous and Intelligent System;2022 2nd International Conference on Digital Futures and Transformative Technologies (ICoDT2);2022-05-24
5. Egyptian Student Sentiment Analysis Using Word2vec During the Coronavirus (Covid-19) Pandemic;Advances in Intelligent Systems and Computing;2020-09-20