Affiliation:
1. Ukrainian State University of Science and Technologies
Abstract
Purpose. Natural language text processing techniques are used to solve a wide range of tasks. One of the most difficult tasks when working with natural language texts for different languages is to find certain indicators for further determining its authorship. The problem is still relevant due to the lack of a unified tool or method for working with texts in different languages. Working with texts in Ukrainian requires taking into account its peculiarities of word and sentence construction to obtain the best result. The main purpose of this article is to analyze the existing methods of text processing, their features and effectiveness in working with texts of different languages. Methodology. Natural language text processing methods are systematized by type and format, according to the tools and approaches used. For each method, its features, effectiveness, scope, and limitations are considered. The means of system analysis were used to form the final characterization of the method, taking into account its purpose and capabilities. Findings. The study of methods has revealed the following ones used for the intellectual analysis of texts in different languages, their scope, effectiveness in working with different languages, strengths and weaknesses. This will make it possible to choose an effective toolkit for working with Ukrainian texts. It has been established that intelligent text processing is a complex task that requires an individual approach to each language to take into account its peculiarities and obtain the best result. Originality. The basis for choosing an effective method for working with Ukrainian-language texts is formed, the existing methods of intellectual text processing, their application features, capabilities and efficiency in working with texts of different languages are analyzed and systematized. Practical value. The work allowed us to identify the most promising, effective and appropriate methods of intellectual analysis of natural language texts in order to use them for processing Ukrainian-language texts in the future.
Publisher
Ukrainian State University of Science and Technologies
Reference65 articles.
1. Slavic experience of compiling a frequency dictionary of writer’s language.;Buk;Problems of slavonic studies,2011
2. Funktsionalnyy styl khudozhnoho movlennya;Voitenko;Naukovì zapiski Nacìonalʹnogo unìversitetu «Ostrozʹka akademìâ». Serìâ Fìlologìčna,2012
3. Perebyynis, V. S. (2002). Statystychni metody dlya linhvistiv: navchalnyy posibnyk. Vinnytsya: Nova knyha. (in Ukrainian)
4. A Naïve-Bayes classifier for damage detection in engineering materials;Addin;Materials & Design,2007
5. Aggarwal, C. C. (2018). Machine Learning for Text (pp. 1-6). Springer International Publishing. DOI: https://doi.org/10.1007/978-3-319-73531-3 (in English)