Affiliation:
1. Artificial Intelligence Technology Scientific and Education Center, Bauman Moscow State Technical University, Moscow 105005, Russia
Abstract
This study proposes a method for classifying economic activity descriptors to match Nomenclature of Economic Activities (NACE) codes, employing a blend of machine learning techniques and expert evaluation. By leveraging natural language processing (NLP) methods to vectorize activity descriptors and utilizing genetic algorithm (GA) optimization to fine-tune hyperparameters in multi-class classifiers like Naive Bayes, Decision Trees, Random Forests, and Multilayer Perceptrons, our aim is to boost the accuracy and reliability of an economic classification system. This system faces challenges due to the absence of precise target labels in the dataset. Hence, it is essential to initially check the accuracy of utilized methods based on expert evaluations using a small dataset before generalizing to a larger one.
Reference76 articles.
1. Schnabl, E., and Zenker, A. (2013). Statistical Classification of Knowledge-Intensive Business Services (KIBS) with NACE Rev. 2, Fraunhofer ISI.
2. Nijhowne, S. (1995). Defining and classifying statistical units. Business Survey Methods, Wiley Online Library.
3. Barrier, E.B. (2017). The concept of sustainable economic development. The Economics of Sustainability, Routledge.
4. Towards correct cloud resource allocation in business processes;Graiet;IEEE Trans. Serv. Comput.,2016
5. Ievdokymov, V., Ostapchuk, T., Lehenchuk, S., Grytsyshen, D., and Marchuk, G. (2020). Analysis of the Impact of Intangible Assets on the Companies’ Market Value, Natsional’nyi Hirnychyi Universytet. Naukovyi Visnyk.