Machine learning methods (tokenization) in marketing research

Author:

Ganebnykh E. V.1ORCID,Savelieva N. K.1ORCID,Sozinova A. A.1ORCID,Fokina O. V.1ORCID,Altsybeeva I. G.1ORCID

Affiliation:

1. Vyatka State University

Abstract

Field research is of particular interest in marketing because it often generates unique statistics. Closed-ended questions during data collection simplify data processing, but at the same time significantly limit the research subject depth. Open-ended questions provide a deeper understanding of respondents’ opinions, but processing responses in the form of natural language (qualitative data) is difficult and time-consuming, as it is usually done manually. Modern machine learning techniques, particularly tokenization, can be used to automate such data processing. The purpose of the study is to test this method application to data processing of the field research “Monitoring of the competition state and development in the commodity markets of the Novosibirsk Region”. The following tasks have been set and solved: primary information has been collected and prepared for processing, and token groups identified and formed. Based on the groups, the respondents’ answers have been further combined into relatively homogeneous clusters including similar answers to open-ended questions. Subsequent quality control of the conducted research has been carried out on the basis of Precision, Recall and F-measure metrics, which showed an acceptable level of data processing quality. Information collection has been realized through sociological surveys (questionnaire distribution) and CAWI surveys and included open-ended questions. The study reveals that even extremely insignificant references were not missed. The obtained data allowed us to conclude that it is necessary to form annotated databases and token libraries for the marketing research purposes.

Publisher

State University of Management

Reference15 articles.

1. Andreev А.V. Artificial intelligence and its role in big data processing. Smart Digital Economy. 2023;1(3):65–69. (In Russian).

2. Sarker I.H. Machine Learning: Algorithms, Real-World Applications and Research Directions. SN Computer Science. 2021;2:160.

3. Shlykov S.V. Application of machine learning methods to automate processes in the oil and gas industry. Transport and storage of oil products and hydrocarbons. 2023;2:46–53. (In Russian). https://doi.org/10.24412/0131-4270-2023-2-46-53

4. Bobkov S.P., Suvorov S.V., Orlov A.I., Pivnev E.A. Using machine learning methods to assess risks when implementing a new credit product. News of Higher Educational Institutions. The Series “Economics, Finance and Production Management”. 2020;4(46):59–63. (In Russian). https://doi.org/10.6060/ivecofin.2020464.509

5. Osipova Т.А., Zaytsev K.S., Bifert V.O. The use of ML algorithms in the task of detecting fraud when using plastic cards. International Journal of Open Information Technologies. 2021;8:23–29. (In Russian).

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3