Business environmental analysis for textual data using data mining and sentence-level classification

Published:2019-02-04 Issue:1 Volume:119 Page:69-88
ISSN:0263-5577
Container-title:Industrial Management & Data Systems
language:en
Short-container-title:IMDS

Author:

Kim Yoon-Sung,Rim Hae-Chang,Lee Do-Gil

Abstract

Purpose: The purpose of this paper is to propose a methodology for classifying a large amount of unstructured textual data into the categories of business environmental analysis frameworks.

Design/methodology/approach: This paper uses machine learning to classify a vast amount of unstructured textual data by category of business environmental analysis framework. Producing large, high-quality training data for a machine-learning-based system is generally costly, so semi-supervised learning techniques are used to improve classification performance. Additionally, the lack-of-features problem that traditional classification systems have suffered from is resolved by applying semantic features derived from word embedding, a new technique in text mining.

Findings: The proposed methodology can be used for various business environmental analyses, and the system is fully automated in both the training and classification phases. Semi-supervised learning can solve the problem of insufficient training data. The proposed semantic features can help improve traditional classification systems.

Research limitations/implications: This paper focuses on classifying sentences that contain business environmental analysis information in a large number of documents. However, the proposed methodology is limited with respect to advanced analyses that can directly help managers establish strategies, since it does not summarize the environmental variables implied in the classified sentences. Advanced summarization and recommendation techniques could extract the environmental variables from the sentences and help managers establish effective strategies.

Originality/value: The feature selection technique developed in this paper has not been used in traditional systems for business and industry, and it allows the whole process to be fully automated. The paper also demonstrates practicality, in that the approach can be applied to various business environmental analysis frameworks. In addition, the system is more economical than traditional systems because of semi-supervised learning, and it can resolve the lack-of-features problem that traditional systems suffer from. This work is valuable for analyzing environmental factors and establishing strategies for companies.
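
The abstract names two ingredients: semantic features from word embeddings and semi-supervised learning over a small labeled set. The sketch below is only an illustration of that general idea, not the authors' system. It assumes a self-training loop with a nearest-centroid classifier, which the abstract does not specify. The toy vectors, the "opportunity" and "threat" categories, the confidence threshold, and all function names are hypothetical.

```python
import math

# Hypothetical 2-d word vectors; a real system would load trained embeddings.
TOY_VECTORS = {
    "growth": (1.0, 0.1), "demand": (0.9, 0.2), "opportunity": (0.8, 0.0),
    "rival": (0.1, 1.0), "lawsuit": (0.0, 0.9), "threat": (0.2, 0.8),
}


def embed(sentence):
    """Average the vectors of known words; zero vector if none are known."""
    vecs = [TOY_VECTORS[w] for w in sentence.lower().split() if w in TOY_VECTORS]
    if not vecs:
        return (0.0, 0.0)
    return tuple(sum(c) / len(vecs) for c in zip(*vecs))


def cosine(a, b):
    """Cosine similarity, defined as 0.0 when either vector is all zeros."""
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    if na == 0.0 or nb == 0.0:
        return 0.0
    return sum(x * y for x, y in zip(a, b)) / (na * nb)


def centroids(labeled):
    """Mean sentence vector per category."""
    groups = {}
    for sentence, label in labeled:
        groups.setdefault(label, []).append(embed(sentence))
    return {
        label: tuple(sum(c) / len(vs) for c in zip(*vs))
        for label, vs in groups.items()
    }


def predict(sentence, cents):
    """Return (label, margin): the closest category and its lead over the runner-up."""
    v = embed(sentence)
    scored = sorted(((cosine(v, c), label) for label, c in cents.items()), reverse=True)
    margin = scored[0][0] - scored[1][0] if len(scored) > 1 else scored[0][0]
    return scored[0][1], margin


def self_train(labeled, unlabeled, threshold=0.3, max_rounds=5):
    """Self-training: repeatedly add confidently classified sentences to the training set."""
    labeled, pool = list(labeled), list(unlabeled)
    for _ in range(max_rounds):
        cents = centroids(labeled)
        confident, rest = [], []
        for sentence in pool:
            label, margin = predict(sentence, cents)
            (confident if margin >= threshold else rest).append((sentence, label))
        if not confident:
            break  # nothing new to learn from
        labeled.extend(confident)
        pool = [sentence for sentence, _ in rest]
    return centroids(labeled), labeled


seed = [("growth opportunity", "opportunity"), ("rival threat", "threat")]
cents, grown = self_train(seed, ["demand growth", "lawsuit rival", "hello world"])
print(dict(grown))
```

In this sketch a sentence with no known words gets a zero vector and a zero margin, so it is never pseudo-labeled. A different confidence rule or classifier would change which sentences get absorbed.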

Publisher

Emerald

Subject

Industrial and Manufacturing Engineering,Strategy and Management,Computer Science Applications,Industrial relations,Management Information Systems

References: 59 articles.

1. Mining association rules between sets of items in large databases;ACM SIGMOD Record,1993

2. Extracting failure time data from industrial maintenance records using text mining;Advanced Engineering Informatics,2016

3. Robust sentiment detection on twitter from biased and noisy data,2010

4. A neural probabilistic language model;Journal of Machine Learning Research,2003

5. Classifying sentiment in microblogs: is brevity an advantage?,2010

Cited by 8 articles.

1. Incorporating Multiple Textual Factors into Unbalanced Financial Distress Prediction: A Feature Selection Methods and Ensemble Classifiers Combined Approach;International Journal of Computational Intelligence Systems;2023-10-04

2. Support for decision-making in checking the level of quality of student research works based on automated text analysis (Asistencia para la toma de decisiones en la evaluación de la calidad de las investigaciones de los estudiantes basada en el análisis automático de textos);Culture and Education;2023-10-02

3. A deep learning model for online doctor rating prediction;Journal of Forecasting;2023-02-14

4. Application of text mining in identifying the factors of supply chain financing risk management;Industrial Management & Data Systems;2020-11-10

5. Analysis of Smartphone Users Movement Using Data Mining Methods and SWOT Analysis in East Surabaya Areas;Journal of Physics: Conference Series;2020-07