Integrating Relative Efficiency Models with Machine Learning Algorithms for Performance Prediction-Reference-Cited by-同舟云学术

Integrating Relative Efficiency Models with Machine Learning Algorithms for Performance Prediction

Published:2024-04 Issue:2 Volume:14 Page:
ISSN:2158-2440
Container-title:Sage Open
language:en
Short-container-title:Sage Open

Author:

Perroni Marcos Gonçalves¹,Veiga Claudimar Pereira da²^ORCID,Forteski Elaine³,Marconatto Diego Antonio Bittencourt²,da Silva Wesley Vieira⁴,Senff Carlos Otávio¹,Su Zhaohui⁵

Affiliation:

1. Pontifical Catholic University of Parana, Curitiba, PR, Brazil

2. School of Business, Fundação Dom Cabral—FDC, Nova Lima, MG, Brazil

3. Contestado University (UnC), Pres., Mafra, Santa Catarina, Brazil

4. Federal University of Alagoas, Maceió, Brazil

5. Institute for Human Rights, Southeast University, Nanjing, China

Abstract

Predicting operational performance enables organizations to develop operational effectiveness goals considering different combinations of resources. Measuring performance is consolidated with advances in relative efficiency analysis techniques, including data envelopment analysis (DEA) and stochastic frontier analysis (SFA), albeit these methods lack predictive capability. This paper proposes an approach for performance prediction by integrating relative efficiency measurement models with machine learning algorithms. Data analyses were conducted using data provided by the energy assessment project offered to small and medium-sized manufacturing companies in the United States ( n 7,548) using sales as the output, with the inputs being the number of employees, hours of operation, electricity, natural gas, cost of electricity, and cost of natural gas. Performance was estimated differently, employing parametric (SFA) and non-parametric (DEA) methods. The prediction benchmarking process occurred by adopting machine learning algorithms: regression (LM), support vector machine (SVM), K-nearest neighbor (KNN), linear discriminant analysis (LDA), random forest (RF), and decision tree (DT). The findings showed that it is possible to identify the best prediction algorithm associated with a performance model. However, the performance prediction may differ if different strategies for measuring performance or machine learning model configurations are used. In addition, SFA-LOG and SVM had the best performance for regression, and DEA-VRS/IRS excelled with random forest; the RF algorithm was the best fit across all performance approaches. The error rate depends on the algorithm and the performance model, and the number of classes must be reduced to obtain a higher success rate.

Funder

Conselho Nacional de Desenvolvimento Científico e Tecnológico

Publisher

SAGE Publications

Link

https://journals.sagepub.com/doi/pdf/10.1177/21582440241257800

Reference61 articles.

1. Determinants of energy efficiency investments in the US

2. Forecasting models in the manufacturing processes and operations management: Systematic literature review

3. Formulation and estimation of stochastic frontier production function models

4. Data reduction based on NN-kNN measure for NN classification and regression

5. Information programs for technology adoption: the case of energy-efficiency audits