Multicriteria Machine Learning Model Assessment—Residuum Analysis Review

Published: 2024-02-20 Issue: 5 Volume: 13 Page: 810
ISSN: 2079-9292
Container-title: Electronics
Language: en

Author:

Kaniuka Jan¹, Ostrysz Jakub¹, Groszyk Maciej¹, Bieniek Krzysztof¹, Cyperski Szymon², Domański Paweł D.¹,²

Affiliation:

1. Institute of Control and Computation Engineering, Faculty of Electronics and Information Technology, Warsaw University of Technology, Nowowiejska 15/19, 00-665 Warsaw, Poland

2. Control System Software Sp. z o.o., ul. Rzemieślnicza 7, 81-855 Sopot, Poland

Abstract

Machine learning (ML) and its applications are among today's leading research areas. Neural networks have recently gained enormous popularity, and many works in various fields use them in the hope of improving on previous results. Applying artificial intelligence (AI) methods, and the rationale for doing so, is one issue; assessing the resulting model is a completely different matter. Most practitioners use the mean square error or, less often, the mean absolute error, in absolute or percentage versions. One should remember that not all errors are alike and that a single value does not provide enough knowledge about the causes of a given behavior. Proper interpretation of the results is crucial because it leads to further model improvement. It may be challenging, but it allows us to obtain better and more robust solutions, which ultimately solve real-life problems. ML model assessment is a multicriteria task: a single measure delivers only a fraction of the picture. This paper aims to fill that research gap. Commonly used integral measures are compared with alternative measures such as factors of Gaussian and non-Gaussian statistics, robust statistical estimators, the tail index and the fractional order. The proposed methodology delivers new single-criterion indexes or a multicriteria approach, which extend the statistical concept of the moment ratio diagram (MRD) into the index ratio diagram (IRD). The proposed approach is validated using real data from a Full Truck Load cost estimation example, in which 35 different ML regression algorithms are compared. The analysis gives insight into the properties of the selected methods, enables their comparison and homogeneity analysis, and ultimately leads to constructive suggestions for their proper use. The paper proposes new indexes and concludes that a correct selection of the residuum analysis methodology makes both the assessment and the ML regression credible.
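The abstract's point that a single integral measure gives only part of the picture can be illustrated with a minimal sketch. It is not the paper's methodology and does not reproduce the proposed indexes or the IRD; the function name `summarize`, the dictionary keys, and both residual sets are hypothetical and synthetic, chosen only so that two very different residual patterns share the same mean square error.

```python
from statistics import fmean, median


def summarize(residuals):
    """Return a few residual statistics (illustrative selection, not the paper's indexes)."""
    mean = fmean(residuals)
    med = median(residuals)
    # Integral measures commonly reported for regression models.
    mse = fmean([r * r for r in residuals])
    mae = fmean([abs(r) for r in residuals])
    # Robust scale estimate: median absolute deviation from the median.
    mad = median([abs(r - med) for r in residuals])
    # Population central moments, used for the (non-excess) kurtosis.
    m2 = fmean([(r - mean) ** 2 for r in residuals])
    m4 = fmean([(r - mean) ** 4 for r in residuals])
    kurtosis = m4 / (m2 * m2) if m2 > 0 else float("nan")
    return {"mse": mse, "mae": mae, "median": med, "mad": mad, "kurtosis": kurtosis}


# Synthetic residuals (assumption): A errs a little everywhere,
# B is exact except for one large miss.
residuals_a = [1, -1] * 8
residuals_b = [0] * 15 + [4]

stats_a = summarize(residuals_a)
stats_b = summarize(residuals_b)

for name, stats in (("A", stats_a), ("B", stats_b)):
    print(name, {key: round(value, 3) for key, value in stats.items()})
```

Both sets have a mean square error of 1.0, yet the mean absolute error is 1.0 for A and 0.25 for B, the median absolute deviation is 1.0 for A and 0.0 for B, and the kurtosis of B is far larger. Ranking the two hypothetical models on mean square error alone would treat them as equivalent.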

Funder

Polish National Centre for Research and Development

Publisher

MDPI AG

Link

https://www.mdpi.com/2079-9292/13/5/810/pdf
