Multicriteria Machine Learning Model Assessment—Residuum Analysis Review
-
Published:2024-02-20
Issue:5
Volume:13
Page:810
-
ISSN:2079-9292
-
Container-title:Electronics
-
language:en
-
Short-container-title:Electronics
Author:
Kaniuka Jan1ORCID, Ostrysz Jakub1ORCID, Groszyk Maciej1ORCID, Bieniek Krzysztof1ORCID, Cyperski Szymon2ORCID, Domański Paweł D.12ORCID
Affiliation:
1. Institute of Control and Computation Engineering, Faculty of Electronics and Information Technology, Warsaw University of Technology, Nowowiejska 15/19, 00-665 Warsaw, Poland 2. Control System Software Sp. z o.o., ul. Rzemieślnicza 7, 81-855 Sopot, Poland
Abstract
The use of machine learning (ML) and its applications is one of the leading research areas nowadays. Neural networks have recently gained enormous popularity and many works in various fields use them in the hope of improving previous results. The application of the artificial intelligence (AI) methods and the rationale for this decision is one issue, but the assessment of such a model is a completely different matter. People mostly use mean square error or less often mean absolute error in the absolute or percentage versions. One should remember that an error does not equal an error and a single value does not provide enough knowledge about the causes of some behavior. Proper interpretation of the results is crucial. It leads to further model improvement. It might be challenging, but allows us to obtain better and more robust solutions, which ultimately solve real-life problems. The ML model assessment is the multicriteria task. A single measure delivers only a fraction of the picture. This paper aims at filling that research gap. Commonly used integral measures are compared with alternative measures like factors of the Gaussian and non-Gaussian statistics, robust statistical estimators, tail index and the fractional order. The proposed methodology delivers new single-criteria indexes or the multicriteria approach, which extend the statistical concept of the moment ratio diagram (MRD) into the index ratio diagram (IRD). The proposed approach is validated using real data from the Full Truck Load cost estimation example. It compares 35 different ML regression algorithms applied to that task. The analysis gives an insight into the properties of the selected methods, enables their comparison and homogeneity analysis and ultimately leads towards constructive suggestions for their eventual proper use. The paper proposes new indexes and concludes that correct selection of the residuum analysis methodology makes the assessment and the ML regression credible.
Funder
Polish National Centre for Research and Development
Reference77 articles.
1. Morrison, G., Emil, E., Canipe, H., and Burnham, A. (2020). Guide to Calculating Ownership and Operating Costs of Department of Transportation Vehicles and Equipment: An Accounting Perspective, The National Academies Press. 2. Vu, Q.H., Cen, L., Ruta, D., and Liu, M. (2022, January 4–7). Key Factors to Consider when Predicting the Costs of Forwarding Contracts. Proceedings of the 2022 17th Conference on Computer Science and Intelligence Systems (FedCSIS), Sofia, Bulgaria. 3. Pricing Dynamics in the Truckload Sector: The Moderating Role of the Electronic Logging Device Mandate;Miller;J. Bus. Logist.,2021 4. Acocella, A., Caplice, C., and Sheffi, Y. (2022). The end of ’set it and forget it’ pricing? Opportunities for market-based freight contracts. arXiv. 5. Hernes, M., Wojtkiewicz, K., and Szczerbicki, E. (2020). Advances in Computational Collective Intelligence, Springer.
|
|