Multicriteria Machine Learning Model Assessment—Residuum Analysis Review

Author:

Kaniuka Jan (1), Ostrysz Jakub (1), Groszyk Maciej (1), Bieniek Krzysztof (1), Cyperski Szymon (2), Domański Paweł D. (1, 2)

Affiliation:

1. Institute of Control and Computation Engineering, Faculty of Electronics and Information Technology, Warsaw University of Technology, Nowowiejska 15/19, 00-665 Warsaw, Poland

2. Control System Software Sp. z o.o., ul. Rzemieślnicza 7, 81-855 Sopot, Poland

Abstract

Machine learning (ML) and its applications are among today's leading research areas. Neural networks have recently gained enormous popularity, and many works in various fields use them in the hope of improving on previous results. Applying artificial intelligence (AI) methods, and the rationale for doing so, is one issue; assessing the resulting model is a completely different matter. Most practitioners use the mean square error or, less often, the mean absolute error, in absolute or percentage versions. One should remember that not all errors are alike and that a single value does not provide enough knowledge about the causes of a given behavior. Proper interpretation of the results is crucial because it leads to further model improvement. It may be challenging, but it allows us to obtain better and more robust solutions, which ultimately solve real-life problems. ML model assessment is a multicriteria task: a single measure delivers only a fraction of the picture. This paper aims to fill that research gap. Commonly used integral measures are compared with alternative measures such as factors of Gaussian and non-Gaussian statistics, robust statistical estimators, the tail index and the fractional order. The proposed methodology delivers new single-criterion indexes or a multicriteria approach, which extends the statistical concept of the moment ratio diagram (MRD) into the index ratio diagram (IRD). The proposed approach is validated using real data from a Full Truck Load cost estimation example, in which 35 different ML regression algorithms applied to that task are compared. The analysis gives insight into the properties of the selected methods, enables their comparison and homogeneity analysis, and ultimately leads to constructive suggestions for their proper use. The paper proposes new indexes and concludes that correct selection of the residuum analysis methodology makes both the assessment and the ML regression credible.
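The abstract's point that a single integral measure shows only part of the picture can be illustrated with a minimal sketch. The function name `residual_summary` and the toy data are hypothetical and are not taken from the paper. The sketch covers only the mean square error, the mean absolute error, two robust estimators (median and median absolute deviation) and two moment-based shape factors. It does not reproduce the tail index, the fractional order or the IRD.

```python
import numpy as np


def residual_summary(y_true, y_pred):
    """Return several residual measures for one model (illustrative only)."""
    r = np.asarray(y_true, dtype=float) - np.asarray(y_pred, dtype=float)
    mean, std = r.mean(), r.std()
    median = np.median(r)
    centered = r - mean
    return {
        "mse": float(np.mean(r ** 2)),            # integral measure
        "mae": float(np.mean(np.abs(r))),         # integral measure
        "median": float(median),                  # robust location
        "mad": float(np.median(np.abs(r - median))),  # robust scale
        # Moment-based shape factors (population versions, no bias correction).
        "skewness": float(np.mean(centered ** 3) / std ** 3) if std > 0 else 0.0,
        "excess_kurtosis": float(np.mean(centered ** 4) / std ** 4 - 3.0) if std > 0 else 0.0,
    }


# Toy data (an assumption for illustration): two models with identical MSE.
y_true = np.full(8, 100.0)
pred_a = y_true + np.array([1, -1, 1, -1, 1, -1, 1, -1])  # small errors everywhere
pred_b = y_true + np.array([0, 0, 0, 0, 0, 0, 2, -2])     # mostly exact, two outliers

summary_a = residual_summary(y_true, pred_a)
summary_b = residual_summary(y_true, pred_b)

for name, s in (("A", summary_a), ("B", summary_b)):
    print(name, {k: round(v, 3) for k, v in s.items()})
```

Both toy models share the same mean square error, yet they differ in mean absolute error, median absolute deviation and kurtosis. Ranking them by one number alone would hide that one model errs slightly everywhere while the other is exact except for a few large misses.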

Funder

Polish National Centre for Research and Development

Publisher

MDPI AG

