Overview of Machine Learning Process Modelling

Published:2021-08-28
Issue:9
Volume:23
Page:1123
ISSN:1099-4300
Container-title:Entropy
language:en
Short-container-title:Entropy

Author:

Brumen Boštjan, Černezel Aleš, Bošnjak Leon

Abstract

Much research has been conducted in the area of machine learning algorithms; however, the question of a general description of an artificial learner’s (empirical) performance has remained largely unanswered. A general, restriction-free theory of its performance has not yet been developed. In this study, we investigate which function most appropriately describes the learning curves produced by several machine learning algorithms, and how well these curves can predict the future performance of an algorithm. Decision trees, neural networks, Naïve Bayes, and Support Vector Machines were applied to 130 datasets from publicly available repositories. Three different functions (power, logarithmic, and exponential) were fitted to the measured outputs. Using rigorous statistical methods and two goodness-of-fit measures, the power law model proved to be the most appropriate for describing the learning curves produced by the algorithms, in terms of both goodness-of-fit and prediction capability. The presented study, the first of its kind in scale and rigour, provides results (and methods) that can be used to assess the performance of novel or existing artificial learners and to forecast their ‘capacity to learn’ based on the amount of available or desired data.
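The comparison of candidate learning-curve functions described in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure, and everything specific in it is an assumption: the two-parameter forms of the power, logarithmic, and exponential functions, the linearised least-squares fitting, the mean squared error as the single quality measure, the synthetic data, and all function and variable names. The abstract does not state the functional forms, the fitting method, or which two goodness-of-fit measures were used.

```python
import math


def linear_fit(xs, ys):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def fit_power(sizes, errors):
    # Assumed form: error = a * size**b, linearised as ln(error) = ln(a) + b*ln(size).
    c, b = linear_fit([math.log(s) for s in sizes], [math.log(e) for e in errors])
    a = math.exp(c)
    return lambda s: a * s ** b


def fit_logarithmic(sizes, errors):
    # Assumed form: error = a + b * ln(size), already linear in ln(size).
    a, b = linear_fit([math.log(s) for s in sizes], errors)
    return lambda s: a + b * math.log(s)


def fit_exponential(sizes, errors):
    # Assumed form: error = a * exp(b * size), linearised as ln(error) = ln(a) + b*size.
    c, b = linear_fit(sizes, [math.log(e) for e in errors])
    a = math.exp(c)
    return lambda s: a * math.exp(b * s)


def mean_squared_error(model, sizes, errors):
    return sum((model(s) - e) ** 2 for s, e in zip(sizes, errors)) / len(sizes)


# Synthetic, noise-free learning curve (hypothetical data, not from the study):
# the error rate falls as a power law of the training-set size.
sizes = [50, 100, 200, 400, 800, 1600]
errors = [2.0 * s ** -0.5 for s in sizes]

# Fit on the small sample sizes, then predict the larger, held-out ones.
train_sizes, train_errors = sizes[:4], errors[:4]
test_sizes, test_errors = sizes[4:], errors[4:]

fitters = {
    "power": fit_power,
    "logarithmic": fit_logarithmic,
    "exponential": fit_exponential,
}
models = {name: fit(train_sizes, train_errors) for name, fit in fitters.items()}

fit_mse = {n: mean_squared_error(m, train_sizes, train_errors) for n, m in models.items()}
pred_mse = {n: mean_squared_error(m, test_sizes, test_errors) for n, m in models.items()}

best_predictor = min(pred_mse, key=pred_mse.get)
print("best predictor on held-out sizes:", best_predictor)
```

Because the synthetic data were generated from a power law, the power model wins here by construction; the sketch only shows the mechanics of fitting on small samples and scoring predictions on larger ones, not evidence for the paper's conclusion.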

Funder

Javna Agencija za Raziskovalno Dejavnost RS

Publisher

MDPI AG

Subject

General Physics and Astronomy

Link

https://www.mdpi.com/1099-4300/23/9/1123/pdf
