The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets

Published:2020-05-13 Issue: Volume:2020 Page:1-17
ISSN:1024-123X
Container-title:Mathematical Problems in Engineering
language:en
Short-container-title:Mathematical Problems in Engineering

Author:

Uçar Muhammed Kürşad¹, Nour Majid², Sindi Hatem³, Polat Kemal⁴

Affiliation:

1. Electrical and Electronics Engineering, Faculty of Engineering, Sakarya University, Sakarya 54187, Turkey

2. Department of Electrical and Computer Engineering, Faculty of Engineering, King Abdulaziz University, Jeddah 21589, Saudi Arabia

3. King Abdulaziz University, Knowledge-Economy, and Technology Transfer Center, Jeddah 21589, Saudi Arabia

4. Department of Electrical and Electronics Engineering, Faculty of Engineering, Bolu Abant Izzet Baysal University, Bolu 14280, Turkey

Abstract

The training and testing process is very important when classifying biomedical datasets with machine learning, and the researcher should carefully choose the methods used at every step. However, very few studies address these method choices, and the studies in the literature are generally theoretical. Moreover, there is no practical model for how to select samples in the training and testing process. There is therefore a need for resources in machine learning that discuss the training and testing process in detail and offer new recommendations. This article provides such a detailed analysis and is organized as follows. The third section describes how to prepare the datasets; four balanced datasets were used for the application. The fourth section describes the training and test ratio and how to select samples at the training and testing stages. Sampling theory, a fundamental subject of statistics, shows how to select samples, and this article proposes using sampling methods in the machine learning training and testing process. The fourth section also gives the theoretical formulation of four different sampling theorems, and the results section reports their performance. The fifth section describes the methods by which training and pretest features can be selected. In the study, three different classifiers are used to check performance. The results section describes how the results should be analyzed, and the article proposes performance evaluation methods for evaluating them. Overall, the article examines in detail the effect of the training and testing process on machine learning performance and proposes the use of sampling theorems for that process. According to the results, the dataset, feature selection algorithm, classifier, and training and test ratio are criteria that directly affect performance. However, the method of selecting samples at the training and testing stages is vital for the system to work correctly. To design a stable system, it is recommended that samples be selected with the stratified systematic sampling theorem.
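The stratified systematic selection recommended in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the authors' exact procedure, so this is an assumption about how it might look: samples are grouped by class label (the strata), and within each class every k-th sample is sent to the test set, with k derived from the test ratio. The function name `stratified_systematic_split`, its parameters, and the `"sick"`/`"healthy"` labels are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def stratified_systematic_split(labels, test_ratio=0.2, start=0):
    """Split sample indices into train and test sets.

    Illustrative sketch only, not the paper's exact procedure:
    within each class (stratum), every k-th sample goes to the test
    set, where k = round(1 / test_ratio).
    """
    if not 0 < test_ratio <= 0.5:
        raise ValueError("test_ratio must be in (0, 0.5]")
    step = round(1 / test_ratio)

    # Group sample indices by class label, preserving original order.
    strata = defaultdict(list)
    for index, label in enumerate(labels):
        strata[label].append(index)

    train, test = [], []
    for indices in strata.values():
        for position, index in enumerate(indices):
            # Systematic selection: fixed interval from a fixed start offset.
            if position % step == start % step:
                test.append(index)
            else:
                train.append(index)
    return sorted(train), sorted(test)


# Hypothetical balanced dataset: 10 samples per class.
labels = ["sick"] * 10 + ["healthy"] * 10
train_idx, test_idx = stratified_systematic_split(labels, test_ratio=0.2)
```

Because the selection is deterministic and performed separately in each class, both classes contribute the same share of samples to the test set, which is one plausible reading of why such a scheme would give stable results on balanced datasets.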

Funder

King Abdulaziz University

Publisher

Hindawi Limited

Subject

General Engineering,General Mathematics

Link

http://downloads.hindawi.com/journals/mpe/2020/2836236.pdf

References: 53 articles.

1. Diversity in Machine Learning

Cited by 79 articles.

1. Machine learning model to predict rate constants for sonochemical degradation of organic pollutants;Ultrasonics Sonochemistry;2024-11

2. Development of machine learning-based burst capacity models for pipelines containing dent-gouges with synthetic full-scale burst test data generated using tabular generative adversarial network;Engineering Applications of Artificial Intelligence;2024-07

3. Artificial intelligence applications for accurate geothermal temperature prediction in the lower Friulian Plain (north-eastern Italy);Journal of Cleaner Production;2024-07

4. Examining labelling guidelines for AI‐based software as a medical device: A review and analysis of dermatology mobile applications in Australia;Australasian Journal of Dermatology;2024-05

5. Deep learning-based forecasting modeling of micro gas turbine performance projection: An experimental approach;Engineering Applications of Artificial Intelligence;2024-04