A TEDE Algorithm Studies the Effect of Dataset Grouping on Supervised Learning Accuracy

Published:2023-06-05 Issue:11 Volume:12 Page:2546
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Wang Xufei¹,², Wang Penghui¹, Song Jeongyoung³, Hao Taotao¹, Duan Xinlu¹

Affiliation:

1. School of Mechanical Engineering, Shaanxi University of Technology, Hanzhong 723000, China

2. Key Laboratory of Industrial Automation, Shaanxi University of Technology, Hanzhong 723000, China

3. Department of Computer Engineering, Pai Chai University, Daejeon 35345, Republic of Korea

Abstract

Datasets are the basis for research on deep learning methods in computer vision, yet the effect of the training set's share of a dataset on neural network performance needs further exploration. This paper proposes a twice equal difference enumeration (TEDE) algorithm to investigate how different training set percentages affect network model performance and to determine the optimal percentage. The Pascal VOC dataset is divided into six datasets of decreasing size, and each of these is split according to five different training set percentages. The YOLOv5 convolutional neural network is then trained and tested on the resulting 30 datasets to identify the training set percentage that yields the best-performing model. Finally, tests were conducted on the Udacity Self-Driving dataset and a self-made Tire Tread Defects (TTD) dataset. The results show that network model performance is superior when the training set accounts for between 85% and 90% of the overall dataset. The dataset partitioning results obtained with the TEDE algorithm can provide a reference for deep learning research.
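
The six-by-five enumeration described in the abstract can be illustrated with a minimal sketch. This is not the authors' TEDE implementation: the abstract does not state the subset sizes, the percentages, or how the second enumeration pass works, so the sizes (6000 down to 1000), the percentages (70 to 90 in steps of 5), the file names, and the helper names `arithmetic_steps`, `split_dataset`, and `enumerate_splits` are all assumptions chosen only to show how equal-difference steps produce 30 train/test configurations.

```python
import random
from itertools import product


def arithmetic_steps(start, step, count):
    """Return `count` values in an equal-difference (arithmetic) sequence."""
    return [start + i * step for i in range(count)]


def split_dataset(items, train_fraction, seed=0):
    """Shuffle a copy of `items` and split it into (train, test) lists."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    cut = round(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


def enumerate_splits(items, subset_sizes, train_percentages, seed=0):
    """Yield (size, percentage, train, test) for every size/percentage pair."""
    items = list(items)
    for size, pct in product(subset_sizes, train_percentages):
        subset = items[:size]  # take the first `size` items as the subset
        train, test = split_dataset(subset, pct / 100, seed)
        yield size, pct, train, test


# Placeholder values (assumptions, not the paper's settings).
images = [f"img_{i:05d}.jpg" for i in range(6000)]  # hypothetical file names
subset_sizes = arithmetic_steps(6000, -1000, 6)      # 6000, 5000, ..., 1000
train_percentages = arithmetic_steps(70, 5, 5)       # 70, 75, 80, 85, 90

configs = list(enumerate_splits(images, subset_sizes, train_percentages))
print(len(configs))  # 6 sizes x 5 percentages = 30
```

In a real experiment each `(train, test)` pair would be passed to a detector training run (YOLOv5 in the paper), and the resulting accuracy metrics compared across percentages for each subset size.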

Funder

Shaanxi Provincial Key Laboratory of Industrial Automation Research Program

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering, Computer Networks and Communications, Hardware and Architecture, Signal Processing, Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/12/11/2546/pdf

Cited by 1 article.

1. Research on Road Object Detection Model Based on YOLOv4 of Autonomous Vehicle;IEEE Access;2024