Affiliation:
1. College of Plant Protection South China Agricultural University Guangzhou China
2. College of Mathematics and Informatics South China Agricultural University Guangzhou China
3. College of Plant Protection China Agricultural University Beijing China
Abstract
AbstractBACKGROUNDWith the rapid development of deep learning, the recognition of rice disease images using deep neural networks has become a hot research topic. However, most previous studies only focus on the modification of deep learning models, while lacking research to systematically and scientifically explore the impact of different data sizes on the image recognition task for rice diseases. In this study, a functional model was developed to predict the relationship between the size of dataset and the accuracy rate of model recognition.RESULTSTraining VGG16 deep learning models with different quantities of images of rice blast‐diseased leaves and healthy rice leaves, it was found that the test accuracy of the resulting models could be well fitted with an exponential model (A = 0.9965 − e(−0.0603×I50−1.6693)). Experimental results showed that with an increase of image quantity, the recognition accuracy of deep learning models would show a rapid increase at first. Yet when the image quantity increases beyond a certain threshold, the accuracy of image classification would not improve much, and the marginal benefit would be reduced. This trend remained similar when the composition of the dataset was changed, no matter whether (i) the disease class was changed, (ii) the number of classes was increased or (iii) the image data were augmented.CONCLUSIONSThis study provided a scientific basis for the impact of data size on the accuracy of rice disease image recognition, and may also serve as a reference for researchers for database construction. © 2024 Society of Chemical Industry.