Affiliation:
1. College of Science, Northeast Forestry University, Hexing Road 26, Harbin 150040, China
Abstract
In recent years, deep-learning methods have significantly improved the classification results in the field of plant-leaf recognition. However, limited by the model input, the original image needs to be compressed to a certain size before it can be input into the convolutional neural network. This results in great changes in the shape and texture information of some samples, thus affecting the classification accuracy of the model to a certain extent. Therefore, a minimum enclosing quadrate (MEQ) method is proposed to standardize the sample datasets. First, the minimum enclosing rectangle (MER) of the leaf is obtained in the original image, and the target area is clipped. Then, the minimum enclosing quadrate of the leaf is obtained by extending the short side of the rectangle. Finally, the sample is compressed to fit the input requirements of the model. In addition, in order to further improve the classification accuracy of plant-leaf recognition, an EC-ResNet50 model based on transfer-learning strategy is proposed and further combined with the MEQ method. The Swedish leaf, Flavia leaf, and MEW2012 leaf datasets are used to test the performance of the proposed methods, respectively. The experimental results show that using the MEQ method to standardize datasets can significantly improve the classification accuracy of neural networks. The Grad-CAM visual analysis reveals that the convolutional neural network exhibits a higher degree of attention towards the leaf surface features and utilizes more comprehensive feature regions during recognition of the leaf samples processed by MEQ method. In addition, the proposed MEQ + EC-ResNet50 method also achieved the best classification results among all the compared methods. This experiment provides a widely applicable sample standardization method for leaf recognition research, which can avoid the problem of sample deformation caused by compression processing and reduce the interference of redundant information in the image to the classification results to a certain degree.
Funder
Fundamental Research Funds for the Central Universities
Northeast Forestry University
Project of National Natural Science Foundation of China