Embracing limited and imperfect training datasets: opportunities and challenges in plant disease recognition using deep learning-Reference-Cited by-同舟云学术

Embracing limited and imperfect training datasets: opportunities and challenges in plant disease recognition using deep learning

Published:2023-09-22 Issue: Volume:14 Page:
ISSN:1664-462X
Container-title:Frontiers in Plant Science
language:
Short-container-title:Front. Plant Sci.

Author:

Xu Mingle,Kim Hyongsuk,Yang Jucheng,Fuentes Alvaro,Meng Yao,Yoon Sook,Kim Taehyun,Park Dong Sun

Abstract

Recent advancements in deep learning have brought significant improvements to plant disease recognition. However, achieving satisfactory performance often requires high-quality training datasets, which are challenging and expensive to collect. Consequently, the practical application of current deep learning–based methods in real-world scenarios is hindered by the scarcity of high-quality datasets. In this paper, we argue that embracing poor datasets is viable and aims to explicitly define the challenges associated with using these datasets. To delve into this topic, we analyze the characteristics of high-quality datasets, namely, large-scale images and desired annotation, and contrast them with the limited and imperfect nature of poor datasets. Challenges arise when the training datasets deviate from these characteristics. To provide a comprehensive understanding, we propose a novel and informative taxonomy that categorizes these challenges. Furthermore, we offer a brief overview of existing studies and approaches that address these challenges. We point out that our paper sheds light on the importance of embracing poor datasets, enhances the understanding of the associated challenges, and contributes to the ambitious objective of deploying deep learning in real-world applications. To facilitate the progress, we finally describe several outstanding questions and point out potential future directions. Although our primary focus is on plant disease recognition, we emphasize that the principles of embracing and analyzing poor datasets are applicable to a wider range of domains, including agriculture. Our project is public available at https://github.com/xml94/EmbracingLimitedImperfectTrainingDatasets.

Publisher

Frontiers Media SA

Subject

Plant Science

Reference96 articles.

1. Plant diseases recognition on images using convolutional neural networks: A systematic review;Abade;Comput. Electron. Agric.,2021

2. Tomato plant disease detection using transfer learning with c-gan synthetic images;Abbas;Comput. Electron. Agric.,2021

3. Compress: Self-supervised learning by compressing representations;Abbasi Koohpayegani;Adv. Neural Inf. Process. Syst.,2020

4. Detecting powdery mildew disease in squash at different stages using uav-based hyperspectral imaging and artificial intelligence;Abdulridha;Biosyst. Eng.,2020

5. Convolutional neural network for automatic identification of plant diseases with limited data;Afifi;Plants,2020

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Harnessing image processing for precision disease diagnosis in sugar beet agriculture;Crop Design;2024-11

2. Investigation to answer three key questions concerning plant pest identification and development of a practical identification framework;Computers and Electronics in Agriculture;2024-07

3. An Offline Biotic Stress Recognition Tool for Rice Plants Through Domain Shift;SN Computer Science;2024-04-23

4. Deep learning for medicinal plant species classification and recognition: a systematic review;Frontiers in Plant Science;2024-01-05

5. Known and unknown class recognition on plant species and diseases;Computers and Electronics in Agriculture;2023-12