Small object detection algorithm incorporating swin transformer for tea buds-Reference-Cited by-同舟云学术

Small object detection algorithm incorporating swin transformer for tea buds

Published:2024-03-21 Issue:3 Volume:19 Page:e0299902
ISSN:1932-6203
Container-title:PLOS ONE
language:en
Short-container-title:PLoS ONE

Author:

Shi Meiling^ORCID,Zheng Dongling,Wu Tianhao,Zhang Wenjing,Fu Ruijie,Huang Kailiang^ORCID

Abstract

Accurate identification of small tea buds is a key technology for tea harvesting robots, which directly affects tea quality and yield. However, due to the complexity of the tea plantation environment and the diversity of tea buds, accurate identification remains an enormous challenge. Current methods based on traditional image processing and machine learning fail to effectively extract subtle features and morphology of small tea buds, resulting in low accuracy and robustness. To achieve accurate identification, this paper proposes a small object detection algorithm called STF-YOLO (Small Target Detection with Swin Transformer and Focused YOLO), which integrates the Swin Transformer module and the YOLOv8 network to improve the detection ability of small objects. The Swin Transformer module extracts visual features based on a self-attention mechanism, which captures global and local context information of small objects to enhance feature representation. The YOLOv8 network is an object detector based on deep convolutional neural networks, offering high speed and precision. Based on the YOLOv8 network, modules including Focus and Depthwise Convolution are introduced to reduce computation and parameters, increase receptive field and feature channels, and improve feature fusion and transmission. Additionally, the Wise Intersection over Union loss is utilized to optimize the network. Experiments conducted on a self-created dataset of tea buds demonstrate that the STF-YOLO model achieves outstanding results, with an accuracy of 91.5% and a mean Average Precision of 89.4%. These results are significantly better than other detectors. Results show that, compared to mainstream algorithms (YOLOv8, YOLOv7, YOLOv5, and YOLOx), the model improves accuracy and F1 score by 5-20.22 percentage points and 0.03-0.13, respectively, proving its effectiveness in enhancing small object detection performance. This research provides technical means for the accurate identification of small tea buds in complex environments and offers insights into small object detection. Future research can further optimize model structures and parameters for more scenarios and tasks, as well as explore data augmentation and model fusion methods to improve generalization ability and robustness.

Publisher

Public Library of Science (PLoS)

Reference45 articles.

1. Environmental and nutritional requirements for tea cultivation;R Hajiboland;Folia horticulturae,2017

2. Developing situations of tea plucking machine;Y Han;Engineering,2014

3. Detection and classification of tea buds based on deep learning;W Xu;Computers and Electronics in Agriculture,2022

4. YOLO-tea: A tea disease detection model improved by YOLOv5;Z Xue;Forests,2023

5. An improved YOLOv7 network using RGB-D multi-modal feature fusion for tea shoots detection;Y Wu;Computers and Electronics in Agriculture,2024

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Lightweight detection of small tools for safer construction;Automation in Construction;2024-11