Segmentation Network for Multi-Shape Tea Bud Leaves Based on Attention and Path Feature Aggregation
-
Published:2024-08-17
Issue:8
Volume:14
Page:1388
-
ISSN:2077-0472
-
Container-title:Agriculture
-
language:en
-
Short-container-title:Agriculture
Author:
Chen Tianci1, Li Haoxin1, Lv Jinhong1, Chen Jiazheng2, Wu Weibin13
Affiliation:
1. National Key Laboratory of Agricultural Equipment Technology, College of Engineering, South China Agricultural University, Guangzhou 510642, China 2. College of Mechanical and Electrical Engineering, Zhongkai University of Agriculture and Engineering, Guangzhou 510225, China 3. Guangdong Engineering Technology Research Center for Creative Hilly Orchard Machinery, Guangzhou 510642, China
Abstract
Accurately detecting tea bud leaves is crucial for the automation of tea picking robots. However, challenges arise due to tea stem occlusion and overlapping of buds and leaves, presenting varied shapes of one bud–one leaf targets in the field of view, making precise segmentation of tea bud leaves challenging. To improve the segmentation accuracy of one bud–one leaf targets with different shapes and fine granularity, this study proposes a novel semantic segmentation model for tea bud leaves. The method designs a hierarchical Transformer block based on a self-attention mechanism in the encoding network, which is beneficial for capturing long-range dependencies between features and enhancing the representation of common features. Then, a multi-path feature aggregation module is designed to effectively merge the feature outputs of encoder blocks with decoder outputs, thereby alleviating the loss of fine-grained features caused by downsampling. Furthermore, a refined polarized attention mechanism is employed after the aggregation module to perform polarized filtering on features in channel and spatial dimensions, enhancing the output of fine-grained features. The experimental results demonstrate that the proposed Unet-Enhanced model achieves segmentation performance well on one bud–one leaf targets with different shapes, with a mean intersection over union (mIoU) of 91.18% and a mean pixel accuracy (mPA) of 95.10%. The semantic segmentation network can accurately segment tea bud leaves, providing a decision-making basis for the spatial positioning of tea picking robots.
Funder
2024 Rural Revitalization Strategy Special Funds Provincial Project Guangdong Province (Shenzhen) Digital and Intelligent Agricultural Service Industrial Park Construction of Smart Agricultural Machinery and Control Technology Research and Development 2023 Guangdong Provincial Special Fund for Modern Agriculture Industry Technology Innovation Teams
Reference45 articles.
1. Xie, S., and Sun, H. (2023). Tea-YOLOv8s: A tea bud detection model based on deep learning and computer vision. Sensors, 23. 2. Wang, J., Li, X., Yang, G., Wang, F., Men, S., Xu, B., Xu, Z., Yang, H., and Yan, L. (2022). Research on Tea Trees Germination Density Detection Based on Improved YOLOv5. Forests, 13. 3. Zhu, Y., Wu, C., Tong, J., Tong, J., Chen, J., He, L., Wang, R., and Jia, J. (2021). Deviation tolerance performance evaluation and experiment of picking end effector for famous tea. Agriculture, 11. 4. Detection and classification of tea buds based on deep learning;Xu;Comput. Electron. Agric.,2022 5. Zhang, S., Yang, H., Yang, C., Zhang, S., Yang, H., Yang, C., Yuan, W., Li, X., Wang, X., and Zhang, Y. (2023). Edge device detection of tea leaves with one bud and two leaves based on ShuffleNetv2-YOLOv5-Lite-E. Agronomy, 13.
|
|