Authors:
Tang Shixi, Xia Zilin, Gu Jinan, Wang Wenbo, Huang Zedong, Zhang Wenhao
Abstract
Intelligent apple-picking robots can significantly improve the efficiency of apple harvesting, and fast, accurate recognition and localization of apples is the prerequisite for their operation. Existing apple recognition and localization methods rely primarily on object detection and semantic segmentation, and these methods often suffer from localization errors when apples are occluded or overlapping. Furthermore, the few existing instance segmentation methods are inefficient and heavily dependent on detection results. This paper therefore proposes an apple recognition and localization method based on RGB-D imagery and an improved SOLOv2 instance segmentation network. To improve the efficiency of the instance segmentation network, EfficientNetV2, known for its high parameter efficiency, is employed as the feature extraction backbone. To enhance segmentation accuracy when apples are occluded or overlapping, a lightweight spatial attention module is proposed; it improves the model's position sensitivity so that positional features can distinguish overlapping objects whose semantic features are similar. To accurately determine apple-picking points, an RGB-D-based apple localization method is introduced. Comparative experiments show that the improved SOLOv2 instance segmentation method performs strongly: compared to SOLOv2, its F1 score, mAP, and mIoU on the apple instance segmentation dataset increase by 2.4%, 3.6%, and 3.8%, respectively, while the model's parameters and FLOPs decrease by 1.94 M and 31 GFLOPs, respectively. A total of 60 samples were gathered for the analysis of localization error. The results indicate that the proposed method achieves high localization precision, with errors along the X, Y, and Z axes ranging from 0 to 3.95 mm, 0 to 5.16 mm, and 0 to 1 mm, respectively.
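The abstract does not specify the internal design of the proposed lightweight spatial attention module, so the sketch below shows only a generic, CBAM-style spatial attention block of the kind the abstract describes: a cheap per-location gate that re-weights backbone features so positional cues can separate overlapping instances. The class name, kernel size, and pooling scheme are illustrative assumptions, not the paper's module.

```python
# A minimal sketch of a generic lightweight spatial attention block
# (CBAM-style). The paper's exact design is not given in the abstract;
# every name and hyperparameter here is a hypothetical illustration.
import torch
import torch.nn as nn

class SpatialAttention(nn.Module):
    """Re-weights each spatial location so positional features can
    distinguish overlapping objects with similar semantic features."""
    def __init__(self, kernel_size: int = 7):
        super().__init__()
        # A single conv over two channel-pooled maps keeps the module light:
        # its weight count (2 * kernel_size**2) is independent of the
        # number of input channels.
        self.conv = nn.Conv2d(2, 1, kernel_size,
                              padding=kernel_size // 2, bias=False)
        self.sigmoid = nn.Sigmoid()

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Pool across channels to summarize "what is where".
        avg_pool = x.mean(dim=1, keepdim=True)      # (B, 1, H, W)
        max_pool, _ = x.max(dim=1, keepdim=True)    # (B, 1, H, W)
        attn = self.sigmoid(self.conv(torch.cat([avg_pool, max_pool], dim=1)))
        return x * attn                             # spatially gated features

if __name__ == "__main__":
    feats = torch.randn(2, 256, 64, 64)   # e.g., one FPN level from the backbone
    print(SpatialAttention()(feats).shape)  # torch.Size([2, 256, 64, 64])
```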
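Likewise, the RGB-D localization procedure is only summarized in the abstract. A common realization, sketched below under assumed conditions, pairs each instance mask with a depth image aligned to the RGB frame and back-projects a picking point through the pinhole camera model. The intrinsics (fx, fy, cx, cy), the centroid-based picking point, and the median-depth estimate are assumptions for illustration, not the paper's exact method.

```python
# A minimal sketch of RGB-D back-projection for an apple-picking point,
# assuming a pinhole camera model and a depth image aligned to the RGB
# frame. All parameter choices here are hypothetical.
import numpy as np

def picking_point_from_mask(mask: np.ndarray, depth_m: np.ndarray,
                            fx: float, fy: float,
                            cx: float, cy: float) -> tuple:
    """Return (X, Y, Z) in meters, camera frame, for one instance mask."""
    ys, xs = np.nonzero(mask)            # pixels belonging to one apple
    u, v = xs.mean(), ys.mean()          # 2D centroid of the instance mask
    z = np.median(depth_m[ys, xs])       # median depth is robust to holes
    # Pinhole model: X = (u - cx) * Z / fx, Y = (v - cy) * Z / fy
    return ((u - cx) * z / fx, (v - cy) * z / fy, z)

if __name__ == "__main__":
    mask = np.zeros((480, 640), dtype=bool)
    mask[200:260, 300:360] = True        # a toy apple mask
    depth = np.full((480, 640), 0.85)    # toy depth: 0.85 m everywhere
    print(picking_point_from_mask(mask, depth,
                                  fx=615.0, fy=615.0, cx=320.0, cy=240.0))
```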