A Seamless Deep Learning Approach for Apple Detection, Depth Estimation, and Tracking Using YOLO Models Enhanced by Multi-Head Attention Mechanism

Author:

Sekharamantry Praveen Kumar12ORCID,Melgani Farid1ORCID,Malacarne Jonni1ORCID,Ricci Riccardo1ORCID,de Almeida Silva Rodrigo3ORCID,Marcato Junior Jose3ORCID

Affiliation:

1. Department of Information Engineering and Computer Science, University of Trento, 38123 Trento, Italy

2. Department of Computer Science and Engineering, GITAM School of Technology, GITAM (Deemed to be University), Visakhapatnam 530045, India

3. Faculty of Engineering, Architecture and Urbanism and Geography, Federal University of Mato Grosso do Sul, Campo Grande 79070-900, Brazil

Abstract

Considering precision agriculture, recent technological developments have sparked the emergence of several new tools that can help to automate the agricultural process. For instance, accurately detecting and counting apples in orchards is essential for maximizing harvests and ensuring effective resource management. However, there are several intrinsic difficulties with traditional techniques for identifying and counting apples in orchards. To identify, recognize, and detect apples, apple target detection algorithms, such as YOLOv7, have shown a great deal of reflection and accuracy. But occlusions, electrical wiring, branches, and overlapping pose severe issues for precisely detecting apples. Thus, to overcome these issues and accurately recognize apples and find the depth of apples from drone-based videos in complicated backdrops, our proposed model combines a multi-head attention system with the YOLOv7 object identification framework. Furthermore, we provide the ByteTrack method for apple counting in real time, which guarantees effective monitoring of apples. To verify the efficacy of our suggested model, a thorough comparison assessment is performed with several current apple detection and counting techniques. The outcomes adequately proved the effectiveness of our strategy, which continuously surpassed competing methods to achieve exceptional accuracies of 0.92, 0.96, and 0.95 with respect to precision, recall, and F1 score, and a low MAPE of 0.027, respectively.

Funder

Fondazione Caritro

Italian Ministry of Foreign Affairs and International Cooperation and the Brazilian National Council of State Funding Agencies

Publisher

MDPI AG

Reference56 articles.

1. Zou, Z., Chen, K., Shi, Z., Guo, Y., and Ye, J. (2019). Object detection in 20 years: A survey. arXiv.

2. Vision Based Computing Systems for Healthcare Applications;Murala;J. Healthc. Eng.,2019

3. Apple Detection during Different Growth Stages in Orchards Using the Improved YOLO-V3 Model;Tian;Comput. Electron. Agric.,2019

4. Current situations and prospects of smart agriculture;Zhao;J. South China Agric. Univ.,2021

5. Cohen, O., Linker, R., and Naor, A. (2010, January 22–25). Estimation of the number of apples in color images recorded in orchards. Proceedings of the International Conference on Computer and Computing Technologies in Agriculture, Nanchang, China.

Cited by 4 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3