Image-Fused-Guided Underwater Object Detection Model Based on Improved YOLOv7

Author:

Wang Zhenhua12ORCID,Zhang Guangshi12,Luan Kuifeng12ORCID,Yi Congqin12,Li Mingjie234

Affiliation:

1. College of Information Science, Shanghai Ocean University, Shanghai 201306, China

2. Key Laboratory of Marine Environmental Survey Technology and Application, Ministry of Natural Resources, Guangzhou 510300, China

3. South China Sea Institute of Planning and Environmental Research, State Oceanic Administration, Guangzhou 510310, China

4. Technology Innovation Center for South China Sea Remote Sensing, Surveying and Mapping Collaborative Application, Ministry of Natural Resources, Guangzhou 510310, China

Abstract

Underwater object detection, as the principal means of underwater environmental sensing, plays a significant part in the marine economic, military, and ecological fields. Due to the degradation problems of underwater images caused by color cast, blurring, and low contrast, we proposed a model for underwater object detection based on YOLO v7. In the presented detection model, an enhanced image branch was constructed to expand the feature extraction branch of YOLOv7, which could mitigate the feature degradation issues existing in the original underwater images. The contextual transfer block was introduced to the enhanced image branch, following the underwater image enhancement module, which could extract the domain features of the enhanced image, and the features of the original images and the enhanced images were fused before being fed into the detector. Focal EIOU was adopted as a new model bounding box regression loss, aiming to alleviate the performance degradation caused by mutual occlusion and overlapping of underwater objects. Taking URPC2020 and UTDAC2020 (Underwater Target Detection Algorithm Competition 2020) datasets as experimental datasets, the performance of our proposed model was compared against with other models, including YOLOF, YOLOv6 v3.0, DETR, Swin Transformer, and InternImage. The results show that our proposed model presents a competitive performance, achieving 80.71% and 86.32% in mAP@0.5 on URPC2020 and UTDAC2020, respectively. Comprehensively, the proposed model is capable of effectively mitigating the problems encountered in the task of object detection in underwater images with degraded features and exhibits great advancement.

Funder

Key Laboratory of Marine Environmental Survey Technology and Application, Ministry of Natural Resources

Capacity Development for Local College Project

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Reference42 articles.

1. A novel attention-based lightweight network for multiscale object detection in underwater images;Wang;J. Sens.,2022

2. Multi-view underwater image enhancement method via embedded fusion mechanism;Zhou;Eng. Appl. Artif. Intell.,2023

3. Underwater image enhancement via minimal color loss and locally adaptive contrast enhancement;Zhang;IEEE Trans. Image Process.,2022

4. Target oriented perceptual adversarial fusion network for underwater image enhancement;Jiang;IEEE Trans. Circuits Syst. Video Technol.,2022

5. Enhancing visibility of polarimetric underwater image by transmittance correction;Hu;IEEE Photonics J.,2017

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3