An Effective Method for Underwater Biological Multi-Target Detection Using Mask Region-Based Convolutional Neural Network

Author:

Yue Zhaoxin123ORCID,Yan Bing4,Liu Huaizhi5,Chen Zhe4ORCID

Affiliation:

1. School of Computer and Software, Nanjing Vocational University of Industry Technology, Nanjing 210023, China

2. Key Laboratory of River Basin Digital Twinning of Ministry of Water Resources, China Institute of Water Resources and Hydropower Research, Beijing 100038, China

3. Industrial Perception and Intelligent Manufacturing Equipment Engineering Research Center of Jiangsu Province, Nanjing Vocational University of Industry Technology, Nanjing 210023, China

4. College of Computer and Information, Hohai University, Nanjing 211100, China

5. CSIC PRIDe (Nanjing) Atmospheric & Oceanic Information System Co., Ltd., Nanjing 211106, China

Abstract

Underwater creatures play a vital role in maintaining the delicate balance of the ocean ecosystem. In recent years, machine learning methods have been developed to identify underwater biologicals in the complex underwater environment. However, the scarcity and poor quality of underwater biological images present significant challenges to the recognition of underwater biological targets, especially multi-target recognition. To solve these problems, this paper proposed an ensemble method for underwater biological multi-target recognition. First, the CutMix method was improved for underwater biological image augmentation. Second, the white balance, multiscale retinal, and dark channel prior algorithms were combined to enhance the underwater biological image quality, which could largely improve the performance of underwater biological target recognition. Finally, an improved model was proposed for underwater biological multi-target recognition by using a mask region-based convolutional neural network (Mask-RCNN), which was optimized by the soft non-maximum suppression and attention-guided context feature pyramid network algorithms. We achieved 4.97 FPS, the mAP was 0.828, and the proposed methods could adapt well to underwater biological multi-target recognition. The recognition effectiveness of the proposed method was verified on the URPC2018 dataset by comparing it with current state-of-the-art recognition methods including you-only-look-once version 5 (YOLOv5) and the original Mask-RCNN model, where the mAP of the YOLOv5 model was lower. Compared with the original Mask-RCNN model, the mAP of the improved model increased by 3.2% to 82.8% when the FPS was reduced by only 0.38.

Funder

School Research Fund of Nanjing Vocational University of Industry Technology

Open Research Fund of Key Laboratory of River Basin Digital Twinning of Ministry of Water Resources,

Open Foundation of Industrial Perception and Intelligent Manufacturing Equipment Engineering Re-search Center of Jiangsu Province

Vocational Undergraduate Education Research Fund of Nanjing Vocational University of Industry Technology

Publisher

MDPI AG

Subject

Water Science and Technology,Aquatic Science,Geography, Planning and Development,Biochemistry

Reference43 articles.

1. Integrate MSRCR and mask R-CNN to recognize underwater creatures on small sample datasets;Song;IEEE Access,2020

2. An underwater target recognition method based on improved YOLOv4 in complex marine environment;Zhou;Syst. Sci. Control Eng.,2022

3. The pascal visual object classes challenge: A retrospective;Everingham;Int. J. Comput. Vis.,2015

4. Underwater Target Detection Based on Parallel High-Resolution Networks;Bao;Sensors,2023

5. Object detection based on regional saliency and underwater optical prior knowledge;Huibin;Chin. J. Sci. Instrum.,2014

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3