Deep Learning for Enhanced Marine Vision: Object Detection in Underwater Environments-Reference-Cited by-同舟云学术

Deep Learning for Enhanced Marine Vision: Object Detection in Underwater Environments

Published:2023-12-26 Issue:4 Volume:11 Page:1209-1218
ISSN:2347-470X
Container-title:International Journal of Electrical and Electronics Research
language:en
Short-container-title:IJEER

Author:

Dakhil Radhwan Adnan¹,Khayeat Ali Retha Hasoon¹

Affiliation:

1. Department of Computer Science, College of Computer Science and Information Technology, University of Kerbala, Karbala, Iraq

Abstract

This study leverages the Semantic Segmentation of Underwater Imagery (SUIM) dataset, encompassing over 1,500 meticulously annotated images that delineate eight distinct object categories. These categories encompass a diverse array, ranging from vertebrate fish and invertebrate reefs to aquatic vegetation, wreckage, human divers, robots, and the seafloor. The use of this dataset involves a methodical synthesis of data through extensive oceanic expeditions and collaborative experiments, featuring both human participants and robots. The research extends its scope to evaluating cutting-edge semantic segmentation techniques, employing established metrics to gauge their performance comprehensively. Additionally, we introduce a fully convolutional encoder-decoder model designed with a dual purpose: delivering competitive performance and computational efficiency. Notably, this model boasts a remarkable accuracy of 88%, underscoring its proficiency in underwater image segmentation. Furthermore, this model's integration within the autonomy pipeline of visually-guided underwater robots presents its tangible applicability. Its rapid end-to-end inference capability addresses the exigencies of real-time decision-making, vital for autonomous systems. This study elucidates the model's practical benefits across diverse applications like visual serving, saliency prediction, and intricate scene comprehension. Crucially, the utilization of the Enhanced Super-Resolution Generative Adversarial Network (ESRGAN) elevates image quality, enriching the foundation upon which our model's success rests. This research establishes a solid groundwork for future exploration in underwater robot vision by presenting the model and the benchmark dataset.

Publisher

FOREX Publication

Subject

Electrical and Electronic Engineering,Engineering (miscellaneous)

Reference35 articles.

1. A. Garcia-Garcia, S. Orts-Escolano, S. Oprea, V. Villena-Martinez, and J. J. a. p. a. Garcia-Rodriguez, "A review on deep learning techniques applied to semantic segmentation," 2017. https://doi.org/10.48550/arXiv.1704.06857.

2. M. Jian, Q. Qi, J. Dong, Y. Yin, K.-M. J. J. o. v. c. Lam, and i. representation, "Integrating QDWD with pattern distinctness and local contrast for underwater saliency detection," vol. 53, pp. 31-41, 2018. https://doi.org/10.1016/j.jvcir.2018.03.008.

3. M. Sharma, J. Lim, and H. J. A. S. Lee, "The amalgamation of the object detection and semantic segmentation for steel surface defect detection," vol. 12, no. 12, p. 6004, 2022. https://doi.org/10.3390/app12126004.

4. M. J. Islam, Y. Xia, J. J. I. R. Sattar, and A. Letters, "Fast underwater image enhancement for improved visual perception," vol. 5, no. 2, pp. 3227-3234, 2020. https://doi.org/10.1109/LRA.2020.2974710.

5. I. Alonso, M. Yuval, G. Eyal, T. Treibitz, and A. C. J. J. o. F. R. Murillo, "CoralSeg: Learning coral segmentation from sparse annotations," vol. 36, no. 8, pp. 1456-1477, 2019. https://doi.org/10.1002/rob.21915.