Abstract
Underwater cameras are used extensively across a wide range of applications in marine ecology. To process the vast amount of data they generate efficiently, however, we need tools that can automatically detect and recognize the species captured on film. Classifying fish species from videos and images in natural environments is challenging because of noise and variation in illumination and the surrounding habitat. In this paper, we propose a two-step deep learning approach for the detection and classification of temperate fishes without pre-filtering. The first step detects every individual fish in an image, independent of species and sex. For this purpose, we employ the You Only Look Once (YOLO) object detection technique. In the second step, we adopt a Convolutional Neural Network (CNN) with the Squeeze-and-Excitation (SE) architecture to classify each detected fish. We apply transfer learning to overcome the limited number of training samples of temperate fishes and to improve classification accuracy. This is done by pre-training the object detection model on ImageNet and the fish classifier on a public dataset (Fish4Knowledge), after which both the object detector and the classifier are updated with images of the temperate fishes of interest. The weights obtained from pre-training serve as the starting point (prior) for post-training. With the pre-training model, our solution achieves a state-of-the-art accuracy of 99.27%. The accuracies of the post-training model are also high: 83.68% and 87.74% with and without image augmentation, respectively. This strongly indicates that the solution is viable with a more extensive dataset.
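To make the Squeeze-and-Excitation step concrete, the following is a minimal sketch of a single SE block in plain Python. It is an illustration under stated assumptions, not the authors' network: the function name `squeeze_excite`, the weight matrices `w1` and `w2`, the omission of bias terms, and the toy input are all hypothetical, and the abstract does not specify where SE blocks sit in the CNN or which reduction ratio is used.

```python
import math


def squeeze_excite(feature_map, w1, w2):
    """Apply one Squeeze-and-Excitation block (biases omitted for brevity).

    feature_map: list of C channels, each a list of H rows of W floats.
    w1: reduction weights, shape (C // r) x C   (hypothetical values).
    w2: expansion weights, shape C x (C // r)   (hypothetical values).
    Returns the rescaled feature map and the per-channel gates.
    """
    # Squeeze: global average pooling turns each channel into one number.
    squeezed = [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in feature_map
    ]
    # Excitation, part 1: fully connected reduction followed by ReLU.
    hidden = [
        max(0.0, sum(w * s for w, s in zip(weights, squeezed)))
        for weights in w1
    ]
    # Excitation, part 2: fully connected expansion followed by a sigmoid,
    # which yields one gate in (0, 1) per channel.
    gates = [
        1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(weights, hidden))))
        for weights in w2
    ]
    # Scale: multiply every value in a channel by that channel's gate.
    scaled = [
        [[value * gate for value in row] for row in channel]
        for channel, gate in zip(feature_map, gates)
    ]
    return scaled, gates


# Toy input: 2 channels of size 2x2, reduced to 1 hidden unit (r = 2).
toy_map = [
    [[1.0, 1.0], [1.0, 1.0]],
    [[3.0, 3.0], [3.0, 3.0]],
]
toy_w1 = [[0.5, 0.5]]
toy_w2 = [[0.0], [1.0]]
toy_scaled, toy_gates = squeeze_excite(toy_map, toy_w1, toy_w2)
print(toy_gates)
```

In the toy call, the zero weight for the first channel produces a gate of exactly 0.5, so that channel is halved, while the second channel receives a larger gate and is attenuated less. In a trained network the weights are learned, which lets the block emphasize channels that are informative for the fish being classified.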
Publisher
Springer Science and Business Media LLC
References
32 articles.
Cited by
94 articles.