Recognition of Aras Bird Species From Their Voices With Deep Learning Methods-Reference-Cited by-同舟云学术

Recognition of Aras Bird Species From Their Voices With Deep Learning Methods

Published:2022-09-01 Issue: Volume: Page:1250-1263
ISSN:2146-0574
Container-title:Journal of the Institute of Science and Technology
language:en
Short-container-title:Iğdır Üniv. Fen Bil Enst. Der.

Author:

BAYAT Seda¹,IŞIK Gültekin²

Affiliation:

1. IGDIR UNIVERSITY

2. Iğdır Üniversitesi

Abstract

This study focuses on recognizing bird species from their voices, which are frequently seen in Aras River Bird Sanctuary of Iğdır. For this purpose, deep learning methods were used. Acoustic monitoring is carried out to examine and analyze biological diversity. Passive acoustic listeners/recorders are used for this work. In general, various analyzes are performed on the raw sound recordings collected with these recording devices. In this study, raw sound recordings obtained from birds were processed with the methods developed by us, and then bird species were classified with deep learning architectures. Classifications were carried out on 22 bird species that are frequently seen in Aras Bird Sanctuary. Audio recordings were made into 10-second clips and then converted into one-second log mel spectrograms. Convolutional Neural Networks (CNN) and Long Short-Term Memory Neural Networks (LSTM), which are deep learning architectures, were used as classification methods. In addition to these two models, the Transfer Learning method was also used. Highlevel feature vectors of sounds were extracted with VGGish and YAMNet models, which are pre-trained convolutional neural networks, used for transfer learning. These extracted vectors formed the input layers of the classifiers. Accuracy rates and F1 scores of four different architectures were found through experiments. Accordingly, the highest accuracy rate (acc) and F1 score were obtained with the classifier using the VGGish model with 94.2% and 92.8%, respectively.

Publisher

Igdir University

Subject

General Medicine

Reference50 articles.

1. Abadi, M, Agarwal A, Barham P, Brevdo E, Chen Z, Citro C, Corrado G. S, Davis A, Dean J, & Devin M. (2016). Tensorflow: Large-scale machine learning on heterogeneous distributed systems. arXiv 2016. arXiv preprint arXiv:1603.04467.

2. Aide T. M, Corrada-Bravo C, Campos-Cerqueira M, Milan C, Vega G, & Alvarez R. (2013). Real-time bioacoustics monitoring and automated species identification. PeerJ, 2013(1).

3. Akhtar N, & Mian A. (2018). Threat of Adversarial Attacks on Deep Learning in Computer Vision: A Survey. Içinde IEEE Access (C. 6, ss. 14410–14430). Institute of Electrical and Electronics Engineers Inc.

4. Bardeli R, Wolff D, Kurth F, Koch M, Tauchert K. H, & Frommolt K. H. (2010). Detecting bird sounds in a complex acoustic environment and application to bioacoustic monitoring. Pattern Recognition Letters, 31(12), 1524–1534.

5. Barrowclough G. F, Cracraft J, Klicka J, & Zink R. M. (2016). How Many Kinds of Birds Are There and Why Does It Matter? PLOS ONE, 11(11), 1–15.

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Improving Plant Disease Recognition Through Gradient-Based Few-shot Learning with Attention Mechanisms;Iğdır Üniversitesi Fen Bilimleri Enstitüsü Dergisi;2023-09-01

2. Evaluating the Effectiveness of Different Machine Learning Approaches for Sentiment Classification;Iğdır Üniversitesi Fen Bilimleri Enstitüsü Dergisi;2023-09-01

3. A Vision Transformer-based Approach for Automatic COVID-19 Diagnosis on Chest X-ray Images;Iğdır Üniversitesi Fen Bilimleri Enstitüsü Dergisi;2023-06-01

4. Derin Evrişimli Sinir Ağları Kullanılarak Pirinç Hastalıklarının Sınıflandırılması;Iğdır Üniversitesi Fen Bilimleri Enstitüsü Dergisi;2023-06-01

5. A new YOLO-based method for social distancing from real-time videos;Neural Computing and Applications;2023-04-07