Author:
van Merriënboer Bart,Hamer Jenny,Dumoulin Vincent,Triantafillou Eleni,Denton Tom
Abstract
In the context of passive acoustic monitoring (PAM) better models are needed to reliably gain insights from large amounts of raw, unlabeled data. Bioacoustics foundation models, which are general-purpose, adaptable models that can be used for a wide range of downstream tasks, are an effective way to meet this need. Measuring the capabilities of such models is essential for their development, but the design of robust evaluation procedures is a complex process. In this review we discuss a variety of fields that are relevant for the evaluation of bioacoustics models, such as sound event detection, machine learning metrics, and transfer learning (including topics such as few-shot learning and domain generalization). We contextualize these topics using the particularities of bioacoustics data, which is characterized by large amounts of noise, strong class imbalance, and distribution shifts (differences in the data between training and deployment stages). Our hope is that these insights will help to inform the design of evaluation protocols that can more accurately predict the ability of bioacoustics models to be deployed reliably in a wide variety of settings.
Reference100 articles.
1. A framework for the robust evaluation of sound event detection;Bilen,2020
2. Automatic detection and compression for passive acoustic monitoring of the african forest elephant;Bjorck;Proc. AAAI Conf. Artif. Intell.,2019
3. On the opportunities and risks of foundation models;Bommasani;arXiv preprint arXiv:2108.07258,2021
4. Audiolm: a language modeling approach to audio generation;Borsos;IEEE/ACM Trans. Audio Speech Lang. Process,2023