Abstract
Declines in biodiversity and ecosystem health due to climate change are raising urgent concerns. In response, large-scale multispecies monitoring programmes are being implemented that increasingly adopt sensor-based approaches such as acoustic recording. These approaches rely heavily on ecological data science. However, developing reliable algorithms for processing sensor-based data relies heavily on labelled datasets of sufficient quality and quantity. We present a dataset of 1,575 dawn chorus soundscape recordings, 141 being fully annotated (n = 32,994 annotations) with avian, mammalian and amphibian vocalisations. The remaining recordings were included to facilitate novel research applications. These recordings are paired with 48 site-level climatic, forest structure and topographic covariates. This dataset provides a valuable resource to researchers developing acoustic classification algorithms or studying biodiversity and wildlife behaviour and its relationship to environmental gradients. The dawn chorus recordings were collected as part of a long-term Northern Spotted Owl monitoring program; this demonstrates the complementary value of harnessing existing monitoring efforts to strengthen biodiversity sampling.
This dataset of dawn chorus soundscape recordings is one of the few open-access acoustic datasets annotated with non-biotic and both interspecific (across species) and intraspecific (within species) bird, mammal and amphibian sonotypes and the first that is paired with climatic, forest structure and topographical covariates extracted at recorder locations. This makes it a valuable resource for researchers studying the dawn chorus and its relationship to the environment.