Automated classification of avian vocal activity using acoustic indices in regional and heterogeneous datasets

Acoustic indices combined with clustering and classification approaches have been increasingly used to automate identification of the presence of vocalizing taxa or acoustic events of interest. While most studies using this approach standardize data collection and study design parameters at the proj...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Methods in ecology and evolution 2021-04, Vol.12 (4), p.707-719
Hauptverfasser: Yip, Daniel A., Mahon, C. Lisa, MacPhail, Alexander G., Bayne, Erin M., Freckleton, Robert
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Acoustic indices combined with clustering and classification approaches have been increasingly used to automate identification of the presence of vocalizing taxa or acoustic events of interest. While most studies using this approach standardize data collection and study design parameters at the project or study level, recent trends in ecological research are to investigate patterns at regional or continental scales. Large‐scale studies often require collaboration between research groups and integration of data from multiple sources to fulfil objectives, which can lead to variation in recording equipment and data collection protocols. Our objectives were to determine how analytical approaches and variation in data collection and processing that is typical of regional acoustic monitoring programmes influences accuracy when identifying vocal activity in breeding birds. We used data from three regional datasets in Northern Alberta, Northern British Columbia, and Southern and Central Yukon, Canada to investigate the effect of analytical framework, sample size, local species richness and data collection variables on classification accuracy. We found supervised classification approaches to be the most effective, with boosted regression trees identifying vocalizations of breeding birds in audio recordings with a 92.0% accuracy and easily able to accommodate variation in data collection and processing parameters. We also provide recommendations on effectively processing large and heterogeneous datasets including sufficient sample size, accommodating potentially confounding variables and selecting suitable model training data. The results presented in this study can help inform decisions in data collection, data processing, and study design and analysis, maximize performance and accuracy during analysis, and efficiently process large, heterogeneous acoustic datasets to answer questions at scales previously difficult to investigate.
ISSN:2041-210X
2041-210X
DOI:10.1111/2041-210X.13548