LiTasNeT: A Bird Sound Separation Algorithm Based on Deep Learning
Published in: | International Journal of Sociotechnology and Knowledge Development, 2022, Vol. 14 (1), pp. 1-19 |
---|---|
Main authors: | , , , |
Format: | Article |
Language: | English |
Abstract: | Recent advances in deep learning techniques and acoustic sensor networks offer a new way to monitor birds continuously. Deep learning methods have led to considerable progress in audio source separation (ASS). However, deploying deep learning models on embedded devices remains a challenge. Finding an efficient way to compress these large models without degrading ASS performance has therefore become an important research topic. In birds' natural habitats, it is common for several birds to sing simultaneously. This overlap leads to false results when identifying a particular bird species, so separating the required bird sound from the recorded mixture becomes indispensable. In this paper, a novel model called Lite TasNet (LiTasNeT) is proposed. Building on conventional ASS methods, LiTasNeT has obtained leading results in several standardized ASS areas. LiTasNeT is designed with a parameter-sharing scheme to lower memory consumption. Moreover, its low latency makes it well suited to real-time on-device applications. |
---|---|
ISSN: | 1941-6253 1941-6261 |
DOI: | 10.4018/IJSKD.301261 |