Classification of underwater soundscapes using raw hydroacoustic signals

Automatic classification of underwater soundscapes remains an open problem, mainly due to the challenges posed by multi-source interferences. Accurately classifying anthropogenic sounds in the marine environment, specifically those emanating from surface-ships, is concerned in this study. To achieve...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The Journal of the Acoustical Society of America 2023-10, Vol.154 (4_supplement), p.A304-A304
Hauptverfasser: Nie, Leixin, Zhang, Yonglin, Wang, HaiBin
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Automatic classification of underwater soundscapes remains an open problem, mainly due to the challenges posed by multi-source interferences. Accurately classifying anthropogenic sounds in the marine environment, specifically those emanating from surface-ships, is concerned in this study. To achieve this, the convolutional neural network (CNN) with sine-cardinal-likeconstrained convolutional kernels is proposed, where kernels represent unsolved filter coefficients. Raw signals of sound pressures passively received by the hydrophone are used directly in this method without the need for the routine time-frequency analysis (TFA) beforehand. The convolutional layer with constrained kernels plays a crucial role in extracting features from hydroacoustic signals, effectively acting as a learnable extension of the bandpass filter groups with flexible bandwidth. One significant advantage of the proposed approach is its adaptive filtering capability based on the real-world data, which makes it effectively filter out irrelevant interference sounds. Such heterogeneous interferences often exhibit diverse unknown spectral ranges, making them challenging for conventional TFA with fixed parameters. By leveraging this adaptability, the output of the convolutional layer serves as a task-specific spectrogram, customized to the classifier after being trained on hydroacoustic data. The experiments on the ShipsEar dataset demonstrate the promising results of our solution than TFA and vanilla CNN.
ISSN:0001-4966
1520-8524
DOI:10.1121/10.0023608