Soft-Output Signal Detection for Cetacean Vocalizations Using Spectral Entropy, K-Means Clustering and the Continuous Wavelet Transform

Underwater acoustic monitoring systems record many hours of audio data for marine research, making fast and reliable non-causal signal detection paramount. Such detectors assist in reducing the amount of labor required for signal annotations, which often contain large portions devoid of signals. Cet...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2022-11
Hauptverfasser: Rademan, Marco W, Versfeld, Daniel J, du Preez, Johan A
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Rademan, Marco W
Versfeld, Daniel J
du Preez, Johan A
description Underwater acoustic monitoring systems record many hours of audio data for marine research, making fast and reliable non-causal signal detection paramount. Such detectors assist in reducing the amount of labor required for signal annotations, which often contain large portions devoid of signals. Cetacean vocalization detection based on spectral entropy is investigated as a means of vocalization discovery. Previous techniques using spectral entropy (SE) mostly consider time-frequency enhancement of the entropy measure, and utilize the STFT as its time-frequency (TF) decomposition. SE methods also requires the user to set a detection threshold manually, which call for knowledge of the produced entropy measures. This paper considers median filtering as a simple, effective way to provide temporal stabilization to the entropy measure, and considers the CWT as an alternative TF decomposition. K-means clustering is used to determine the threshold required to accurately separate the signal/no-signal entropy measures, resulting in a one-dimensional, two-class classification problem. The class means are used to perform pseudo-probabilistic soft class assignment, which is a useful metric in algorithmic development. The effect of median filtering, signal-to-noise ratio and the chosen TF decomposition are investigated. The proposed method shows a significant improvement in detection accuracy and specificity, while also providing a more interpretable detection threshold setting via soft class assignment.
doi_str_mv 10.48550/arxiv.2211.01065
format Article
fullrecord <record><control><sourceid>proquest_arxiv</sourceid><recordid>TN_cdi_arxiv_primary_2211_01065</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2731610795</sourcerecordid><originalsourceid>FETCH-LOGICAL-a955-282e66d46cc9480526332d8e86dacf680908ccd2d448560ae14ea805c4b648493</originalsourceid><addsrcrecordid>eNot0MtOAjEUgOHGxESCPIArm7h1sHc6SzPiJWJYgLqc1E4Hhwzt2HaI-AK-tgVcdXG-c5r8AFxgNGaSc3Sj_HezHROC8RhhJPgJGBBKcSYZIWdgFMIaIUTEhHBOB-B34eqYzfvY9REumpVVLbwz0ejYOAtr52FhotJGWfjmtGqbH7WfBPgaGruCiy5Jn3amNnrX7a7hc_aScIBF24do_B4pW8H4aWDhbGxs7_oA39XWtCbCpU82_bI5B6e1aoMZ_b9DsLyfLovHbDZ_eCpuZ5nKOc-IJEaIigmtcyYRJ4JSUkkjRaV0LSTKkdS6IhVLLQRSBjOjktPsQzDJcjoEl8ezh0pl55uN8rtyX6s81Eri6ig67756E2K5dr1PWUJJJhQLjCY5p3835W5S</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2731610795</pqid></control><display><type>article</type><title>Soft-Output Signal Detection for Cetacean Vocalizations Using Spectral Entropy, K-Means Clustering and the Continuous Wavelet Transform</title><source>arXiv.org</source><source>Free E- Journals</source><creator>Rademan, Marco W ; Versfeld, Daniel J ; du Preez, Johan A</creator><creatorcontrib>Rademan, Marco W ; Versfeld, Daniel J ; du Preez, Johan A</creatorcontrib><description>Underwater acoustic monitoring systems record many hours of audio data for marine research, making fast and reliable non-causal signal detection paramount. Such detectors assist in reducing the amount of labor required for signal annotations, which often contain large portions devoid of signals. Cetacean vocalization detection based on spectral entropy is investigated as a means of vocalization discovery. Previous techniques using spectral entropy (SE) mostly consider time-frequency enhancement of the entropy measure, and utilize the STFT as its time-frequency (TF) decomposition. SE methods also requires the user to set a detection threshold manually, which call for knowledge of the produced entropy measures. This paper considers median filtering as a simple, effective way to provide temporal stabilization to the entropy measure, and considers the CWT as an alternative TF decomposition. K-means clustering is used to determine the threshold required to accurately separate the signal/no-signal entropy measures, resulting in a one-dimensional, two-class classification problem. The class means are used to perform pseudo-probabilistic soft class assignment, which is a useful metric in algorithmic development. The effect of median filtering, signal-to-noise ratio and the chosen TF decomposition are investigated. The proposed method shows a significant improvement in detection accuracy and specificity, while also providing a more interpretable detection threshold setting via soft class assignment.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.2211.01065</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Annotations ; Audio data ; Cluster analysis ; Clustering ; Continuous wavelet transform ; Decomposition ; Entropy ; Filtration ; Signal detection ; Signal to noise ratio ; Statistics - Applications ; Time-frequency analysis ; Underwater acoustics ; Vector quantization ; Wavelet transforms</subject><ispartof>arXiv.org, 2022-11</ispartof><rights>2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,781,785,886,27930</link.rule.ids><backlink>$$Uhttps://doi.org/10.48550/arXiv.2211.01065$$DView paper in arXiv$$Hfree_for_read</backlink><backlink>$$Uhttps://doi.org/10.1016/j.ecoinf.2023.101990$$DView published paper (Access to full text may be restricted)$$Hfree_for_read</backlink></links><search><creatorcontrib>Rademan, Marco W</creatorcontrib><creatorcontrib>Versfeld, Daniel J</creatorcontrib><creatorcontrib>du Preez, Johan A</creatorcontrib><title>Soft-Output Signal Detection for Cetacean Vocalizations Using Spectral Entropy, K-Means Clustering and the Continuous Wavelet Transform</title><title>arXiv.org</title><description>Underwater acoustic monitoring systems record many hours of audio data for marine research, making fast and reliable non-causal signal detection paramount. Such detectors assist in reducing the amount of labor required for signal annotations, which often contain large portions devoid of signals. Cetacean vocalization detection based on spectral entropy is investigated as a means of vocalization discovery. Previous techniques using spectral entropy (SE) mostly consider time-frequency enhancement of the entropy measure, and utilize the STFT as its time-frequency (TF) decomposition. SE methods also requires the user to set a detection threshold manually, which call for knowledge of the produced entropy measures. This paper considers median filtering as a simple, effective way to provide temporal stabilization to the entropy measure, and considers the CWT as an alternative TF decomposition. K-means clustering is used to determine the threshold required to accurately separate the signal/no-signal entropy measures, resulting in a one-dimensional, two-class classification problem. The class means are used to perform pseudo-probabilistic soft class assignment, which is a useful metric in algorithmic development. The effect of median filtering, signal-to-noise ratio and the chosen TF decomposition are investigated. The proposed method shows a significant improvement in detection accuracy and specificity, while also providing a more interpretable detection threshold setting via soft class assignment.</description><subject>Annotations</subject><subject>Audio data</subject><subject>Cluster analysis</subject><subject>Clustering</subject><subject>Continuous wavelet transform</subject><subject>Decomposition</subject><subject>Entropy</subject><subject>Filtration</subject><subject>Signal detection</subject><subject>Signal to noise ratio</subject><subject>Statistics - Applications</subject><subject>Time-frequency analysis</subject><subject>Underwater acoustics</subject><subject>Vector quantization</subject><subject>Wavelet transforms</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GOX</sourceid><recordid>eNot0MtOAjEUgOHGxESCPIArm7h1sHc6SzPiJWJYgLqc1E4Hhwzt2HaI-AK-tgVcdXG-c5r8AFxgNGaSc3Sj_HezHROC8RhhJPgJGBBKcSYZIWdgFMIaIUTEhHBOB-B34eqYzfvY9REumpVVLbwz0ejYOAtr52FhotJGWfjmtGqbH7WfBPgaGruCiy5Jn3amNnrX7a7hc_aScIBF24do_B4pW8H4aWDhbGxs7_oA39XWtCbCpU82_bI5B6e1aoMZ_b9DsLyfLovHbDZ_eCpuZ5nKOc-IJEaIigmtcyYRJ4JSUkkjRaV0LSTKkdS6IhVLLQRSBjOjktPsQzDJcjoEl8ezh0pl55uN8rtyX6s81Eri6ig67756E2K5dr1PWUJJJhQLjCY5p3835W5S</recordid><startdate>20221122</startdate><enddate>20221122</enddate><creator>Rademan, Marco W</creator><creator>Versfeld, Daniel J</creator><creator>du Preez, Johan A</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>EPD</scope><scope>GOX</scope></search><sort><creationdate>20221122</creationdate><title>Soft-Output Signal Detection for Cetacean Vocalizations Using Spectral Entropy, K-Means Clustering and the Continuous Wavelet Transform</title><author>Rademan, Marco W ; Versfeld, Daniel J ; du Preez, Johan A</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a955-282e66d46cc9480526332d8e86dacf680908ccd2d448560ae14ea805c4b648493</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Annotations</topic><topic>Audio data</topic><topic>Cluster analysis</topic><topic>Clustering</topic><topic>Continuous wavelet transform</topic><topic>Decomposition</topic><topic>Entropy</topic><topic>Filtration</topic><topic>Signal detection</topic><topic>Signal to noise ratio</topic><topic>Statistics - Applications</topic><topic>Time-frequency analysis</topic><topic>Underwater acoustics</topic><topic>Vector quantization</topic><topic>Wavelet transforms</topic><toplevel>online_resources</toplevel><creatorcontrib>Rademan, Marco W</creatorcontrib><creatorcontrib>Versfeld, Daniel J</creatorcontrib><creatorcontrib>du Preez, Johan A</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection (Proquest) (PQ_SDU_P3)</collection><collection>ProQuest Engineering Collection</collection><collection>ProQuest Engineering Database</collection><collection>Access via ProQuest (Open Access)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><collection>arXiv Statistics</collection><collection>arXiv.org</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Rademan, Marco W</au><au>Versfeld, Daniel J</au><au>du Preez, Johan A</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Soft-Output Signal Detection for Cetacean Vocalizations Using Spectral Entropy, K-Means Clustering and the Continuous Wavelet Transform</atitle><jtitle>arXiv.org</jtitle><date>2022-11-22</date><risdate>2022</risdate><eissn>2331-8422</eissn><abstract>Underwater acoustic monitoring systems record many hours of audio data for marine research, making fast and reliable non-causal signal detection paramount. Such detectors assist in reducing the amount of labor required for signal annotations, which often contain large portions devoid of signals. Cetacean vocalization detection based on spectral entropy is investigated as a means of vocalization discovery. Previous techniques using spectral entropy (SE) mostly consider time-frequency enhancement of the entropy measure, and utilize the STFT as its time-frequency (TF) decomposition. SE methods also requires the user to set a detection threshold manually, which call for knowledge of the produced entropy measures. This paper considers median filtering as a simple, effective way to provide temporal stabilization to the entropy measure, and considers the CWT as an alternative TF decomposition. K-means clustering is used to determine the threshold required to accurately separate the signal/no-signal entropy measures, resulting in a one-dimensional, two-class classification problem. The class means are used to perform pseudo-probabilistic soft class assignment, which is a useful metric in algorithmic development. The effect of median filtering, signal-to-noise ratio and the chosen TF decomposition are investigated. The proposed method shows a significant improvement in detection accuracy and specificity, while also providing a more interpretable detection threshold setting via soft class assignment.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.2211.01065</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2022-11
issn 2331-8422
language eng
recordid cdi_arxiv_primary_2211_01065
source arXiv.org; Free E- Journals
subjects Annotations
Audio data
Cluster analysis
Clustering
Continuous wavelet transform
Decomposition
Entropy
Filtration
Signal detection
Signal to noise ratio
Statistics - Applications
Time-frequency analysis
Underwater acoustics
Vector quantization
Wavelet transforms
title Soft-Output Signal Detection for Cetacean Vocalizations Using Spectral Entropy, K-Means Clustering and the Continuous Wavelet Transform
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-13T22%3A40%3A47IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_arxiv&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Soft-Output%20Signal%20Detection%20for%20Cetacean%20Vocalizations%20Using%20Spectral%20Entropy,%20K-Means%20Clustering%20and%20the%20Continuous%20Wavelet%20Transform&rft.jtitle=arXiv.org&rft.au=Rademan,%20Marco%20W&rft.date=2022-11-22&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.2211.01065&rft_dat=%3Cproquest_arxiv%3E2731610795%3C/proquest_arxiv%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2731610795&rft_id=info:pmid/&rfr_iscdi=true