Sound Event Detection: A Journey Through DCASE Challenge Series

The sense of hearing is fundamental to human beings, as it allows them to perceive their surroundings. However, this simple task of recognizing different sounds in complex environments poses a challenge for machines. Sound event detection (SED) is a field that aims to automate the human auditory sys...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	APSIPA transactions on signal and information processing 2024-01, Vol.13 (1)
Hauptverfasser:	Khandelwal, Tanmay, Das, Rohan Kumar, Chng, Eng Siong
Format:	Artikel
Sprache:	eng
Schlagworte:	Business, Economics and Politics Economics
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue	1
container_start_page
container_title	APSIPA transactions on signal and information processing
container_volume	13
creator	Khandelwal, Tanmay Das, Rohan Kumar Chng, Eng Siong
description	The sense of hearing is fundamental to human beings, as it allows them to perceive their surroundings. However, this simple task of recognizing different sounds in complex environments poses a challenge for machines. Sound event detection (SED) is a field that aims to automate the human auditory system’s detection and recognition of sound events with their onset and offset points. Training an SED system typically requires a large labeled set, but is associated with high annotation costs and is dependent on the subjective judgments of annotators. Therefore, significant efforts have been made in this area, including the major DCASE challenge series, which brings researchers together annually to address this issue. The DCASE challenge was started in the year 2013, and it has evolved over the years to witness some significant breakthroughs in the field of SED. In this study, we delve into the methods proposed by various authors in the DCASE challenge series, providing a thorough discussion of feature extraction, machine learning techniques, and post-processing methods. We also study the results from top teams in each edition of the DCASE challenge to bring out the highlights of the best-performing SED systems and explore potential future research directions.
doi_str_mv	10.1561/116.00000051
format	Article
fullrecord	<record><control><sourceid>now_doaj_</sourceid><recordid>TN_cdi_now_journals_10_1561_116_00000051</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_81d50f1bd4874020a5369f2459d9adb3</doaj_id><sourcerecordid>SIP-2023-0051</sourcerecordid><originalsourceid>FETCH-LOGICAL-c329t-70edb371154124d4cb61d11df8f24512eb55a267bd3137946d9035392fad99c33</originalsourceid><addsrcrecordid>eNptkD1PwzAQhi0EEhV04wd4ZCDF58-YBVVpgaJKDC2z5cROmyrEyElB_fekLSAGvJx1evTc3YvQFZARCAm3AHJEDk_ACRpQwtNEKcJO__zP0bBtNz0CQIWWfIDuF2HbODz98E2HJ77zRVeF5g6P8XPYxsbv8HIdw3a1xpNsvJjibG3r2jcrjxc-Vr69RGelrVs__K4X6PVhusyekvnL4ywbz5OCUd0liniXMwUgOFDueJFLcACuTEvKBVCfC2GpVLljwJTm0mnCBNO0tE7rgrELNDt6XbAb8x6rNxt3JtjKHBohroyNXVXU3qTgBCkhdzxVnFBiBZN6P0Y7bfstetfN0VXE0LbRl78-IGafpemzND9Z9vj1EW_Cp9nsU-kP_R_9AluRbkU</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Sound Event Detection: A Journey Through DCASE Challenge Series</title><source>Cambridge Journals Open Access</source><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><creator>Khandelwal, Tanmay ; Das, Rohan Kumar ; Chng, Eng Siong</creator><creatorcontrib>Khandelwal, Tanmay ; Das, Rohan Kumar ; Chng, Eng Siong</creatorcontrib><description>The sense of hearing is fundamental to human beings, as it allows them to perceive their surroundings. However, this simple task of recognizing different sounds in complex environments poses a challenge for machines. Sound event detection (SED) is a field that aims to automate the human auditory system’s detection and recognition of sound events with their onset and offset points. Training an SED system typically requires a large labeled set, but is associated with high annotation costs and is dependent on the subjective judgments of annotators. Therefore, significant efforts have been made in this area, including the major DCASE challenge series, which brings researchers together annually to address this issue. The DCASE challenge was started in the year 2013, and it has evolved over the years to witness some significant breakthroughs in the field of SED. In this study, we delve into the methods proposed by various authors in the DCASE challenge series, providing a thorough discussion of feature extraction, machine learning techniques, and post-processing methods. We also study the results from top teams in each edition of the DCASE challenge to bring out the highlights of the best-performing SED systems and explore potential future research directions.</description><identifier>ISSN: 2048-7703</identifier><identifier>EISSN: 2048-7703</identifier><identifier>DOI: 10.1561/116.00000051</identifier><language>eng</language><publisher>Boston — Delft: Now Publishers</publisher><subject>Business, Economics and Politics ; Economics</subject><ispartof>APSIPA transactions on signal and information processing, 2024-01, Vol.13 (1)</ispartof><rights>2024 T. Khandelwal, R. K. Das and E. S. Chng</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,777,781,861,2096,27905,27906</link.rule.ids></links><search><creatorcontrib>Khandelwal, Tanmay</creatorcontrib><creatorcontrib>Das, Rohan Kumar</creatorcontrib><creatorcontrib>Chng, Eng Siong</creatorcontrib><title>Sound Event Detection: A Journey Through DCASE Challenge Series</title><title>APSIPA transactions on signal and information processing</title><addtitle>SIP</addtitle><description>The sense of hearing is fundamental to human beings, as it allows them to perceive their surroundings. However, this simple task of recognizing different sounds in complex environments poses a challenge for machines. Sound event detection (SED) is a field that aims to automate the human auditory system’s detection and recognition of sound events with their onset and offset points. Training an SED system typically requires a large labeled set, but is associated with high annotation costs and is dependent on the subjective judgments of annotators. Therefore, significant efforts have been made in this area, including the major DCASE challenge series, which brings researchers together annually to address this issue. The DCASE challenge was started in the year 2013, and it has evolved over the years to witness some significant breakthroughs in the field of SED. In this study, we delve into the methods proposed by various authors in the DCASE challenge series, providing a thorough discussion of feature extraction, machine learning techniques, and post-processing methods. We also study the results from top teams in each edition of the DCASE challenge to bring out the highlights of the best-performing SED systems and explore potential future research directions.</description><subject>Business, Economics and Politics</subject><subject>Economics</subject><issn>2048-7703</issn><issn>2048-7703</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>NOJ</sourceid><sourceid>DOA</sourceid><recordid>eNptkD1PwzAQhi0EEhV04wd4ZCDF58-YBVVpgaJKDC2z5cROmyrEyElB_fekLSAGvJx1evTc3YvQFZARCAm3AHJEDk_ACRpQwtNEKcJO__zP0bBtNz0CQIWWfIDuF2HbODz98E2HJ77zRVeF5g6P8XPYxsbv8HIdw3a1xpNsvJjibG3r2jcrjxc-Vr69RGelrVs__K4X6PVhusyekvnL4ywbz5OCUd0liniXMwUgOFDueJFLcACuTEvKBVCfC2GpVLljwJTm0mnCBNO0tE7rgrELNDt6XbAb8x6rNxt3JtjKHBohroyNXVXU3qTgBCkhdzxVnFBiBZN6P0Y7bfstetfN0VXE0LbRl78-IGafpemzND9Z9vj1EW_Cp9nsU-kP_R_9AluRbkU</recordid><startdate>20240101</startdate><enddate>20240101</enddate><creator>Khandelwal, Tanmay</creator><creator>Das, Rohan Kumar</creator><creator>Chng, Eng Siong</creator><general>Now Publishers</general><scope>NOJ</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>DOA</scope></search><sort><creationdate>20240101</creationdate><title>Sound Event Detection: A Journey Through DCASE Challenge Series</title><author>Khandelwal, Tanmay ; Das, Rohan Kumar ; Chng, Eng Siong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c329t-70edb371154124d4cb61d11df8f24512eb55a267bd3137946d9035392fad99c33</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Business, Economics and Politics</topic><topic>Economics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Khandelwal, Tanmay</creatorcontrib><creatorcontrib>Das, Rohan Kumar</creatorcontrib><creatorcontrib>Chng, Eng Siong</creatorcontrib><collection>Now Publishers Journals</collection><collection>CrossRef</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>APSIPA transactions on signal and information processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Khandelwal, Tanmay</au><au>Das, Rohan Kumar</au><au>Chng, Eng Siong</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Sound Event Detection: A Journey Through DCASE Challenge Series</atitle><jtitle>APSIPA transactions on signal and information processing</jtitle><addtitle>SIP</addtitle><date>2024-01-01</date><risdate>2024</risdate><volume>13</volume><issue>1</issue><issn>2048-7703</issn><eissn>2048-7703</eissn><abstract>The sense of hearing is fundamental to human beings, as it allows them to perceive their surroundings. However, this simple task of recognizing different sounds in complex environments poses a challenge for machines. Sound event detection (SED) is a field that aims to automate the human auditory system’s detection and recognition of sound events with their onset and offset points. Training an SED system typically requires a large labeled set, but is associated with high annotation costs and is dependent on the subjective judgments of annotators. Therefore, significant efforts have been made in this area, including the major DCASE challenge series, which brings researchers together annually to address this issue. The DCASE challenge was started in the year 2013, and it has evolved over the years to witness some significant breakthroughs in the field of SED. In this study, we delve into the methods proposed by various authors in the DCASE challenge series, providing a thorough discussion of feature extraction, machine learning techniques, and post-processing methods. We also study the results from top teams in each edition of the DCASE challenge to bring out the highlights of the best-performing SED systems and explore potential future research directions.</abstract><cop>Boston — Delft</cop><pub>Now Publishers</pub><doi>10.1561/116.00000051</doi><tpages>63</tpages><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 2048-7703
ispartof	APSIPA transactions on signal and information processing, 2024-01, Vol.13 (1)
issn	2048-7703 2048-7703
language	eng
recordid	cdi_now_journals_10_1561_116_00000051
source	Cambridge Journals Open Access; DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals
subjects	Business, Economics and Politics Economics
title	Sound Event Detection: A Journey Through DCASE Challenge Series
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T16%3A53%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-now_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Sound%20Event%20Detection:%20A%20Journey%20Through%20DCASE%20Challenge%20Series&rft.jtitle=APSIPA%20transactions%20on%20signal%20and%20information%20processing&rft.au=Khandelwal,%20Tanmay&rft.date=2024-01-01&rft.volume=13&rft.issue=1&rft.issn=2048-7703&rft.eissn=2048-7703&rft_id=info:doi/10.1561/116.00000051&rft_dat=%3Cnow_doaj_%3ESIP-2023-0051%3C/now_doaj_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_doaj_id=oai_doaj_org_article_81d50f1bd4874020a5369f2459d9adb3&rfr_iscdi=true