Sound Event Detection: A Journey Through DCASE Challenge Series

The sense of hearing is fundamental to human beings, as it allows them to perceive their surroundings. However, this simple task of recognizing different sounds in complex environments poses a challenge for machines. Sound event detection (SED) is a field that aims to automate the human auditory sys...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:APSIPA transactions on signal and information processing 2024-01, Vol.13 (1)
Hauptverfasser: Khandelwal, Tanmay, Das, Rohan Kumar, Chng, Eng Siong
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 1
container_start_page
container_title APSIPA transactions on signal and information processing
container_volume 13
creator Khandelwal, Tanmay
Das, Rohan Kumar
Chng, Eng Siong
description The sense of hearing is fundamental to human beings, as it allows them to perceive their surroundings. However, this simple task of recognizing different sounds in complex environments poses a challenge for machines. Sound event detection (SED) is a field that aims to automate the human auditory system’s detection and recognition of sound events with their onset and offset points. Training an SED system typically requires a large labeled set, but is associated with high annotation costs and is dependent on the subjective judgments of annotators. Therefore, significant efforts have been made in this area, including the major DCASE challenge series, which brings researchers together annually to address this issue. The DCASE challenge was started in the year 2013, and it has evolved over the years to witness some significant breakthroughs in the field of SED. In this study, we delve into the methods proposed by various authors in the DCASE challenge series, providing a thorough discussion of feature extraction, machine learning techniques, and post-processing methods. We also study the results from top teams in each edition of the DCASE challenge to bring out the highlights of the best-performing SED systems and explore potential future research directions.
doi_str_mv 10.1561/116.00000051
format Article
fullrecord <record><control><sourceid>now_doaj_</sourceid><recordid>TN_cdi_now_journals_10_1561_116_00000051</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_81d50f1bd4874020a5369f2459d9adb3</doaj_id><sourcerecordid>SIP-2023-0051</sourcerecordid><originalsourceid>FETCH-LOGICAL-c329t-70edb371154124d4cb61d11df8f24512eb55a267bd3137946d9035392fad99c33</originalsourceid><addsrcrecordid>eNptkD1PwzAQhi0EEhV04wd4ZCDF58-YBVVpgaJKDC2z5cROmyrEyElB_fekLSAGvJx1evTc3YvQFZARCAm3AHJEDk_ACRpQwtNEKcJO__zP0bBtNz0CQIWWfIDuF2HbODz98E2HJ77zRVeF5g6P8XPYxsbv8HIdw3a1xpNsvJjibG3r2jcrjxc-Vr69RGelrVs__K4X6PVhusyekvnL4ywbz5OCUd0liniXMwUgOFDueJFLcACuTEvKBVCfC2GpVLljwJTm0mnCBNO0tE7rgrELNDt6XbAb8x6rNxt3JtjKHBohroyNXVXU3qTgBCkhdzxVnFBiBZN6P0Y7bfstetfN0VXE0LbRl78-IGafpemzND9Z9vj1EW_Cp9nsU-kP_R_9AluRbkU</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Sound Event Detection: A Journey Through DCASE Challenge Series</title><source>Cambridge Journals Open Access</source><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><creator>Khandelwal, Tanmay ; Das, Rohan Kumar ; Chng, Eng Siong</creator><creatorcontrib>Khandelwal, Tanmay ; Das, Rohan Kumar ; Chng, Eng Siong</creatorcontrib><description>The sense of hearing is fundamental to human beings, as it allows them to perceive their surroundings. However, this simple task of recognizing different sounds in complex environments poses a challenge for machines. Sound event detection (SED) is a field that aims to automate the human auditory system’s detection and recognition of sound events with their onset and offset points. Training an SED system typically requires a large labeled set, but is associated with high annotation costs and is dependent on the subjective judgments of annotators. Therefore, significant efforts have been made in this area, including the major DCASE challenge series, which brings researchers together annually to address this issue. The DCASE challenge was started in the year 2013, and it has evolved over the years to witness some significant breakthroughs in the field of SED. In this study, we delve into the methods proposed by various authors in the DCASE challenge series, providing a thorough discussion of feature extraction, machine learning techniques, and post-processing methods. We also study the results from top teams in each edition of the DCASE challenge to bring out the highlights of the best-performing SED systems and explore potential future research directions.</description><identifier>ISSN: 2048-7703</identifier><identifier>EISSN: 2048-7703</identifier><identifier>DOI: 10.1561/116.00000051</identifier><language>eng</language><publisher>Boston — Delft: Now Publishers</publisher><subject>Business, Economics and Politics ; Economics</subject><ispartof>APSIPA transactions on signal and information processing, 2024-01, Vol.13 (1)</ispartof><rights>2024 T. Khandelwal, R. K. Das and E. S. Chng</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,777,781,861,2096,27905,27906</link.rule.ids></links><search><creatorcontrib>Khandelwal, Tanmay</creatorcontrib><creatorcontrib>Das, Rohan Kumar</creatorcontrib><creatorcontrib>Chng, Eng Siong</creatorcontrib><title>Sound Event Detection: A Journey Through DCASE Challenge Series</title><title>APSIPA transactions on signal and information processing</title><addtitle>SIP</addtitle><description>The sense of hearing is fundamental to human beings, as it allows them to perceive their surroundings. However, this simple task of recognizing different sounds in complex environments poses a challenge for machines. Sound event detection (SED) is a field that aims to automate the human auditory system’s detection and recognition of sound events with their onset and offset points. Training an SED system typically requires a large labeled set, but is associated with high annotation costs and is dependent on the subjective judgments of annotators. Therefore, significant efforts have been made in this area, including the major DCASE challenge series, which brings researchers together annually to address this issue. The DCASE challenge was started in the year 2013, and it has evolved over the years to witness some significant breakthroughs in the field of SED. In this study, we delve into the methods proposed by various authors in the DCASE challenge series, providing a thorough discussion of feature extraction, machine learning techniques, and post-processing methods. We also study the results from top teams in each edition of the DCASE challenge to bring out the highlights of the best-performing SED systems and explore potential future research directions.</description><subject>Business, Economics and Politics</subject><subject>Economics</subject><issn>2048-7703</issn><issn>2048-7703</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>NOJ</sourceid><sourceid>DOA</sourceid><recordid>eNptkD1PwzAQhi0EEhV04wd4ZCDF58-YBVVpgaJKDC2z5cROmyrEyElB_fekLSAGvJx1evTc3YvQFZARCAm3AHJEDk_ACRpQwtNEKcJO__zP0bBtNz0CQIWWfIDuF2HbODz98E2HJ77zRVeF5g6P8XPYxsbv8HIdw3a1xpNsvJjibG3r2jcrjxc-Vr69RGelrVs__K4X6PVhusyekvnL4ywbz5OCUd0liniXMwUgOFDueJFLcACuTEvKBVCfC2GpVLljwJTm0mnCBNO0tE7rgrELNDt6XbAb8x6rNxt3JtjKHBohroyNXVXU3qTgBCkhdzxVnFBiBZN6P0Y7bfstetfN0VXE0LbRl78-IGafpemzND9Z9vj1EW_Cp9nsU-kP_R_9AluRbkU</recordid><startdate>20240101</startdate><enddate>20240101</enddate><creator>Khandelwal, Tanmay</creator><creator>Das, Rohan Kumar</creator><creator>Chng, Eng Siong</creator><general>Now Publishers</general><scope>NOJ</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>DOA</scope></search><sort><creationdate>20240101</creationdate><title>Sound Event Detection: A Journey Through DCASE Challenge Series</title><author>Khandelwal, Tanmay ; Das, Rohan Kumar ; Chng, Eng Siong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c329t-70edb371154124d4cb61d11df8f24512eb55a267bd3137946d9035392fad99c33</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Business, Economics and Politics</topic><topic>Economics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Khandelwal, Tanmay</creatorcontrib><creatorcontrib>Das, Rohan Kumar</creatorcontrib><creatorcontrib>Chng, Eng Siong</creatorcontrib><collection>Now Publishers Journals</collection><collection>CrossRef</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>APSIPA transactions on signal and information processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Khandelwal, Tanmay</au><au>Das, Rohan Kumar</au><au>Chng, Eng Siong</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Sound Event Detection: A Journey Through DCASE Challenge Series</atitle><jtitle>APSIPA transactions on signal and information processing</jtitle><addtitle>SIP</addtitle><date>2024-01-01</date><risdate>2024</risdate><volume>13</volume><issue>1</issue><issn>2048-7703</issn><eissn>2048-7703</eissn><abstract>The sense of hearing is fundamental to human beings, as it allows them to perceive their surroundings. However, this simple task of recognizing different sounds in complex environments poses a challenge for machines. Sound event detection (SED) is a field that aims to automate the human auditory system’s detection and recognition of sound events with their onset and offset points. Training an SED system typically requires a large labeled set, but is associated with high annotation costs and is dependent on the subjective judgments of annotators. Therefore, significant efforts have been made in this area, including the major DCASE challenge series, which brings researchers together annually to address this issue. The DCASE challenge was started in the year 2013, and it has evolved over the years to witness some significant breakthroughs in the field of SED. In this study, we delve into the methods proposed by various authors in the DCASE challenge series, providing a thorough discussion of feature extraction, machine learning techniques, and post-processing methods. We also study the results from top teams in each edition of the DCASE challenge to bring out the highlights of the best-performing SED systems and explore potential future research directions.</abstract><cop>Boston — Delft</cop><pub>Now Publishers</pub><doi>10.1561/116.00000051</doi><tpages>63</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2048-7703
ispartof APSIPA transactions on signal and information processing, 2024-01, Vol.13 (1)
issn 2048-7703
2048-7703
language eng
recordid cdi_now_journals_10_1561_116_00000051
source Cambridge Journals Open Access; DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals
subjects Business, Economics and Politics
Economics
title Sound Event Detection: A Journey Through DCASE Challenge Series
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T16%3A53%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-now_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Sound%20Event%20Detection:%20A%20Journey%20Through%20DCASE%20Challenge%20Series&rft.jtitle=APSIPA%20transactions%20on%20signal%20and%20information%20processing&rft.au=Khandelwal,%20Tanmay&rft.date=2024-01-01&rft.volume=13&rft.issue=1&rft.issn=2048-7703&rft.eissn=2048-7703&rft_id=info:doi/10.1561/116.00000051&rft_dat=%3Cnow_doaj_%3ESIP-2023-0051%3C/now_doaj_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_doaj_id=oai_doaj_org_article_81d50f1bd4874020a5369f2459d9adb3&rfr_iscdi=true