Sound Event Detection: A Journey Through DCASE Challenge Series
The sense of hearing is fundamental to human beings, as it allows them to perceive their surroundings. However, this simple task of recognizing different sounds in complex environments poses a challenge for machines. Sound event detection (SED) is a field that aims to automate the human auditory sys...
Gespeichert in:
Veröffentlicht in: | APSIPA transactions on signal and information processing 2024-01, Vol.13 (1) |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | 1 |
container_start_page | |
container_title | APSIPA transactions on signal and information processing |
container_volume | 13 |
creator | Khandelwal, Tanmay Das, Rohan Kumar Chng, Eng Siong |
description | The sense of hearing is fundamental to human beings, as it allows them to perceive their surroundings. However, this simple task of recognizing different sounds in complex environments poses a challenge for machines. Sound event detection (SED) is a field that aims to automate the human auditory system’s detection and recognition of sound events with their onset and offset points. Training an SED system typically requires a large labeled set, but is associated with high annotation costs and is dependent on the subjective judgments of annotators. Therefore, significant efforts have been made in this area, including the major DCASE challenge series, which brings researchers together annually to address this issue. The DCASE challenge was started in the year 2013, and it has evolved over the years to witness some significant breakthroughs in the field of SED. In this study, we delve into the methods proposed by various authors in the DCASE challenge series, providing a thorough discussion of feature extraction, machine learning techniques, and post-processing methods. We also study the results from top teams in each edition of the DCASE challenge to bring out the highlights of the best-performing SED systems and explore potential future research directions. |
doi_str_mv | 10.1561/116.00000051 |
format | Article |
fullrecord | <record><control><sourceid>now_doaj_</sourceid><recordid>TN_cdi_now_journals_10_1561_116_00000051</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><doaj_id>oai_doaj_org_article_81d50f1bd4874020a5369f2459d9adb3</doaj_id><sourcerecordid>SIP-2023-0051</sourcerecordid><originalsourceid>FETCH-LOGICAL-c329t-70edb371154124d4cb61d11df8f24512eb55a267bd3137946d9035392fad99c33</originalsourceid><addsrcrecordid>eNptkD1PwzAQhi0EEhV04wd4ZCDF58-YBVVpgaJKDC2z5cROmyrEyElB_fekLSAGvJx1evTc3YvQFZARCAm3AHJEDk_ACRpQwtNEKcJO__zP0bBtNz0CQIWWfIDuF2HbODz98E2HJ77zRVeF5g6P8XPYxsbv8HIdw3a1xpNsvJjibG3r2jcrjxc-Vr69RGelrVs__K4X6PVhusyekvnL4ywbz5OCUd0liniXMwUgOFDueJFLcACuTEvKBVCfC2GpVLljwJTm0mnCBNO0tE7rgrELNDt6XbAb8x6rNxt3JtjKHBohroyNXVXU3qTgBCkhdzxVnFBiBZN6P0Y7bfstetfN0VXE0LbRl78-IGafpemzND9Z9vj1EW_Cp9nsU-kP_R_9AluRbkU</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Sound Event Detection: A Journey Through DCASE Challenge Series</title><source>Cambridge Journals Open Access</source><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><creator>Khandelwal, Tanmay ; Das, Rohan Kumar ; Chng, Eng Siong</creator><creatorcontrib>Khandelwal, Tanmay ; Das, Rohan Kumar ; Chng, Eng Siong</creatorcontrib><description>The sense of hearing is fundamental to human beings, as it allows them to perceive their surroundings. However, this simple task of recognizing different sounds in complex environments poses a challenge for machines. Sound event detection (SED) is a field that aims to automate the human auditory system’s detection and recognition of sound events with their onset and offset points. Training an SED system typically requires a large labeled set, but is associated with high annotation costs and is dependent on the subjective judgments of annotators. Therefore, significant efforts have been made in this area, including the major DCASE challenge series, which brings researchers together annually to address this issue. The DCASE challenge was started in the year 2013, and it has evolved over the years to witness some significant breakthroughs in the field of SED. In this study, we delve into the methods proposed by various authors in the DCASE challenge series, providing a thorough discussion of feature extraction, machine learning techniques, and post-processing methods. We also study the results from top teams in each edition of the DCASE challenge to bring out the highlights of the best-performing SED systems and explore potential future research directions.</description><identifier>ISSN: 2048-7703</identifier><identifier>EISSN: 2048-7703</identifier><identifier>DOI: 10.1561/116.00000051</identifier><language>eng</language><publisher>Boston — Delft: Now Publishers</publisher><subject>Business, Economics and Politics ; Economics</subject><ispartof>APSIPA transactions on signal and information processing, 2024-01, Vol.13 (1)</ispartof><rights>2024 T. Khandelwal, R. K. Das and E. S. Chng</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,777,781,861,2096,27905,27906</link.rule.ids></links><search><creatorcontrib>Khandelwal, Tanmay</creatorcontrib><creatorcontrib>Das, Rohan Kumar</creatorcontrib><creatorcontrib>Chng, Eng Siong</creatorcontrib><title>Sound Event Detection: A Journey Through DCASE Challenge Series</title><title>APSIPA transactions on signal and information processing</title><addtitle>SIP</addtitle><description>The sense of hearing is fundamental to human beings, as it allows them to perceive their surroundings. However, this simple task of recognizing different sounds in complex environments poses a challenge for machines. Sound event detection (SED) is a field that aims to automate the human auditory system’s detection and recognition of sound events with their onset and offset points. Training an SED system typically requires a large labeled set, but is associated with high annotation costs and is dependent on the subjective judgments of annotators. Therefore, significant efforts have been made in this area, including the major DCASE challenge series, which brings researchers together annually to address this issue. The DCASE challenge was started in the year 2013, and it has evolved over the years to witness some significant breakthroughs in the field of SED. In this study, we delve into the methods proposed by various authors in the DCASE challenge series, providing a thorough discussion of feature extraction, machine learning techniques, and post-processing methods. We also study the results from top teams in each edition of the DCASE challenge to bring out the highlights of the best-performing SED systems and explore potential future research directions.</description><subject>Business, Economics and Politics</subject><subject>Economics</subject><issn>2048-7703</issn><issn>2048-7703</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>NOJ</sourceid><sourceid>DOA</sourceid><recordid>eNptkD1PwzAQhi0EEhV04wd4ZCDF58-YBVVpgaJKDC2z5cROmyrEyElB_fekLSAGvJx1evTc3YvQFZARCAm3AHJEDk_ACRpQwtNEKcJO__zP0bBtNz0CQIWWfIDuF2HbODz98E2HJ77zRVeF5g6P8XPYxsbv8HIdw3a1xpNsvJjibG3r2jcrjxc-Vr69RGelrVs__K4X6PVhusyekvnL4ywbz5OCUd0liniXMwUgOFDueJFLcACuTEvKBVCfC2GpVLljwJTm0mnCBNO0tE7rgrELNDt6XbAb8x6rNxt3JtjKHBohroyNXVXU3qTgBCkhdzxVnFBiBZN6P0Y7bfstetfN0VXE0LbRl78-IGafpemzND9Z9vj1EW_Cp9nsU-kP_R_9AluRbkU</recordid><startdate>20240101</startdate><enddate>20240101</enddate><creator>Khandelwal, Tanmay</creator><creator>Das, Rohan Kumar</creator><creator>Chng, Eng Siong</creator><general>Now Publishers</general><scope>NOJ</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>DOA</scope></search><sort><creationdate>20240101</creationdate><title>Sound Event Detection: A Journey Through DCASE Challenge Series</title><author>Khandelwal, Tanmay ; Das, Rohan Kumar ; Chng, Eng Siong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c329t-70edb371154124d4cb61d11df8f24512eb55a267bd3137946d9035392fad99c33</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Business, Economics and Politics</topic><topic>Economics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Khandelwal, Tanmay</creatorcontrib><creatorcontrib>Das, Rohan Kumar</creatorcontrib><creatorcontrib>Chng, Eng Siong</creatorcontrib><collection>Now Publishers Journals</collection><collection>CrossRef</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>APSIPA transactions on signal and information processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Khandelwal, Tanmay</au><au>Das, Rohan Kumar</au><au>Chng, Eng Siong</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Sound Event Detection: A Journey Through DCASE Challenge Series</atitle><jtitle>APSIPA transactions on signal and information processing</jtitle><addtitle>SIP</addtitle><date>2024-01-01</date><risdate>2024</risdate><volume>13</volume><issue>1</issue><issn>2048-7703</issn><eissn>2048-7703</eissn><abstract>The sense of hearing is fundamental to human beings, as it allows them to perceive their surroundings. However, this simple task of recognizing different sounds in complex environments poses a challenge for machines. Sound event detection (SED) is a field that aims to automate the human auditory system’s detection and recognition of sound events with their onset and offset points. Training an SED system typically requires a large labeled set, but is associated with high annotation costs and is dependent on the subjective judgments of annotators. Therefore, significant efforts have been made in this area, including the major DCASE challenge series, which brings researchers together annually to address this issue. The DCASE challenge was started in the year 2013, and it has evolved over the years to witness some significant breakthroughs in the field of SED. In this study, we delve into the methods proposed by various authors in the DCASE challenge series, providing a thorough discussion of feature extraction, machine learning techniques, and post-processing methods. We also study the results from top teams in each edition of the DCASE challenge to bring out the highlights of the best-performing SED systems and explore potential future research directions.</abstract><cop>Boston — Delft</cop><pub>Now Publishers</pub><doi>10.1561/116.00000051</doi><tpages>63</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2048-7703 |
ispartof | APSIPA transactions on signal and information processing, 2024-01, Vol.13 (1) |
issn | 2048-7703 2048-7703 |
language | eng |
recordid | cdi_now_journals_10_1561_116_00000051 |
source | Cambridge Journals Open Access; DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals |
subjects | Business, Economics and Politics Economics |
title | Sound Event Detection: A Journey Through DCASE Challenge Series |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T16%3A53%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-now_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Sound%20Event%20Detection:%20A%20Journey%20Through%20DCASE%20Challenge%20Series&rft.jtitle=APSIPA%20transactions%20on%20signal%20and%20information%20processing&rft.au=Khandelwal,%20Tanmay&rft.date=2024-01-01&rft.volume=13&rft.issue=1&rft.issn=2048-7703&rft.eissn=2048-7703&rft_id=info:doi/10.1561/116.00000051&rft_dat=%3Cnow_doaj_%3ESIP-2023-0051%3C/now_doaj_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_doaj_id=oai_doaj_org_article_81d50f1bd4874020a5369f2459d9adb3&rfr_iscdi=true |