Subjective and Objective Quality Assessment of Audio Source Separation

We aim to assess the perceived quality of estimated source signals in the context of audio source separation. These signals may involve one or more kinds of distortions, including distortion of the target source, interference from the other sources or musical noise artifacts. We propose a subjective...

Bibliographic details
Published in: IEEE transactions on audio, speech, and language processing, 2011-09, Vol.19 (7), p.2046-2057
Main authors: Emiya, V., Vincent, E., Harlander, N., Hohmann, V.
Format: Article
Language: eng
container_end_page 2057
container_issue 7
container_start_page 2046
container_title IEEE transactions on audio, speech, and language processing
container_volume 19
creator Emiya, V.
Vincent, E.
Harlander, N.
Hohmann, V.
description We aim to assess the perceived quality of estimated source signals in the context of audio source separation. These signals may involve one or more kinds of distortions, including distortion of the target source, interference from the other sources or musical noise artifacts. We propose a subjective test protocol to assess the perceived quality with respect to each kind of distortion and collect the scores of 20 subjects over 80 sounds. We then propose a family of objective measures aiming to predict these subjective scores based on the decomposition of the estimation error into several distortion components and on the use of the PEMO-Q perceptual salience measure to provide multiple features that are then combined. These measures increase correlation with subjective scores up to 0.5 compared to nonlinear mapping of individual state-of-the-art source separation measures. Finally, we released the data and code presented in this paper in a freely available toolkit called PEASS.
doi_str_mv 10.1109/TASL.2011.2109381
format Article
fullrecord <record><control><sourceid>hal_RIE</sourceid><recordid>TN_cdi_crossref_primary_10_1109_TASL_2011_2109381</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>5704564</ieee_id><sourcerecordid>oai_HAL_inria_00567152v1</sourcerecordid><originalsourceid>FETCH-LOGICAL-c440t-8a3dbe979b3a836244b8148d501a3db6ddbdb766d76d800c560f7965ff1c90e63</originalsourceid><addsrcrecordid>eNo9kE9Lw0AQxRdRsFY_gHjJxZO07iT7L8dQrBUCRVLPy2Z3g1vSpOwmBb-9CSk5zTzee8PwQ-gZ8BoAp--HrMjXMQZYx4NMBNygBVAqVjyNye28A7tHDyEcMSYJI7BA26Ivj1Z37mIj1ZhoP6vvXtWu-4uyEGwIJ9t0UVtFWW9cGxVt77WNCntWXnWubR7RXaXqYJ-uc4l-th-HzW6V7z-_Nlm-0oTgbiVUYkqb8rRMlEhYTEgpgAhDMYwOM6Y0JWfMcGYExpoyXPGU0aoCnWLLkiV6m-7-qlqevTsp_ydb5eQuy6VrvFMSY8o40PgCQxqmtPZtCN5WcwWwHLHJEZscsckrtqHzOnXOKmhVV1412oW5OLwsiGB8yL1MOWetnW3KMaGMJP9QKXVZ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Subjective and Objective Quality Assessment of Audio Source Separation</title><source>IEEE Electronic Library (IEL)</source><creator>Emiya, V. ; Vincent, E. ; Harlander, N. ; Hohmann, V.</creator><creatorcontrib>Emiya, V. ; Vincent, E. ; Harlander, N. ; Hohmann, V.</creatorcontrib><description>We aim to assess the perceived quality of estimated source signals in the context of audio source separation. These signals may involve one or more kinds of distortions, including distortion of the target source, interference from the other sources or musical noise artifacts. We propose a subjective test protocol to assess the perceived quality with respect to each kind of distortion and collect the scores of 20 subjects over 80 sounds. We then propose a family of objective measures aiming to predict these subjective scores based on the decomposition of the estimation error into several distortion components and on the use of the PEMO-Q perceptual salience measure to provide multiple features that are then combined. 
These measures increase correlation with subjective scores up to 0.5 compared to nonlinear mapping of individual state-of-the-art source separation measures. Finally, we released the data and code presented in this paper in a freely available toolkit called PEASS.</description><identifier>ISSN: 1558-7916</identifier><identifier>EISSN: 1558-7924</identifier><identifier>DOI: 10.1109/TASL.2011.2109381</identifier><identifier>CODEN: ITASD8</identifier><language>eng</language><publisher>Piscataway, NJ: IEEE</publisher><subject>Applied sciences ; Audio ; Computer Science ; Detection, estimation, filtering, equalization, prediction ; Distortion measurement ; Engineering Sciences ; Exact sciences and technology ; Information, signal and communications theory ; Noise ; Nonlinear distortion ; objective measure ; Protocols ; quality assessment ; Signal and communications theory ; Signal and Image Processing ; Signal, noise ; Source separation ; Speech ; subjective test protocol ; Telecommunications and information theory</subject><ispartof>IEEE transactions on audio, speech, and language processing, 2011-09, Vol.19 (7), p.2046-2057</ispartof><rights>2015 INIST-CNRS</rights><rights>Distributed under a Creative Commons Attribution 4.0 International License</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c440t-8a3dbe979b3a836244b8148d501a3db6ddbdb766d76d800c560f7965ff1c90e63</citedby><cites>FETCH-LOGICAL-c440t-8a3dbe979b3a836244b8148d501a3db6ddbdb766d76d800c560f7965ff1c90e63</cites><orcidid>0000-0001-7102-6943 ; 
0000-0002-0183-7289</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/5704564$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>230,314,776,780,792,881,27901,27902,54733</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/5704564$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=24484867$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://inria.hal.science/inria-00567152$$DView record in HAL$$Hfree_for_read</backlink></links><search><creatorcontrib>Emiya, V.</creatorcontrib><creatorcontrib>Vincent, E.</creatorcontrib><creatorcontrib>Harlander, N.</creatorcontrib><creatorcontrib>Hohmann, V.</creatorcontrib><title>Subjective and Objective Quality Assessment of Audio Source Separation</title><title>IEEE transactions on audio, speech, and language processing</title><addtitle>TASL</addtitle><description>We aim to assess the perceived quality of estimated source signals in the context of audio source separation. These signals may involve one or more kinds of distortions, including distortion of the target source, interference from the other sources or musical noise artifacts. We propose a subjective test protocol to assess the perceived quality with respect to each kind of distortion and collect the scores of 20 subjects over 80 sounds. We then propose a family of objective measures aiming to predict these subjective scores based on the decomposition of the estimation error into several distortion components and on the use of the PEMO-Q perceptual salience measure to provide multiple features that are then combined. 
These measures increase correlation with subjective scores up to 0.5 compared to nonlinear mapping of individual state-of-the-art source separation measures. Finally, we released the data and code presented in this paper in a freely available toolkit called PEASS.</description><subject>Applied sciences</subject><subject>Audio</subject><subject>Computer Science</subject><subject>Detection, estimation, filtering, equalization, prediction</subject><subject>Distortion measurement</subject><subject>Engineering Sciences</subject><subject>Exact sciences and technology</subject><subject>Information, signal and communications theory</subject><subject>Noise</subject><subject>Nonlinear distortion</subject><subject>objective measure</subject><subject>Protocols</subject><subject>quality assessment</subject><subject>Signal and communications theory</subject><subject>Signal and Image Processing</subject><subject>Signal, noise</subject><subject>Source separation</subject><subject>Speech</subject><subject>subjective test protocol</subject><subject>Telecommunications and information theory</subject><issn>1558-7916</issn><issn>1558-7924</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2011</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kE9Lw0AQxRdRsFY_gHjJxZO07iT7L8dQrBUCRVLPy2Z3g1vSpOwmBb-9CSk5zTzee8PwQ-gZ8BoAp--HrMjXMQZYx4NMBNygBVAqVjyNye28A7tHDyEcMSYJI7BA26Ivj1Z37mIj1ZhoP6vvXtWu-4uyEGwIJ9t0UVtFWW9cGxVt77WNCntWXnWubR7RXaXqYJ-uc4l-th-HzW6V7z-_Nlm-0oTgbiVUYkqb8rRMlEhYTEgpgAhDMYwOM6Y0JWfMcGYExpoyXPGU0aoCnWLLkiV6m-7-qlqevTsp_ydb5eQuy6VrvFMSY8o40PgCQxqmtPZtCN5WcwWwHLHJEZscsckrtqHzOnXOKmhVV1412oW5OLwsiGB8yL1MOWetnW3KMaGMJP9QKXVZ</recordid><startdate>20110901</startdate><enddate>20110901</enddate><creator>Emiya, V.</creator><creator>Vincent, E.</creator><creator>Harlander, N.</creator><creator>Hohmann, V.</creator><general>IEEE</general><general>Institute of Electrical and Electronics 
Engineers</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>1XC</scope><scope>VOOES</scope><orcidid>https://orcid.org/0000-0001-7102-6943</orcidid><orcidid>https://orcid.org/0000-0002-0183-7289</orcidid></search><sort><creationdate>20110901</creationdate><title>Subjective and Objective Quality Assessment of Audio Source Separation</title><author>Emiya, V. ; Vincent, E. ; Harlander, N. ; Hohmann, V.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c440t-8a3dbe979b3a836244b8148d501a3db6ddbdb766d76d800c560f7965ff1c90e63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2011</creationdate><topic>Applied sciences</topic><topic>Audio</topic><topic>Computer Science</topic><topic>Detection, estimation, filtering, equalization, prediction</topic><topic>Distortion measurement</topic><topic>Engineering Sciences</topic><topic>Exact sciences and technology</topic><topic>Information, signal and communications theory</topic><topic>Noise</topic><topic>Nonlinear distortion</topic><topic>objective measure</topic><topic>Protocols</topic><topic>quality assessment</topic><topic>Signal and communications theory</topic><topic>Signal and Image Processing</topic><topic>Signal, noise</topic><topic>Source separation</topic><topic>Speech</topic><topic>subjective test protocol</topic><topic>Telecommunications and information theory</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Emiya, V.</creatorcontrib><creatorcontrib>Vincent, E.</creatorcontrib><creatorcontrib>Harlander, N.</creatorcontrib><creatorcontrib>Hohmann, V.</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library 
(IEL)</collection><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Hyper Article en Ligne (HAL)</collection><collection>Hyper Article en Ligne (HAL) (Open Access)</collection><jtitle>IEEE transactions on audio, speech, and language processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Emiya, V.</au><au>Vincent, E.</au><au>Harlander, N.</au><au>Hohmann, V.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Subjective and Objective Quality Assessment of Audio Source Separation</atitle><jtitle>IEEE transactions on audio, speech, and language processing</jtitle><stitle>TASL</stitle><date>2011-09-01</date><risdate>2011</risdate><volume>19</volume><issue>7</issue><spage>2046</spage><epage>2057</epage><pages>2046-2057</pages><issn>1558-7916</issn><eissn>1558-7924</eissn><coden>ITASD8</coden><abstract>We aim to assess the perceived quality of estimated source signals in the context of audio source separation. These signals may involve one or more kinds of distortions, including distortion of the target source, interference from the other sources or musical noise artifacts. We propose a subjective test protocol to assess the perceived quality with respect to each kind of distortion and collect the scores of 20 subjects over 80 sounds. We then propose a family of objective measures aiming to predict these subjective scores based on the decomposition of the estimation error into several distortion components and on the use of the PEMO-Q perceptual salience measure to provide multiple features that are then combined. These measures increase correlation with subjective scores up to 0.5 compared to nonlinear mapping of individual state-of-the-art source separation measures. 
Finally, we released the data and code presented in this paper in a freely available toolkit called PEASS.</abstract><cop>Piscataway, NJ</cop><pub>IEEE</pub><doi>10.1109/TASL.2011.2109381</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0001-7102-6943</orcidid><orcidid>https://orcid.org/0000-0002-0183-7289</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1558-7916
ispartof IEEE transactions on audio, speech, and language processing, 2011-09, Vol.19 (7), p.2046-2057
issn 1558-7916
1558-7924
language eng
recordid cdi_crossref_primary_10_1109_TASL_2011_2109381
source IEEE Electronic Library (IEL)
subjects Applied sciences
Audio
Computer Science
Detection, estimation, filtering, equalization, prediction
Distortion measurement
Engineering Sciences
Exact sciences and technology
Information, signal and communications theory
Noise
Nonlinear distortion
objective measure
Protocols
quality assessment
Signal and communications theory
Signal and Image Processing
Signal, noise
Source separation
Speech
subjective test protocol
Telecommunications and information theory
title Subjective and Objective Quality Assessment of Audio Source Separation
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-09T12%3A25%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-hal_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Subjective%20and%20Objective%20Quality%20Assessment%20of%20Audio%20Source%20Separation&rft.jtitle=IEEE%20transactions%20on%20audio,%20speech,%20and%20language%20processing&rft.au=Emiya,%20V.&rft.date=2011-09-01&rft.volume=19&rft.issue=7&rft.spage=2046&rft.epage=2057&rft.pages=2046-2057&rft.issn=1558-7916&rft.eissn=1558-7924&rft.coden=ITASD8&rft_id=info:doi/10.1109/TASL.2011.2109381&rft_dat=%3Chal_RIE%3Eoai_HAL_inria_00567152v1%3C/hal_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=5704564&rfr_iscdi=true