Nonintrusive Quality Assessment of Noise Suppressed Speech With Mel-Filtered Energies and Support Vector Regression
Objective speech quality assessment is a challenging task which aims to emulate human judgment in the complex and time consuming task of subjective assessment. It is difficult to perform in line with the human perception due the complex and nonlinear nature of the human auditory system. The challeng...
Gespeichert in:
Veröffentlicht in: | IEEE transactions on audio, speech, and language processing speech, and language processing, 2012-05, Vol.20 (4), p.1217-1232 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 1232 |
---|---|
container_issue | 4 |
container_start_page | 1217 |
container_title | IEEE transactions on audio, speech, and language processing |
container_volume | 20 |
creator | Narwaria, M. Weisi Lin McLoughlin, I. V. Emmanuel, S. Liang-Tien Chia |
description | Objective speech quality assessment is a challenging task which aims to emulate human judgment in the complex and time consuming task of subjective assessment. It is difficult to perform in line with the human perception due the complex and nonlinear nature of the human auditory system. The challenge lies in representing speech signals using appropriate features and subsequently mapping these features into a quality score. This paper proposes a nonintrusive metric for the quality assessment of noise-suppressed speech. The originality of the proposed approach lies primarily in the use of Mel filter bank energies (FBEs) as features and the use of support vector regression (SVR) for feature mapping. We utilize the sensitivity of FBEs to noise in order to obtain an effective representation of speech towards quality assessment. In addition, the use of SVR exploits the advantages of kernels which allow the regression algorithm to learn complex data patterns via nonlinear transformation for an effective and generalized mapping of features into the quality score. Extensive experiments conducted using two third party databases with different noise-suppressed speech signals show the effectiveness of the proposed approach. |
doi_str_mv | 10.1109/TASL.2011.2174223 |
format | Article |
fullrecord | <record><control><sourceid>pascalfrancis_RIE</sourceid><recordid>TN_cdi_ieee_primary_6064875</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>6064875</ieee_id><sourcerecordid>26346864</sourcerecordid><originalsourceid>FETCH-LOGICAL-c295t-a606048b39482ed085ae72e7d34a19844c832ad100fe81c8f491260f92d08de63</originalsourceid><addsrcrecordid>eNo9kE1PwzAMhiMEEmPwAxCXXDh25KtpepymDZDGEGzAsQqpswV1bZVkSPv3tGzayZb9Ppb1IHRLyYhSkj-sxsv5iBFKR4xmgjF-hgY0TVWS5Uycn3oqL9FVCD-ECC4FHaCwaGpXR78L7hfw205XLu7xOAQIYQt1xI3Fi8YFwMtd2_puCiVetgBmg79c3OAXqJKZqyL4bjGtwa8dBKzr8h9ofMSfYGLj8Tuse9w19TW6sLoKcHOsQ_Qxm64mT8n89fF5Mp4nhuVpTLQkkgj1zXOhGJREpRoyBlnJhaa5EsIoznRJCbGgqFFW5JRJYnPWZUuQfIjo4a7xTQgebNF6t9V-X1BS9NaK3lrRWyuO1jrm_sC0OhhdWa9r48IJZJILqaTocneHnAOA07r7WKgs5X8xOncw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Nonintrusive Quality Assessment of Noise Suppressed Speech With Mel-Filtered Energies and Support Vector Regression</title><source>IEEE Electronic Library (IEL)</source><creator>Narwaria, M. ; Weisi Lin ; McLoughlin, I. V. ; Emmanuel, S. ; Liang-Tien Chia</creator><creatorcontrib>Narwaria, M. ; Weisi Lin ; McLoughlin, I. V. ; Emmanuel, S. ; Liang-Tien Chia</creatorcontrib><description>Objective speech quality assessment is a challenging task which aims to emulate human judgment in the complex and time consuming task of subjective assessment. It is difficult to perform in line with the human perception due the complex and nonlinear nature of the human auditory system. The challenge lies in representing speech signals using appropriate features and subsequently mapping these features into a quality score. This paper proposes a nonintrusive metric for the quality assessment of noise-suppressed speech. The originality of the proposed approach lies primarily in the use of Mel filter bank energies (FBEs) as features and the use of support vector regression (SVR) for feature mapping. We utilize the sensitivity of FBEs to noise in order to obtain an effective representation of speech towards quality assessment. In addition, the use of SVR exploits the advantages of kernels which allow the regression algorithm to learn complex data patterns via nonlinear transformation for an effective and generalized mapping of features into the quality score. Extensive experiments conducted using two third party databases with different noise-suppressed speech signals show the effectiveness of the proposed approach.</description><identifier>ISSN: 1558-7916</identifier><identifier>EISSN: 1558-7924</identifier><identifier>DOI: 10.1109/TASL.2011.2174223</identifier><identifier>CODEN: ITASD8</identifier><language>eng</language><publisher>Piscataway, NJ: IEEE</publisher><subject>Applied sciences ; Detection, estimation, filtering, equalization, prediction ; Exact sciences and technology ; Feature extraction ; Information, signal and communications theory ; Mel filter bank energies ; Noise ; Noise measurement ; Quality assessment ; Signal and communications theory ; Signal processing ; Signal representation. Spectral analysis ; Signal, noise ; Speech ; Speech processing ; speech quality assessment ; support vector regression ; Telecommunications and information theory</subject><ispartof>IEEE transactions on audio, speech, and language processing, 2012-05, Vol.20 (4), p.1217-1232</ispartof><rights>2015 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c295t-a606048b39482ed085ae72e7d34a19844c832ad100fe81c8f491260f92d08de63</citedby><cites>FETCH-LOGICAL-c295t-a606048b39482ed085ae72e7d34a19844c832ad100fe81c8f491260f92d08de63</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/6064875$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27924,27925,54758</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/6064875$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=26346864$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Narwaria, M.</creatorcontrib><creatorcontrib>Weisi Lin</creatorcontrib><creatorcontrib>McLoughlin, I. V.</creatorcontrib><creatorcontrib>Emmanuel, S.</creatorcontrib><creatorcontrib>Liang-Tien Chia</creatorcontrib><title>Nonintrusive Quality Assessment of Noise Suppressed Speech With Mel-Filtered Energies and Support Vector Regression</title><title>IEEE transactions on audio, speech, and language processing</title><addtitle>TASL</addtitle><description>Objective speech quality assessment is a challenging task which aims to emulate human judgment in the complex and time consuming task of subjective assessment. It is difficult to perform in line with the human perception due the complex and nonlinear nature of the human auditory system. The challenge lies in representing speech signals using appropriate features and subsequently mapping these features into a quality score. This paper proposes a nonintrusive metric for the quality assessment of noise-suppressed speech. The originality of the proposed approach lies primarily in the use of Mel filter bank energies (FBEs) as features and the use of support vector regression (SVR) for feature mapping. We utilize the sensitivity of FBEs to noise in order to obtain an effective representation of speech towards quality assessment. In addition, the use of SVR exploits the advantages of kernels which allow the regression algorithm to learn complex data patterns via nonlinear transformation for an effective and generalized mapping of features into the quality score. Extensive experiments conducted using two third party databases with different noise-suppressed speech signals show the effectiveness of the proposed approach.</description><subject>Applied sciences</subject><subject>Detection, estimation, filtering, equalization, prediction</subject><subject>Exact sciences and technology</subject><subject>Feature extraction</subject><subject>Information, signal and communications theory</subject><subject>Mel filter bank energies</subject><subject>Noise</subject><subject>Noise measurement</subject><subject>Quality assessment</subject><subject>Signal and communications theory</subject><subject>Signal processing</subject><subject>Signal representation. Spectral analysis</subject><subject>Signal, noise</subject><subject>Speech</subject><subject>Speech processing</subject><subject>speech quality assessment</subject><subject>support vector regression</subject><subject>Telecommunications and information theory</subject><issn>1558-7916</issn><issn>1558-7924</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2012</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kE1PwzAMhiMEEmPwAxCXXDh25KtpepymDZDGEGzAsQqpswV1bZVkSPv3tGzayZb9Ppb1IHRLyYhSkj-sxsv5iBFKR4xmgjF-hgY0TVWS5Uycn3oqL9FVCD-ECC4FHaCwaGpXR78L7hfw205XLu7xOAQIYQt1xI3Fi8YFwMtd2_puCiVetgBmg79c3OAXqJKZqyL4bjGtwa8dBKzr8h9ofMSfYGLj8Tuse9w19TW6sLoKcHOsQ_Qxm64mT8n89fF5Mp4nhuVpTLQkkgj1zXOhGJREpRoyBlnJhaa5EsIoznRJCbGgqFFW5JRJYnPWZUuQfIjo4a7xTQgebNF6t9V-X1BS9NaK3lrRWyuO1jrm_sC0OhhdWa9r48IJZJILqaTocneHnAOA07r7WKgs5X8xOncw</recordid><startdate>20120501</startdate><enddate>20120501</enddate><creator>Narwaria, M.</creator><creator>Weisi Lin</creator><creator>McLoughlin, I. V.</creator><creator>Emmanuel, S.</creator><creator>Liang-Tien Chia</creator><general>IEEE</general><general>Institute of Electrical and Electronics Engineers</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20120501</creationdate><title>Nonintrusive Quality Assessment of Noise Suppressed Speech With Mel-Filtered Energies and Support Vector Regression</title><author>Narwaria, M. ; Weisi Lin ; McLoughlin, I. V. ; Emmanuel, S. ; Liang-Tien Chia</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c295t-a606048b39482ed085ae72e7d34a19844c832ad100fe81c8f491260f92d08de63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Applied sciences</topic><topic>Detection, estimation, filtering, equalization, prediction</topic><topic>Exact sciences and technology</topic><topic>Feature extraction</topic><topic>Information, signal and communications theory</topic><topic>Mel filter bank energies</topic><topic>Noise</topic><topic>Noise measurement</topic><topic>Quality assessment</topic><topic>Signal and communications theory</topic><topic>Signal processing</topic><topic>Signal representation. Spectral analysis</topic><topic>Signal, noise</topic><topic>Speech</topic><topic>Speech processing</topic><topic>speech quality assessment</topic><topic>support vector regression</topic><topic>Telecommunications and information theory</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Narwaria, M.</creatorcontrib><creatorcontrib>Weisi Lin</creatorcontrib><creatorcontrib>McLoughlin, I. V.</creatorcontrib><creatorcontrib>Emmanuel, S.</creatorcontrib><creatorcontrib>Liang-Tien Chia</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>Pascal-Francis</collection><collection>CrossRef</collection><jtitle>IEEE transactions on audio, speech, and language processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Narwaria, M.</au><au>Weisi Lin</au><au>McLoughlin, I. V.</au><au>Emmanuel, S.</au><au>Liang-Tien Chia</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Nonintrusive Quality Assessment of Noise Suppressed Speech With Mel-Filtered Energies and Support Vector Regression</atitle><jtitle>IEEE transactions on audio, speech, and language processing</jtitle><stitle>TASL</stitle><date>2012-05-01</date><risdate>2012</risdate><volume>20</volume><issue>4</issue><spage>1217</spage><epage>1232</epage><pages>1217-1232</pages><issn>1558-7916</issn><eissn>1558-7924</eissn><coden>ITASD8</coden><abstract>Objective speech quality assessment is a challenging task which aims to emulate human judgment in the complex and time consuming task of subjective assessment. It is difficult to perform in line with the human perception due the complex and nonlinear nature of the human auditory system. The challenge lies in representing speech signals using appropriate features and subsequently mapping these features into a quality score. This paper proposes a nonintrusive metric for the quality assessment of noise-suppressed speech. The originality of the proposed approach lies primarily in the use of Mel filter bank energies (FBEs) as features and the use of support vector regression (SVR) for feature mapping. We utilize the sensitivity of FBEs to noise in order to obtain an effective representation of speech towards quality assessment. In addition, the use of SVR exploits the advantages of kernels which allow the regression algorithm to learn complex data patterns via nonlinear transformation for an effective and generalized mapping of features into the quality score. Extensive experiments conducted using two third party databases with different noise-suppressed speech signals show the effectiveness of the proposed approach.</abstract><cop>Piscataway, NJ</cop><pub>IEEE</pub><doi>10.1109/TASL.2011.2174223</doi><tpages>16</tpages></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1558-7916 |
ispartof | IEEE transactions on audio, speech, and language processing, 2012-05, Vol.20 (4), p.1217-1232 |
issn | 1558-7916 1558-7924 |
language | eng |
recordid | cdi_ieee_primary_6064875 |
source | IEEE Electronic Library (IEL) |
subjects | Applied sciences Detection, estimation, filtering, equalization, prediction Exact sciences and technology Feature extraction Information, signal and communications theory Mel filter bank energies Noise Noise measurement Quality assessment Signal and communications theory Signal processing Signal representation. Spectral analysis Signal, noise Speech Speech processing speech quality assessment support vector regression Telecommunications and information theory |
title | Nonintrusive Quality Assessment of Noise Suppressed Speech With Mel-Filtered Energies and Support Vector Regression |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-22T23%3A43%3A40IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pascalfrancis_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Nonintrusive%20Quality%20Assessment%20of%20Noise%20Suppressed%20Speech%20With%20Mel-Filtered%20Energies%20and%20Support%20Vector%20Regression&rft.jtitle=IEEE%20transactions%20on%20audio,%20speech,%20and%20language%20processing&rft.au=Narwaria,%20M.&rft.date=2012-05-01&rft.volume=20&rft.issue=4&rft.spage=1217&rft.epage=1232&rft.pages=1217-1232&rft.issn=1558-7916&rft.eissn=1558-7924&rft.coden=ITASD8&rft_id=info:doi/10.1109/TASL.2011.2174223&rft_dat=%3Cpascalfrancis_RIE%3E26346864%3C/pascalfrancis_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=6064875&rfr_iscdi=true |