Nonintrusive Quality Assessment of Noise Suppressed Speech With Mel-Filtered Energies and Support Vector Regression

Objective speech quality assessment is a challenging task which aims to emulate human judgment in the complex and time consuming task of subjective assessment. It is difficult to perform in line with the human perception due the complex and nonlinear nature of the human auditory system. The challeng...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on audio, speech, and language processing speech, and language processing, 2012-05, Vol.20 (4), p.1217-1232
Hauptverfasser: Narwaria, M., Weisi Lin, McLoughlin, I. V., Emmanuel, S., Liang-Tien Chia
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1232
container_issue 4
container_start_page 1217
container_title IEEE transactions on audio, speech, and language processing
container_volume 20
creator Narwaria, M.
Weisi Lin
McLoughlin, I. V.
Emmanuel, S.
Liang-Tien Chia
description Objective speech quality assessment is a challenging task which aims to emulate human judgment in the complex and time consuming task of subjective assessment. It is difficult to perform in line with the human perception due the complex and nonlinear nature of the human auditory system. The challenge lies in representing speech signals using appropriate features and subsequently mapping these features into a quality score. This paper proposes a nonintrusive metric for the quality assessment of noise-suppressed speech. The originality of the proposed approach lies primarily in the use of Mel filter bank energies (FBEs) as features and the use of support vector regression (SVR) for feature mapping. We utilize the sensitivity of FBEs to noise in order to obtain an effective representation of speech towards quality assessment. In addition, the use of SVR exploits the advantages of kernels which allow the regression algorithm to learn complex data patterns via nonlinear transformation for an effective and generalized mapping of features into the quality score. Extensive experiments conducted using two third party databases with different noise-suppressed speech signals show the effectiveness of the proposed approach.
doi_str_mv 10.1109/TASL.2011.2174223
format Article
fullrecord <record><control><sourceid>pascalfrancis_RIE</sourceid><recordid>TN_cdi_ieee_primary_6064875</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>6064875</ieee_id><sourcerecordid>26346864</sourcerecordid><originalsourceid>FETCH-LOGICAL-c295t-a606048b39482ed085ae72e7d34a19844c832ad100fe81c8f491260f92d08de63</originalsourceid><addsrcrecordid>eNo9kE1PwzAMhiMEEmPwAxCXXDh25KtpepymDZDGEGzAsQqpswV1bZVkSPv3tGzayZb9Ppb1IHRLyYhSkj-sxsv5iBFKR4xmgjF-hgY0TVWS5Uycn3oqL9FVCD-ECC4FHaCwaGpXR78L7hfw205XLu7xOAQIYQt1xI3Fi8YFwMtd2_puCiVetgBmg79c3OAXqJKZqyL4bjGtwa8dBKzr8h9ofMSfYGLj8Tuse9w19TW6sLoKcHOsQ_Qxm64mT8n89fF5Mp4nhuVpTLQkkgj1zXOhGJREpRoyBlnJhaa5EsIoznRJCbGgqFFW5JRJYnPWZUuQfIjo4a7xTQgebNF6t9V-X1BS9NaK3lrRWyuO1jrm_sC0OhhdWa9r48IJZJILqaTocneHnAOA07r7WKgs5X8xOncw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Nonintrusive Quality Assessment of Noise Suppressed Speech With Mel-Filtered Energies and Support Vector Regression</title><source>IEEE Electronic Library (IEL)</source><creator>Narwaria, M. ; Weisi Lin ; McLoughlin, I. V. ; Emmanuel, S. ; Liang-Tien Chia</creator><creatorcontrib>Narwaria, M. ; Weisi Lin ; McLoughlin, I. V. ; Emmanuel, S. ; Liang-Tien Chia</creatorcontrib><description>Objective speech quality assessment is a challenging task which aims to emulate human judgment in the complex and time consuming task of subjective assessment. It is difficult to perform in line with the human perception due the complex and nonlinear nature of the human auditory system. The challenge lies in representing speech signals using appropriate features and subsequently mapping these features into a quality score. This paper proposes a nonintrusive metric for the quality assessment of noise-suppressed speech. The originality of the proposed approach lies primarily in the use of Mel filter bank energies (FBEs) as features and the use of support vector regression (SVR) for feature mapping. We utilize the sensitivity of FBEs to noise in order to obtain an effective representation of speech towards quality assessment. In addition, the use of SVR exploits the advantages of kernels which allow the regression algorithm to learn complex data patterns via nonlinear transformation for an effective and generalized mapping of features into the quality score. Extensive experiments conducted using two third party databases with different noise-suppressed speech signals show the effectiveness of the proposed approach.</description><identifier>ISSN: 1558-7916</identifier><identifier>EISSN: 1558-7924</identifier><identifier>DOI: 10.1109/TASL.2011.2174223</identifier><identifier>CODEN: ITASD8</identifier><language>eng</language><publisher>Piscataway, NJ: IEEE</publisher><subject>Applied sciences ; Detection, estimation, filtering, equalization, prediction ; Exact sciences and technology ; Feature extraction ; Information, signal and communications theory ; Mel filter bank energies ; Noise ; Noise measurement ; Quality assessment ; Signal and communications theory ; Signal processing ; Signal representation. Spectral analysis ; Signal, noise ; Speech ; Speech processing ; speech quality assessment ; support vector regression ; Telecommunications and information theory</subject><ispartof>IEEE transactions on audio, speech, and language processing, 2012-05, Vol.20 (4), p.1217-1232</ispartof><rights>2015 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c295t-a606048b39482ed085ae72e7d34a19844c832ad100fe81c8f491260f92d08de63</citedby><cites>FETCH-LOGICAL-c295t-a606048b39482ed085ae72e7d34a19844c832ad100fe81c8f491260f92d08de63</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/6064875$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27924,27925,54758</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/6064875$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=26346864$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Narwaria, M.</creatorcontrib><creatorcontrib>Weisi Lin</creatorcontrib><creatorcontrib>McLoughlin, I. V.</creatorcontrib><creatorcontrib>Emmanuel, S.</creatorcontrib><creatorcontrib>Liang-Tien Chia</creatorcontrib><title>Nonintrusive Quality Assessment of Noise Suppressed Speech With Mel-Filtered Energies and Support Vector Regression</title><title>IEEE transactions on audio, speech, and language processing</title><addtitle>TASL</addtitle><description>Objective speech quality assessment is a challenging task which aims to emulate human judgment in the complex and time consuming task of subjective assessment. It is difficult to perform in line with the human perception due the complex and nonlinear nature of the human auditory system. The challenge lies in representing speech signals using appropriate features and subsequently mapping these features into a quality score. This paper proposes a nonintrusive metric for the quality assessment of noise-suppressed speech. The originality of the proposed approach lies primarily in the use of Mel filter bank energies (FBEs) as features and the use of support vector regression (SVR) for feature mapping. We utilize the sensitivity of FBEs to noise in order to obtain an effective representation of speech towards quality assessment. In addition, the use of SVR exploits the advantages of kernels which allow the regression algorithm to learn complex data patterns via nonlinear transformation for an effective and generalized mapping of features into the quality score. Extensive experiments conducted using two third party databases with different noise-suppressed speech signals show the effectiveness of the proposed approach.</description><subject>Applied sciences</subject><subject>Detection, estimation, filtering, equalization, prediction</subject><subject>Exact sciences and technology</subject><subject>Feature extraction</subject><subject>Information, signal and communications theory</subject><subject>Mel filter bank energies</subject><subject>Noise</subject><subject>Noise measurement</subject><subject>Quality assessment</subject><subject>Signal and communications theory</subject><subject>Signal processing</subject><subject>Signal representation. Spectral analysis</subject><subject>Signal, noise</subject><subject>Speech</subject><subject>Speech processing</subject><subject>speech quality assessment</subject><subject>support vector regression</subject><subject>Telecommunications and information theory</subject><issn>1558-7916</issn><issn>1558-7924</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2012</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kE1PwzAMhiMEEmPwAxCXXDh25KtpepymDZDGEGzAsQqpswV1bZVkSPv3tGzayZb9Ppb1IHRLyYhSkj-sxsv5iBFKR4xmgjF-hgY0TVWS5Uycn3oqL9FVCD-ECC4FHaCwaGpXR78L7hfw205XLu7xOAQIYQt1xI3Fi8YFwMtd2_puCiVetgBmg79c3OAXqJKZqyL4bjGtwa8dBKzr8h9ofMSfYGLj8Tuse9w19TW6sLoKcHOsQ_Qxm64mT8n89fF5Mp4nhuVpTLQkkgj1zXOhGJREpRoyBlnJhaa5EsIoznRJCbGgqFFW5JRJYnPWZUuQfIjo4a7xTQgebNF6t9V-X1BS9NaK3lrRWyuO1jrm_sC0OhhdWa9r48IJZJILqaTocneHnAOA07r7WKgs5X8xOncw</recordid><startdate>20120501</startdate><enddate>20120501</enddate><creator>Narwaria, M.</creator><creator>Weisi Lin</creator><creator>McLoughlin, I. V.</creator><creator>Emmanuel, S.</creator><creator>Liang-Tien Chia</creator><general>IEEE</general><general>Institute of Electrical and Electronics Engineers</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20120501</creationdate><title>Nonintrusive Quality Assessment of Noise Suppressed Speech With Mel-Filtered Energies and Support Vector Regression</title><author>Narwaria, M. ; Weisi Lin ; McLoughlin, I. V. ; Emmanuel, S. ; Liang-Tien Chia</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c295t-a606048b39482ed085ae72e7d34a19844c832ad100fe81c8f491260f92d08de63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Applied sciences</topic><topic>Detection, estimation, filtering, equalization, prediction</topic><topic>Exact sciences and technology</topic><topic>Feature extraction</topic><topic>Information, signal and communications theory</topic><topic>Mel filter bank energies</topic><topic>Noise</topic><topic>Noise measurement</topic><topic>Quality assessment</topic><topic>Signal and communications theory</topic><topic>Signal processing</topic><topic>Signal representation. Spectral analysis</topic><topic>Signal, noise</topic><topic>Speech</topic><topic>Speech processing</topic><topic>speech quality assessment</topic><topic>support vector regression</topic><topic>Telecommunications and information theory</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Narwaria, M.</creatorcontrib><creatorcontrib>Weisi Lin</creatorcontrib><creatorcontrib>McLoughlin, I. V.</creatorcontrib><creatorcontrib>Emmanuel, S.</creatorcontrib><creatorcontrib>Liang-Tien Chia</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>Pascal-Francis</collection><collection>CrossRef</collection><jtitle>IEEE transactions on audio, speech, and language processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Narwaria, M.</au><au>Weisi Lin</au><au>McLoughlin, I. V.</au><au>Emmanuel, S.</au><au>Liang-Tien Chia</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Nonintrusive Quality Assessment of Noise Suppressed Speech With Mel-Filtered Energies and Support Vector Regression</atitle><jtitle>IEEE transactions on audio, speech, and language processing</jtitle><stitle>TASL</stitle><date>2012-05-01</date><risdate>2012</risdate><volume>20</volume><issue>4</issue><spage>1217</spage><epage>1232</epage><pages>1217-1232</pages><issn>1558-7916</issn><eissn>1558-7924</eissn><coden>ITASD8</coden><abstract>Objective speech quality assessment is a challenging task which aims to emulate human judgment in the complex and time consuming task of subjective assessment. It is difficult to perform in line with the human perception due the complex and nonlinear nature of the human auditory system. The challenge lies in representing speech signals using appropriate features and subsequently mapping these features into a quality score. This paper proposes a nonintrusive metric for the quality assessment of noise-suppressed speech. The originality of the proposed approach lies primarily in the use of Mel filter bank energies (FBEs) as features and the use of support vector regression (SVR) for feature mapping. We utilize the sensitivity of FBEs to noise in order to obtain an effective representation of speech towards quality assessment. In addition, the use of SVR exploits the advantages of kernels which allow the regression algorithm to learn complex data patterns via nonlinear transformation for an effective and generalized mapping of features into the quality score. Extensive experiments conducted using two third party databases with different noise-suppressed speech signals show the effectiveness of the proposed approach.</abstract><cop>Piscataway, NJ</cop><pub>IEEE</pub><doi>10.1109/TASL.2011.2174223</doi><tpages>16</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1558-7916
ispartof IEEE transactions on audio, speech, and language processing, 2012-05, Vol.20 (4), p.1217-1232
issn 1558-7916
1558-7924
language eng
recordid cdi_ieee_primary_6064875
source IEEE Electronic Library (IEL)
subjects Applied sciences
Detection, estimation, filtering, equalization, prediction
Exact sciences and technology
Feature extraction
Information, signal and communications theory
Mel filter bank energies
Noise
Noise measurement
Quality assessment
Signal and communications theory
Signal processing
Signal representation. Spectral analysis
Signal, noise
Speech
Speech processing
speech quality assessment
support vector regression
Telecommunications and information theory
title Nonintrusive Quality Assessment of Noise Suppressed Speech With Mel-Filtered Energies and Support Vector Regression
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-22T23%3A43%3A40IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pascalfrancis_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Nonintrusive%20Quality%20Assessment%20of%20Noise%20Suppressed%20Speech%20With%20Mel-Filtered%20Energies%20and%20Support%20Vector%20Regression&rft.jtitle=IEEE%20transactions%20on%20audio,%20speech,%20and%20language%20processing&rft.au=Narwaria,%20M.&rft.date=2012-05-01&rft.volume=20&rft.issue=4&rft.spage=1217&rft.epage=1232&rft.pages=1217-1232&rft.issn=1558-7916&rft.eissn=1558-7924&rft.coden=ITASD8&rft_id=info:doi/10.1109/TASL.2011.2174223&rft_dat=%3Cpascalfrancis_RIE%3E26346864%3C/pascalfrancis_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=6064875&rfr_iscdi=true