Rapid joint speaker and noise compensation for robust speech recognition
For speech recognition, mismatches between training and testing for speaker and noise are normally handled separately. The work presented in this paper aims at jointly applying speaker adaptation and model-based noise compensation by embedding speaker adaptation as part of the noise mismatch functio...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Tagungsbericht |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 5503 |
---|---|
container_issue | |
container_start_page | 5500 |
container_title | |
container_volume | |
creator | Chin, K. K. Haitian Xu Gales, Mark J. F. Breslin, Catherine Knill, Kate |
description | For speech recognition, mismatches between training and testing for speaker and noise are normally handled separately. The work presented in this paper aims at jointly applying speaker adaptation and model-based noise compensation by embedding speaker adaptation as part of the noise mismatch function. The proposed method gives a faster and more optimum adaptation compared to compensating for these two factors separately. It is also more consistent with respect to the basic assumptions of speaker and noise adaptation. Experimental results show significant and consistent gains from the proposed method. |
doi_str_mv | 10.1109/ICASSP.2011.5947604 |
format | Conference Proceeding |
fullrecord | <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_5947604</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>5947604</ieee_id><sourcerecordid>5947604</sourcerecordid><originalsourceid>FETCH-LOGICAL-i175t-6c76cd0db520037e5e6c9083deee5926612ac35c0a2ad3186db631242d0b3c683</originalsourceid><addsrcrecordid>eNo1UEtOwzAUND-JUHqCbnyBhPfs2I6XqIIWqRKIgsSucmwXXGgc2WHB7QlQZjOL-Wg0hMwQKkTQV3fz6_X6oWKAWAldKwn1EbnAWigFgmt1TArGlS5Rw8sJmWrV_GsNnJICBYNSYq3PyTTnHYyQTCmhC7J8NH1wdBdDN9Dce_PuEzWdo10M2VMb973vshlC7Og2Jppi-5l_nd6-0eRtfO3Cj3pJzrbmI_vpgSfk-fbmab4sV_eLcf6qDKjEUEqrpHXg2nEScOWFl1ZDw533XmgmJTJjubBgmHEcG-layZHVzEHLrWz4hMz-esOY2PQp7E362hxO4d8MXFIQ</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Rapid joint speaker and noise compensation for robust speech recognition</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Chin, K. K. ; Haitian Xu ; Gales, Mark J. F. ; Breslin, Catherine ; Knill, Kate</creator><creatorcontrib>Chin, K. K. ; Haitian Xu ; Gales, Mark J. F. ; Breslin, Catherine ; Knill, Kate</creatorcontrib><description>For speech recognition, mismatches between training and testing for speaker and noise are normally handled separately. The work presented in this paper aims at jointly applying speaker adaptation and model-based noise compensation by embedding speaker adaptation as part of the noise mismatch function. The proposed method gives a faster and more optimum adaptation compared to compensating for these two factors separately. It is also more consistent with respect to the basic assumptions of speaker and noise adaptation. Experimental results show significant and consistent gains from the proposed method.</description><identifier>ISSN: 1520-6149</identifier><identifier>ISBN: 9781457705380</identifier><identifier>ISBN: 1457705389</identifier><identifier>EISSN: 2379-190X</identifier><identifier>EISBN: 1457705397</identifier><identifier>EISBN: 9781457705373</identifier><identifier>EISBN: 9781457705397</identifier><identifier>EISBN: 1457705370</identifier><identifier>DOI: 10.1109/ICASSP.2011.5947604</identifier><language>eng</language><publisher>IEEE</publisher><subject>Adaptation models ; Estimation ; Hidden Markov models ; Noise ; Noise compensation ; Rapid adaptation ; Robust ASR ; Speaker adaptation ; Speech ; Speech recognition ; Transforms</subject><ispartof>2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011, p.5500-5503</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/5947604$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2058,27925,54920</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/5947604$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Chin, K. K.</creatorcontrib><creatorcontrib>Haitian Xu</creatorcontrib><creatorcontrib>Gales, Mark J. F.</creatorcontrib><creatorcontrib>Breslin, Catherine</creatorcontrib><creatorcontrib>Knill, Kate</creatorcontrib><title>Rapid joint speaker and noise compensation for robust speech recognition</title><title>2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</title><addtitle>ICASSP</addtitle><description>For speech recognition, mismatches between training and testing for speaker and noise are normally handled separately. The work presented in this paper aims at jointly applying speaker adaptation and model-based noise compensation by embedding speaker adaptation as part of the noise mismatch function. The proposed method gives a faster and more optimum adaptation compared to compensating for these two factors separately. It is also more consistent with respect to the basic assumptions of speaker and noise adaptation. Experimental results show significant and consistent gains from the proposed method.</description><subject>Adaptation models</subject><subject>Estimation</subject><subject>Hidden Markov models</subject><subject>Noise</subject><subject>Noise compensation</subject><subject>Rapid adaptation</subject><subject>Robust ASR</subject><subject>Speaker adaptation</subject><subject>Speech</subject><subject>Speech recognition</subject><subject>Transforms</subject><issn>1520-6149</issn><issn>2379-190X</issn><isbn>9781457705380</isbn><isbn>1457705389</isbn><isbn>1457705397</isbn><isbn>9781457705373</isbn><isbn>9781457705397</isbn><isbn>1457705370</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2011</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNo1UEtOwzAUND-JUHqCbnyBhPfs2I6XqIIWqRKIgsSucmwXXGgc2WHB7QlQZjOL-Wg0hMwQKkTQV3fz6_X6oWKAWAldKwn1EbnAWigFgmt1TArGlS5Rw8sJmWrV_GsNnJICBYNSYq3PyTTnHYyQTCmhC7J8NH1wdBdDN9Dce_PuEzWdo10M2VMb973vshlC7Og2Jppi-5l_nd6-0eRtfO3Cj3pJzrbmI_vpgSfk-fbmab4sV_eLcf6qDKjEUEqrpHXg2nEScOWFl1ZDw533XmgmJTJjubBgmHEcG-layZHVzEHLrWz4hMz-esOY2PQp7E362hxO4d8MXFIQ</recordid><startdate>201105</startdate><enddate>201105</enddate><creator>Chin, K. K.</creator><creator>Haitian Xu</creator><creator>Gales, Mark J. F.</creator><creator>Breslin, Catherine</creator><creator>Knill, Kate</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>201105</creationdate><title>Rapid joint speaker and noise compensation for robust speech recognition</title><author>Chin, K. K. ; Haitian Xu ; Gales, Mark J. F. ; Breslin, Catherine ; Knill, Kate</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i175t-6c76cd0db520037e5e6c9083deee5926612ac35c0a2ad3186db631242d0b3c683</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2011</creationdate><topic>Adaptation models</topic><topic>Estimation</topic><topic>Hidden Markov models</topic><topic>Noise</topic><topic>Noise compensation</topic><topic>Rapid adaptation</topic><topic>Robust ASR</topic><topic>Speaker adaptation</topic><topic>Speech</topic><topic>Speech recognition</topic><topic>Transforms</topic><toplevel>online_resources</toplevel><creatorcontrib>Chin, K. K.</creatorcontrib><creatorcontrib>Haitian Xu</creatorcontrib><creatorcontrib>Gales, Mark J. F.</creatorcontrib><creatorcontrib>Breslin, Catherine</creatorcontrib><creatorcontrib>Knill, Kate</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Chin, K. K.</au><au>Haitian Xu</au><au>Gales, Mark J. F.</au><au>Breslin, Catherine</au><au>Knill, Kate</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Rapid joint speaker and noise compensation for robust speech recognition</atitle><btitle>2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</btitle><stitle>ICASSP</stitle><date>2011-05</date><risdate>2011</risdate><spage>5500</spage><epage>5503</epage><pages>5500-5503</pages><issn>1520-6149</issn><eissn>2379-190X</eissn><isbn>9781457705380</isbn><isbn>1457705389</isbn><eisbn>1457705397</eisbn><eisbn>9781457705373</eisbn><eisbn>9781457705397</eisbn><eisbn>1457705370</eisbn><abstract>For speech recognition, mismatches between training and testing for speaker and noise are normally handled separately. The work presented in this paper aims at jointly applying speaker adaptation and model-based noise compensation by embedding speaker adaptation as part of the noise mismatch function. The proposed method gives a faster and more optimum adaptation compared to compensating for these two factors separately. It is also more consistent with respect to the basic assumptions of speaker and noise adaptation. Experimental results show significant and consistent gains from the proposed method.</abstract><pub>IEEE</pub><doi>10.1109/ICASSP.2011.5947604</doi><tpages>4</tpages></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1520-6149 |
ispartof | 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011, p.5500-5503 |
issn | 1520-6149 2379-190X |
language | eng |
recordid | cdi_ieee_primary_5947604 |
source | IEEE Electronic Library (IEL) Conference Proceedings |
subjects | Adaptation models Estimation Hidden Markov models Noise Noise compensation Rapid adaptation Robust ASR Speaker adaptation Speech Speech recognition Transforms |
title | Rapid joint speaker and noise compensation for robust speech recognition |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T14%3A09%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Rapid%20joint%20speaker%20and%20noise%20compensation%20for%20robust%20speech%20recognition&rft.btitle=2011%20IEEE%20International%20Conference%20on%20Acoustics,%20Speech%20and%20Signal%20Processing%20(ICASSP)&rft.au=Chin,%20K.%20K.&rft.date=2011-05&rft.spage=5500&rft.epage=5503&rft.pages=5500-5503&rft.issn=1520-6149&rft.eissn=2379-190X&rft.isbn=9781457705380&rft.isbn_list=1457705389&rft_id=info:doi/10.1109/ICASSP.2011.5947604&rft_dat=%3Cieee_6IE%3E5947604%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=1457705397&rft.eisbn_list=9781457705373&rft.eisbn_list=9781457705397&rft.eisbn_list=1457705370&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=5947604&rfr_iscdi=true |