Rapid joint speaker and noise compensation for robust speech recognition


Bibliographic details
Main authors: Chin, K. K.; Haitian Xu; Gales, Mark J. F.; Breslin, Catherine; Knill, Kate
Format: Conference proceeding
Language: eng
container_end_page 5503
container_issue
container_start_page 5500
container_title
container_volume
creator Chin, K. K.
Haitian Xu
Gales, Mark J. F.
Breslin, Catherine
Knill, Kate
description For speech recognition, mismatches between training and testing for speaker and noise are normally handled separately. The work presented in this paper aims at jointly applying speaker adaptation and model-based noise compensation by embedding speaker adaptation as part of the noise mismatch function. The proposed method gives a faster and more optimum adaptation compared to compensating for these two factors separately. It is also more consistent with respect to the basic assumptions of speaker and noise adaptation. Experimental results show significant and consistent gains from the proposed method.
doi_str_mv 10.1109/ICASSP.2011.5947604
format Conference Proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_5947604</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>5947604</ieee_id><sourcerecordid>5947604</sourcerecordid><originalsourceid>FETCH-LOGICAL-i175t-6c76cd0db520037e5e6c9083deee5926612ac35c0a2ad3186db631242d0b3c683</originalsourceid><addsrcrecordid>eNo1UEtOwzAUND-JUHqCbnyBhPfs2I6XqIIWqRKIgsSucmwXXGgc2WHB7QlQZjOL-Wg0hMwQKkTQV3fz6_X6oWKAWAldKwn1EbnAWigFgmt1TArGlS5Rw8sJmWrV_GsNnJICBYNSYq3PyTTnHYyQTCmhC7J8NH1wdBdDN9Dce_PuEzWdo10M2VMb973vshlC7Og2Jppi-5l_nd6-0eRtfO3Cj3pJzrbmI_vpgSfk-fbmab4sV_eLcf6qDKjEUEqrpHXg2nEScOWFl1ZDw533XmgmJTJjubBgmHEcG-layZHVzEHLrWz4hMz-esOY2PQp7E362hxO4d8MXFIQ</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Rapid joint speaker and noise compensation for robust speech recognition</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Chin, K. K. ; Haitian Xu ; Gales, Mark J. F. ; Breslin, Catherine ; Knill, Kate</creator><creatorcontrib>Chin, K. K. ; Haitian Xu ; Gales, Mark J. F. ; Breslin, Catherine ; Knill, Kate</creatorcontrib><description>For speech recognition, mismatches between training and testing for speaker and noise are normally handled separately. The work presented in this paper aims at jointly applying speaker adaptation and model-based noise compensation by embedding speaker adaptation as part of the noise mismatch function. The proposed method gives a faster and more optimum adaptation compared to compensating for these two factors separately. It is also more consistent with respect to the basic assumptions of speaker and noise adaptation. 
Experimental results show significant and consistent gains from the proposed method.</description><identifier>ISSN: 1520-6149</identifier><identifier>ISBN: 9781457705380</identifier><identifier>ISBN: 1457705389</identifier><identifier>EISSN: 2379-190X</identifier><identifier>EISBN: 1457705397</identifier><identifier>EISBN: 9781457705373</identifier><identifier>EISBN: 9781457705397</identifier><identifier>EISBN: 1457705370</identifier><identifier>DOI: 10.1109/ICASSP.2011.5947604</identifier><language>eng</language><publisher>IEEE</publisher><subject>Adaptation models ; Estimation ; Hidden Markov models ; Noise ; Noise compensation ; Rapid adaptation ; Robust ASR ; Speaker adaptation ; Speech ; Speech recognition ; Transforms</subject><ispartof>2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011, p.5500-5503</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/5947604$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2058,27925,54920</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/5947604$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Chin, K. K.</creatorcontrib><creatorcontrib>Haitian Xu</creatorcontrib><creatorcontrib>Gales, Mark J. F.</creatorcontrib><creatorcontrib>Breslin, Catherine</creatorcontrib><creatorcontrib>Knill, Kate</creatorcontrib><title>Rapid joint speaker and noise compensation for robust speech recognition</title><title>2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</title><addtitle>ICASSP</addtitle><description>For speech recognition, mismatches between training and testing for speaker and noise are normally handled separately. 
The work presented in this paper aims at jointly applying speaker adaptation and model-based noise compensation by embedding speaker adaptation as part of the noise mismatch function. The proposed method gives a faster and more optimum adaptation compared to compensating for these two factors separately. It is also more consistent with respect to the basic assumptions of speaker and noise adaptation. Experimental results show significant and consistent gains from the proposed method.</description><subject>Adaptation models</subject><subject>Estimation</subject><subject>Hidden Markov models</subject><subject>Noise</subject><subject>Noise compensation</subject><subject>Rapid adaptation</subject><subject>Robust ASR</subject><subject>Speaker adaptation</subject><subject>Speech</subject><subject>Speech recognition</subject><subject>Transforms</subject><issn>1520-6149</issn><issn>2379-190X</issn><isbn>9781457705380</isbn><isbn>1457705389</isbn><isbn>1457705397</isbn><isbn>9781457705373</isbn><isbn>9781457705397</isbn><isbn>1457705370</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2011</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNo1UEtOwzAUND-JUHqCbnyBhPfs2I6XqIIWqRKIgsSucmwXXGgc2WHB7QlQZjOL-Wg0hMwQKkTQV3fz6_X6oWKAWAldKwn1EbnAWigFgmt1TArGlS5Rw8sJmWrV_GsNnJICBYNSYq3PyTTnHYyQTCmhC7J8NH1wdBdDN9Dce_PuEzWdo10M2VMb973vshlC7Og2Jppi-5l_nd6-0eRtfO3Cj3pJzrbmI_vpgSfk-fbmab4sV_eLcf6qDKjEUEqrpHXg2nEScOWFl1ZDw533XmgmJTJjubBgmHEcG-layZHVzEHLrWz4hMz-esOY2PQp7E362hxO4d8MXFIQ</recordid><startdate>201105</startdate><enddate>201105</enddate><creator>Chin, K. K.</creator><creator>Haitian Xu</creator><creator>Gales, Mark J. 
F.</creator><creator>Breslin, Catherine</creator><creator>Knill, Kate</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>201105</creationdate><title>Rapid joint speaker and noise compensation for robust speech recognition</title><author>Chin, K. K. ; Haitian Xu ; Gales, Mark J. F. ; Breslin, Catherine ; Knill, Kate</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i175t-6c76cd0db520037e5e6c9083deee5926612ac35c0a2ad3186db631242d0b3c683</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2011</creationdate><topic>Adaptation models</topic><topic>Estimation</topic><topic>Hidden Markov models</topic><topic>Noise</topic><topic>Noise compensation</topic><topic>Rapid adaptation</topic><topic>Robust ASR</topic><topic>Speaker adaptation</topic><topic>Speech</topic><topic>Speech recognition</topic><topic>Transforms</topic><toplevel>online_resources</toplevel><creatorcontrib>Chin, K. K.</creatorcontrib><creatorcontrib>Haitian Xu</creatorcontrib><creatorcontrib>Gales, Mark J. F.</creatorcontrib><creatorcontrib>Breslin, Catherine</creatorcontrib><creatorcontrib>Knill, Kate</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Chin, K. K.</au><au>Haitian Xu</au><au>Gales, Mark J. 
F.</au><au>Breslin, Catherine</au><au>Knill, Kate</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Rapid joint speaker and noise compensation for robust speech recognition</atitle><btitle>2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</btitle><stitle>ICASSP</stitle><date>2011-05</date><risdate>2011</risdate><spage>5500</spage><epage>5503</epage><pages>5500-5503</pages><issn>1520-6149</issn><eissn>2379-190X</eissn><isbn>9781457705380</isbn><isbn>1457705389</isbn><eisbn>1457705397</eisbn><eisbn>9781457705373</eisbn><eisbn>9781457705397</eisbn><eisbn>1457705370</eisbn><abstract>For speech recognition, mismatches between training and testing for speaker and noise are normally handled separately. The work presented in this paper aims at jointly applying speaker adaptation and model-based noise compensation by embedding speaker adaptation as part of the noise mismatch function. The proposed method gives a faster and more optimum adaptation compared to compensating for these two factors separately. It is also more consistent with respect to the basic assumptions of speaker and noise adaptation. Experimental results show significant and consistent gains from the proposed method.</abstract><pub>IEEE</pub><doi>10.1109/ICASSP.2011.5947604</doi><tpages>4</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1520-6149
ispartof 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011, p.5500-5503
issn 1520-6149
2379-190X
language eng
recordid cdi_ieee_primary_5947604
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Adaptation models
Estimation
Hidden Markov models
Noise
Noise compensation
Rapid adaptation
Robust ASR
Speaker adaptation
Speech
Speech recognition
Transforms
title Rapid joint speaker and noise compensation for robust speech recognition
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T14%3A09%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Rapid%20joint%20speaker%20and%20noise%20compensation%20for%20robust%20speech%20recognition&rft.btitle=2011%20IEEE%20International%20Conference%20on%20Acoustics,%20Speech%20and%20Signal%20Processing%20(ICASSP)&rft.au=Chin,%20K.%20K.&rft.date=2011-05&rft.spage=5500&rft.epage=5503&rft.pages=5500-5503&rft.issn=1520-6149&rft.eissn=2379-190X&rft.isbn=9781457705380&rft.isbn_list=1457705389&rft_id=info:doi/10.1109/ICASSP.2011.5947604&rft_dat=%3Cieee_6IE%3E5947604%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=1457705397&rft.eisbn_list=9781457705373&rft.eisbn_list=9781457705397&rft.eisbn_list=1457705370&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=5947604&rfr_iscdi=true