Rapid joint speaker and noise compensation for robust speech recognition

For speech recognition, mismatches between training and testing for speaker and noise are normally handled separately. The work presented in this paper aims at jointly applying speaker adaptation and model-based noise compensation by embedding speaker adaptation as part of the noise mismatch functio...

Bibliographic details
Main authors:	Chin, K. K.; Haitian Xu; Gales, Mark J. F.; Breslin, Catherine; Knill, Kate
Format:	Conference proceeding
Language:	English
Subjects:	Adaptation models; Estimation; Hidden Markov models; Noise; Noise compensation; Rapid adaptation; Robust ASR; Speaker adaptation; Speech; Speech recognition; Transforms

container_end_page	5503
container_issue
container_start_page	5500
container_title
container_volume
creator	Chin, K. K.; Haitian Xu; Gales, Mark J. F.; Breslin, Catherine; Knill, Kate
description	For speech recognition, mismatches between training and testing for speaker and noise are normally handled separately. The work presented in this paper aims at jointly applying speaker adaptation and model-based noise compensation by embedding speaker adaptation as part of the noise mismatch function. The proposed method gives a faster and more optimum adaptation compared to compensating for these two factors separately. It is also more consistent with respect to the basic assumptions of speaker and noise adaptation. Experimental results show significant and consistent gains from the proposed method.
doi_str_mv	10.1109/ICASSP.2011.5947604
format	Conference Proceeding
fullrecord	<record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_5947604</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>5947604</ieee_id><sourcerecordid>5947604</sourcerecordid><originalsourceid>FETCH-LOGICAL-i175t-6c76cd0db520037e5e6c9083deee5926612ac35c0a2ad3186db631242d0b3c683</originalsourceid><addsrcrecordid>eNo1UEtOwzAUND-JUHqCbnyBhPfs2I6XqIIWqRKIgsSucmwXXGgc2WHB7QlQZjOL-Wg0hMwQKkTQV3fz6_X6oWKAWAldKwn1EbnAWigFgmt1TArGlS5Rw8sJmWrV_GsNnJICBYNSYq3PyTTnHYyQTCmhC7J8NH1wdBdDN9Dce_PuEzWdo10M2VMb973vshlC7Og2Jppi-5l_nd6-0eRtfO3Cj3pJzrbmI_vpgSfk-fbmab4sV_eLcf6qDKjEUEqrpHXg2nEScOWFl1ZDw533XmgmJTJjubBgmHEcG-layZHVzEHLrWz4hMz-esOY2PQp7E362hxO4d8MXFIQ</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Rapid joint speaker and noise compensation for robust speech recognition</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Chin, K. K. ; Haitian Xu ; Gales, Mark J. F. ; Breslin, Catherine ; Knill, Kate</creator><creatorcontrib>Chin, K. K. ; Haitian Xu ; Gales, Mark J. F. ; Breslin, Catherine ; Knill, Kate</creatorcontrib><description>For speech recognition, mismatches between training and testing for speaker and noise are normally handled separately. The work presented in this paper aims at jointly applying speaker adaptation and model-based noise compensation by embedding speaker adaptation as part of the noise mismatch function. The proposed method gives a faster and more optimum adaptation compared to compensating for these two factors separately. It is also more consistent with respect to the basic assumptions of speaker and noise adaptation. 
Experimental results show significant and consistent gains from the proposed method.</description><identifier>ISSN: 1520-6149</identifier><identifier>ISBN: 9781457705380</identifier><identifier>ISBN: 1457705389</identifier><identifier>EISSN: 2379-190X</identifier><identifier>EISBN: 1457705397</identifier><identifier>EISBN: 9781457705373</identifier><identifier>EISBN: 9781457705397</identifier><identifier>EISBN: 1457705370</identifier><identifier>DOI: 10.1109/ICASSP.2011.5947604</identifier><language>eng</language><publisher>IEEE</publisher><subject>Adaptation models ; Estimation ; Hidden Markov models ; Noise ; Noise compensation ; Rapid adaptation ; Robust ASR ; Speaker adaptation ; Speech ; Speech recognition ; Transforms</subject><ispartof>2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011, p.5500-5503</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/5947604$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2058,27925,54920</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/5947604$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Chin, K. K.</creatorcontrib><creatorcontrib>Haitian Xu</creatorcontrib><creatorcontrib>Gales, Mark J. F.</creatorcontrib><creatorcontrib>Breslin, Catherine</creatorcontrib><creatorcontrib>Knill, Kate</creatorcontrib><title>Rapid joint speaker and noise compensation for robust speech recognition</title><title>2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</title><addtitle>ICASSP</addtitle><description>For speech recognition, mismatches between training and testing for speaker and noise are normally handled separately. 
The work presented in this paper aims at jointly applying speaker adaptation and model-based noise compensation by embedding speaker adaptation as part of the noise mismatch function. The proposed method gives a faster and more optimum adaptation compared to compensating for these two factors separately. It is also more consistent with respect to the basic assumptions of speaker and noise adaptation. Experimental results show significant and consistent gains from the proposed method.</description><subject>Adaptation models</subject><subject>Estimation</subject><subject>Hidden Markov models</subject><subject>Noise</subject><subject>Noise compensation</subject><subject>Rapid adaptation</subject><subject>Robust ASR</subject><subject>Speaker adaptation</subject><subject>Speech</subject><subject>Speech recognition</subject><subject>Transforms</subject><issn>1520-6149</issn><issn>2379-190X</issn><isbn>9781457705380</isbn><isbn>1457705389</isbn><isbn>1457705397</isbn><isbn>9781457705373</isbn><isbn>9781457705397</isbn><isbn>1457705370</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2011</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNo1UEtOwzAUND-JUHqCbnyBhPfs2I6XqIIWqRKIgsSucmwXXGgc2WHB7QlQZjOL-Wg0hMwQKkTQV3fz6_X6oWKAWAldKwn1EbnAWigFgmt1TArGlS5Rw8sJmWrV_GsNnJICBYNSYq3PyTTnHYyQTCmhC7J8NH1wdBdDN9Dce_PuEzWdo10M2VMb973vshlC7Og2Jppi-5l_nd6-0eRtfO3Cj3pJzrbmI_vpgSfk-fbmab4sV_eLcf6qDKjEUEqrpHXg2nEScOWFl1ZDw533XmgmJTJjubBgmHEcG-layZHVzEHLrWz4hMz-esOY2PQp7E362hxO4d8MXFIQ</recordid><startdate>201105</startdate><enddate>201105</enddate><creator>Chin, K. K.</creator><creator>Haitian Xu</creator><creator>Gales, Mark J. 
F.</creator><creator>Breslin, Catherine</creator><creator>Knill, Kate</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>201105</creationdate><title>Rapid joint speaker and noise compensation for robust speech recognition</title><author>Chin, K. K. ; Haitian Xu ; Gales, Mark J. F. ; Breslin, Catherine ; Knill, Kate</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i175t-6c76cd0db520037e5e6c9083deee5926612ac35c0a2ad3186db631242d0b3c683</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2011</creationdate><topic>Adaptation models</topic><topic>Estimation</topic><topic>Hidden Markov models</topic><topic>Noise</topic><topic>Noise compensation</topic><topic>Rapid adaptation</topic><topic>Robust ASR</topic><topic>Speaker adaptation</topic><topic>Speech</topic><topic>Speech recognition</topic><topic>Transforms</topic><toplevel>online_resources</toplevel><creatorcontrib>Chin, K. K.</creatorcontrib><creatorcontrib>Haitian Xu</creatorcontrib><creatorcontrib>Gales, Mark J. F.</creatorcontrib><creatorcontrib>Breslin, Catherine</creatorcontrib><creatorcontrib>Knill, Kate</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Chin, K. K.</au><au>Haitian Xu</au><au>Gales, Mark J. 
F.</au><au>Breslin, Catherine</au><au>Knill, Kate</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Rapid joint speaker and noise compensation for robust speech recognition</atitle><btitle>2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)</btitle><stitle>ICASSP</stitle><date>2011-05</date><risdate>2011</risdate><spage>5500</spage><epage>5503</epage><pages>5500-5503</pages><issn>1520-6149</issn><eissn>2379-190X</eissn><isbn>9781457705380</isbn><isbn>1457705389</isbn><eisbn>1457705397</eisbn><eisbn>9781457705373</eisbn><eisbn>9781457705397</eisbn><eisbn>1457705370</eisbn><abstract>For speech recognition, mismatches between training and testing for speaker and noise are normally handled separately. The work presented in this paper aims at jointly applying speaker adaptation and model-based noise compensation by embedding speaker adaptation as part of the noise mismatch function. The proposed method gives a faster and more optimum adaptation compared to compensating for these two factors separately. It is also more consistent with respect to the basic assumptions of speaker and noise adaptation. Experimental results show significant and consistent gains from the proposed method.</abstract><pub>IEEE</pub><doi>10.1109/ICASSP.2011.5947604</doi><tpages>4</tpages></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 1520-6149
ispartof	2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011, p.5500-5503
issn	1520-6149; 2379-190X
language	eng
recordid	cdi_ieee_primary_5947604
source	IEEE Electronic Library (IEL) Conference Proceedings
subjects	Adaptation models; Estimation; Hidden Markov models; Noise; Noise compensation; Rapid adaptation; Robust ASR; Speaker adaptation; Speech; Speech recognition; Transforms
title	Rapid joint speaker and noise compensation for robust speech recognition
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T14%3A09%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Rapid%20joint%20speaker%20and%20noise%20compensation%20for%20robust%20speech%20recognition&rft.btitle=2011%20IEEE%20International%20Conference%20on%20Acoustics,%20Speech%20and%20Signal%20Processing%20(ICASSP)&rft.au=Chin,%20K.%20K.&rft.date=2011-05&rft.spage=5500&rft.epage=5503&rft.pages=5500-5503&rft.issn=1520-6149&rft.eissn=2379-190X&rft.isbn=9781457705380&rft.isbn_list=1457705389&rft_id=info:doi/10.1109/ICASSP.2011.5947604&rft_dat=%3Cieee_6IE%3E5947604%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=1457705397&rft.eisbn_list=9781457705373&rft.eisbn_list=9781457705397&rft.eisbn_list=1457705370&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=5947604&rfr_iscdi=true