DIFFERENTIAL ACOUSTIC MODEL REPRESENTATION AND LINEAR TRANSFORM-BASED ADAPTATION FOR EFFICIENT USER PROFILE UPDATE TECHNIQUES IN AUTOMATIC SPEECH RECOGNITION

A computer-implemented method is described for speaker adaptation in automatic speech recognition. Speech recognition data from a particular speaker is used for adaptation of an initial speech recognition acoustic model to produce a speaker adapted acoustic model. A speaker dependent differential ac...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	WILLETT, DANIEL, GOLLAN, CHRISTIAN
Format:	Patent
Sprache:	eng ; fre
Schlagworte:	ACOUSTICS MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	WILLETT, DANIEL GOLLAN, CHRISTIAN
description	A computer-implemented method is described for speaker adaptation in automatic speech recognition. Speech recognition data from a particular speaker is used for adaptation of an initial speech recognition acoustic model to produce a speaker adapted acoustic model. A speaker dependent differential acoustic model is determined that represents differences between the initial speech recognition acoustic model and the speaker adapted acoustic model. In addition, an approach is also disclosed to estimate speaker-specific feature or model transforms over multiple sessions. This is achieved by updating the previously estimated transform using only adaptation statistics of the current session. L'invention concerne un procédé mis en oeuvre par ordinateur pour une adaptation de haut-parleur dans une reconnaissance automatique de parole. Des données de reconnaissance de parole provenant d'un haut-parleur particulier sont utilisées pour l'adaptation d'un modèle acoustique de reconnaissance de parole initial pour produire un modèle acoustique adapté à un haut-parleur. Un modèle acoustique différentiel dépendant d'un haut-parleur est déterminé, lequel représente des différences entre le modèle acoustique de reconnaissance de parole initial et le modèle acoustique adapté à un haut-parleur. En outre, l'invention concerne également une approche pour estimer des transformations de caractéristique ou de modèle spécifique à un haut-parleur sur de multiples sessions. Ceci est obtenu par mise à jour de la transformation estimée précédemment à l'aide uniquement de statistiques d'adaptation de la session courante.
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_WO2013169232A1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>WO2013169232A1</sourcerecordid><originalsourceid>FETCH-epo_espacenet_WO2013169232A13</originalsourceid><addsrcrecordid>eNqNjc0KwjAQhHvxIOo7LHgu2BYEj2uy0YU2ifnBYykST6IFfR3f1RT6AJ4GZr6ZWRZfyUqRIx0YW0Bhog8soDOSWnBkHfmcYWCjAbWEljWhg-BQe2VcVx7RkwSUaGcqu0BKseBchOjJgXVGcUsQrcRAEEicNV8ieeC8GoPpcDr1lnKSX4U5aZ7G1sXiPjzeaTPrqtgqCuJcpvHVp_c43NIzffqrqXdVU-0PdVNj1fxH_QACMkVs</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>DIFFERENTIAL ACOUSTIC MODEL REPRESENTATION AND LINEAR TRANSFORM-BASED ADAPTATION FOR EFFICIENT USER PROFILE UPDATE TECHNIQUES IN AUTOMATIC SPEECH RECOGNITION</title><source>esp@cenet</source><creator>WILLETT, DANIEL ; GOLLAN, CHRISTIAN</creator><creatorcontrib>WILLETT, DANIEL ; GOLLAN, CHRISTIAN</creatorcontrib><description>A computer-implemented method is described for speaker adaptation in automatic speech recognition. Speech recognition data from a particular speaker is used for adaptation of an initial speech recognition acoustic model to produce a speaker adapted acoustic model. A speaker dependent differential acoustic model is determined that represents differences between the initial speech recognition acoustic model and the speaker adapted acoustic model. In addition, an approach is also disclosed to estimate speaker-specific feature or model transforms over multiple sessions. This is achieved by updating the previously estimated transform using only adaptation statistics of the current session. L'invention concerne un procédé mis en oeuvre par ordinateur pour une adaptation de haut-parleur dans une reconnaissance automatique de parole. Des données de reconnaissance de parole provenant d'un haut-parleur particulier sont utilisées pour l'adaptation d'un modèle acoustique de reconnaissance de parole initial pour produire un modèle acoustique adapté à un haut-parleur. Un modèle acoustique différentiel dépendant d'un haut-parleur est déterminé, lequel représente des différences entre le modèle acoustique de reconnaissance de parole initial et le modèle acoustique adapté à un haut-parleur. En outre, l'invention concerne également une approche pour estimer des transformations de caractéristique ou de modèle spécifique à un haut-parleur sur de multiples sessions. Ceci est obtenu par mise à jour de la transformation estimée précédemment à l'aide uniquement de statistiques d'adaptation de la session courante.</description><language>eng ; fre</language><subject>ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION</subject><creationdate>2013</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20131114&DB=EPODOC&CC=WO&NR=2013169232A1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20131114&DB=EPODOC&CC=WO&NR=2013169232A1$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>WILLETT, DANIEL</creatorcontrib><creatorcontrib>GOLLAN, CHRISTIAN</creatorcontrib><title>DIFFERENTIAL ACOUSTIC MODEL REPRESENTATION AND LINEAR TRANSFORM-BASED ADAPTATION FOR EFFICIENT USER PROFILE UPDATE TECHNIQUES IN AUTOMATIC SPEECH RECOGNITION</title><description>A computer-implemented method is described for speaker adaptation in automatic speech recognition. Speech recognition data from a particular speaker is used for adaptation of an initial speech recognition acoustic model to produce a speaker adapted acoustic model. A speaker dependent differential acoustic model is determined that represents differences between the initial speech recognition acoustic model and the speaker adapted acoustic model. In addition, an approach is also disclosed to estimate speaker-specific feature or model transforms over multiple sessions. This is achieved by updating the previously estimated transform using only adaptation statistics of the current session. L'invention concerne un procédé mis en oeuvre par ordinateur pour une adaptation de haut-parleur dans une reconnaissance automatique de parole. Des données de reconnaissance de parole provenant d'un haut-parleur particulier sont utilisées pour l'adaptation d'un modèle acoustique de reconnaissance de parole initial pour produire un modèle acoustique adapté à un haut-parleur. Un modèle acoustique différentiel dépendant d'un haut-parleur est déterminé, lequel représente des différences entre le modèle acoustique de reconnaissance de parole initial et le modèle acoustique adapté à un haut-parleur. En outre, l'invention concerne également une approche pour estimer des transformations de caractéristique ou de modèle spécifique à un haut-parleur sur de multiples sessions. Ceci est obtenu par mise à jour de la transformation estimée précédemment à l'aide uniquement de statistiques d'adaptation de la session courante.</description><subject>ACOUSTICS</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2013</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNjc0KwjAQhHvxIOo7LHgu2BYEj2uy0YU2ifnBYykST6IFfR3f1RT6AJ4GZr6ZWRZfyUqRIx0YW0Bhog8soDOSWnBkHfmcYWCjAbWEljWhg-BQe2VcVx7RkwSUaGcqu0BKseBchOjJgXVGcUsQrcRAEEicNV8ieeC8GoPpcDr1lnKSX4U5aZ7G1sXiPjzeaTPrqtgqCuJcpvHVp_c43NIzffqrqXdVU-0PdVNj1fxH_QACMkVs</recordid><startdate>20131114</startdate><enddate>20131114</enddate><creator>WILLETT, DANIEL</creator><creator>GOLLAN, CHRISTIAN</creator><scope>EVB</scope></search><sort><creationdate>20131114</creationdate><title>DIFFERENTIAL ACOUSTIC MODEL REPRESENTATION AND LINEAR TRANSFORM-BASED ADAPTATION FOR EFFICIENT USER PROFILE UPDATE TECHNIQUES IN AUTOMATIC SPEECH RECOGNITION</title><author>WILLETT, DANIEL ; GOLLAN, CHRISTIAN</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_WO2013169232A13</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng ; fre</language><creationdate>2013</creationdate><topic>ACOUSTICS</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><toplevel>online_resources</toplevel><creatorcontrib>WILLETT, DANIEL</creatorcontrib><creatorcontrib>GOLLAN, CHRISTIAN</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>WILLETT, DANIEL</au><au>GOLLAN, CHRISTIAN</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>DIFFERENTIAL ACOUSTIC MODEL REPRESENTATION AND LINEAR TRANSFORM-BASED ADAPTATION FOR EFFICIENT USER PROFILE UPDATE TECHNIQUES IN AUTOMATIC SPEECH RECOGNITION</title><date>2013-11-14</date><risdate>2013</risdate><abstract>A computer-implemented method is described for speaker adaptation in automatic speech recognition. Speech recognition data from a particular speaker is used for adaptation of an initial speech recognition acoustic model to produce a speaker adapted acoustic model. A speaker dependent differential acoustic model is determined that represents differences between the initial speech recognition acoustic model and the speaker adapted acoustic model. In addition, an approach is also disclosed to estimate speaker-specific feature or model transforms over multiple sessions. This is achieved by updating the previously estimated transform using only adaptation statistics of the current session. L'invention concerne un procédé mis en oeuvre par ordinateur pour une adaptation de haut-parleur dans une reconnaissance automatique de parole. Des données de reconnaissance de parole provenant d'un haut-parleur particulier sont utilisées pour l'adaptation d'un modèle acoustique de reconnaissance de parole initial pour produire un modèle acoustique adapté à un haut-parleur. Un modèle acoustique différentiel dépendant d'un haut-parleur est déterminé, lequel représente des différences entre le modèle acoustique de reconnaissance de parole initial et le modèle acoustique adapté à un haut-parleur. En outre, l'invention concerne également une approche pour estimer des transformations de caractéristique ou de modèle spécifique à un haut-parleur sur de multiples sessions. Ceci est obtenu par mise à jour de la transformation estimée précédemment à l'aide uniquement de statistiques d'adaptation de la session courante.</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier
ispartof
issn
language	eng ; fre
recordid	cdi_epo_espacenet_WO2013169232A1
source	esp@cenet
subjects	ACOUSTICS MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
title	DIFFERENTIAL ACOUSTIC MODEL REPRESENTATION AND LINEAR TRANSFORM-BASED ADAPTATION FOR EFFICIENT USER PROFILE UPDATE TECHNIQUES IN AUTOMATIC SPEECH RECOGNITION
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T09%3A09%3A48IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=WILLETT,%20DANIEL&rft.date=2013-11-14&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EWO2013169232A1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true