On the dynamic adaptation of language models based on dialogue information

► We present an approach to adapt dynamically the language models used by a speech recognizer using dialogue-based information. ► On each dialogue turn, the system interpolates a static LM with several content-dependent models related to semantic and intention information. ► The system obtains the i...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Expert systems with applications 2013-03, Vol.40 (4), p.1069-1085
Hauptverfasser: Lucas-Cuesta, J.M., Ferreiros, J., Fernández-Martı´nez, F., Echeverry, J.D., Lutfi, S.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1085
container_issue 4
container_start_page 1069
container_title Expert systems with applications
container_volume 40
creator Lucas-Cuesta, J.M.
Ferreiros, J.
Fernández-Martı´nez, F.
Echeverry, J.D.
Lutfi, S.
description ► We present an approach to adapt dynamically the language models used by a speech recognizer using dialogue-based information. ► On each dialogue turn, the system interpolates a static LM with several content-dependent models related to semantic and intention information. ► The system obtains the interpolation weights using the posterior probabilities of concepts and goals estimated by the dialogue manager. ► We evaluate two strategies to obtain the models (one LM for each element, and several clustering approaches to group dialogue elements). ► The evaluation shows a significant reduction of the error rates when adapting the LMs in a dialogue system used to control a Hi-Fi audio system. We present an approach to adapt dynamically the language models (LMs) used by a speech recognizer that is part of a spoken dialogue system. We have developed a grammar generation strategy that automatically adapts the LMs using the semantic information that the user provides (represented as dialogue concepts), together with the information regarding the intentions of the speaker (inferred by the dialogue manager, and represented as dialogue goals). We carry out the adaptation as a linear interpolation between a background LM, and one or more of the LMs associated to the dialogue elements (concepts or goals) addressed by the user. The interpolation weights between those models are automatically estimated on each dialogue turn, using measures such as the posterior probabilities of concepts and goals, estimated as part of the inference procedure to determine the actions to be carried out. We propose two approaches to handle the LMs related to concepts and goals. Whereas in the first one we estimate a LM for each one of them, in the second one we apply several clustering strategies to group together those elements that share some common properties, and estimate a LM for each cluster. Our evaluation shows how the system can estimate a dynamic model adapted to each dialogue turn, which helps to significantly improve the performance of the speech recognition, which leads to an improvement in both the language understanding and the dialogue management tasks.
doi_str_mv 10.1016/j.eswa.2012.08.029
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1701020991</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0957417412009906</els_id><sourcerecordid>1701020991</sourcerecordid><originalsourceid>FETCH-LOGICAL-c440t-d402390e04e5c07e7f349916e4a035d10af9dab1ec1339b1089bea104f3014763</originalsourceid><addsrcrecordid>eNqFkD2P1DAQhi0EEsvBH6Byg0ST3EzsjWOJBp34OHTSNVBbs_Zk8SqJFzsLun-Plz1RQjWFn_ed8SPEa4QWAfvrQ8vlF7UdYNfC0EJnn4gNDkY1vbHqqdiA3ZpGo9HPxYtSDgBoAMxGfLlf5PqdZXhYaI5eUqDjSmtMi0yjnGjZn2jPck6BpyJ3VDjI-hYiTWl_YhmXMeX5T-CleDbSVPjV47wS3z5--Hrzubm7_3R78_6u8VrD2gQNnbLAoHnrwbAZlbYWe9YEahsQaLSBdsgelbI7hMHumBD0qAC16dWVeHvpPeb048RldXMsnqd6LKdTcfVnCB3Uzv-jdbXuBzuoinYX1OdUSubRHXOcKT84BHd27A7u7NidHTsYXHVcQ28e-6l4msZMi4_lb7IzCNChqdy7C1cl8s_I2RUfefEcYma_upDiv9b8BkU3kL0</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1349468983</pqid></control><display><type>article</type><title>On the dynamic adaptation of language models based on dialogue information</title><source>Elsevier ScienceDirect Journals Complete</source><creator>Lucas-Cuesta, J.M. ; Ferreiros, J. ; Fernández-Martı´nez, F. ; Echeverry, J.D. ; Lutfi, S.</creator><creatorcontrib>Lucas-Cuesta, J.M. ; Ferreiros, J. ; Fernández-Martı´nez, F. ; Echeverry, J.D. ; Lutfi, S.</creatorcontrib><description>► We present an approach to adapt dynamically the language models used by a speech recognizer using dialogue-based information. ► On each dialogue turn, the system interpolates a static LM with several content-dependent models related to semantic and intention information. ► The system obtains the interpolation weights using the posterior probabilities of concepts and goals estimated by the dialogue manager. ► We evaluate two strategies to obtain the models (one LM for each element, and several clustering approaches to group dialogue elements). ► The evaluation shows a significant reduction of the error rates when adapting the LMs in a dialogue system used to control a Hi-Fi audio system. We present an approach to adapt dynamically the language models (LMs) used by a speech recognizer that is part of a spoken dialogue system. We have developed a grammar generation strategy that automatically adapts the LMs using the semantic information that the user provides (represented as dialogue concepts), together with the information regarding the intentions of the speaker (inferred by the dialogue manager, and represented as dialogue goals). We carry out the adaptation as a linear interpolation between a background LM, and one or more of the LMs associated to the dialogue elements (concepts or goals) addressed by the user. The interpolation weights between those models are automatically estimated on each dialogue turn, using measures such as the posterior probabilities of concepts and goals, estimated as part of the inference procedure to determine the actions to be carried out. We propose two approaches to handle the LMs related to concepts and goals. Whereas in the first one we estimate a LM for each one of them, in the second one we apply several clustering strategies to group together those elements that share some common properties, and estimate a LM for each cluster. Our evaluation shows how the system can estimate a dynamic model adapted to each dialogue turn, which helps to significantly improve the performance of the speech recognition, which leads to an improvement in both the language understanding and the dialogue management tasks.</description><identifier>ISSN: 0957-4174</identifier><identifier>EISSN: 1873-6793</identifier><identifier>DOI: 10.1016/j.eswa.2012.08.029</identifier><language>eng</language><publisher>Amsterdam: Elsevier Ltd</publisher><subject>Acoustic signal processing ; Acoustics ; Adaptation ; Applied sciences ; Artificial intelligence ; Computer science; control theory; systems ; Computer systems and distributed systems. User interface ; Dialogue-based information ; Dynamic adaptation ; Dynamical systems ; Dynamics ; Estimates ; Exact sciences and technology ; Firm modelling ; Fundamental areas of phenomenology (including applications) ; Inference ; Interpolation ; Language models ; Operational research and scientific management ; Operational research. Management science ; Physics ; Semantic clustering ; Software ; Speech and sound recognition and synthesis. Linguistics ; Speech recognition ; Spoken dialogue system ; Strategy</subject><ispartof>Expert systems with applications, 2013-03, Vol.40 (4), p.1069-1085</ispartof><rights>2012 Elsevier Ltd</rights><rights>2014 INIST-CNRS</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c440t-d402390e04e5c07e7f349916e4a035d10af9dab1ec1339b1089bea104f3014763</citedby><cites>FETCH-LOGICAL-c440t-d402390e04e5c07e7f349916e4a035d10af9dab1ec1339b1089bea104f3014763</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.eswa.2012.08.029$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3550,27924,27925,45995</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=27100217$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Lucas-Cuesta, J.M.</creatorcontrib><creatorcontrib>Ferreiros, J.</creatorcontrib><creatorcontrib>Fernández-Martı´nez, F.</creatorcontrib><creatorcontrib>Echeverry, J.D.</creatorcontrib><creatorcontrib>Lutfi, S.</creatorcontrib><title>On the dynamic adaptation of language models based on dialogue information</title><title>Expert systems with applications</title><description>► We present an approach to adapt dynamically the language models used by a speech recognizer using dialogue-based information. ► On each dialogue turn, the system interpolates a static LM with several content-dependent models related to semantic and intention information. ► The system obtains the interpolation weights using the posterior probabilities of concepts and goals estimated by the dialogue manager. ► We evaluate two strategies to obtain the models (one LM for each element, and several clustering approaches to group dialogue elements). ► The evaluation shows a significant reduction of the error rates when adapting the LMs in a dialogue system used to control a Hi-Fi audio system. We present an approach to adapt dynamically the language models (LMs) used by a speech recognizer that is part of a spoken dialogue system. We have developed a grammar generation strategy that automatically adapts the LMs using the semantic information that the user provides (represented as dialogue concepts), together with the information regarding the intentions of the speaker (inferred by the dialogue manager, and represented as dialogue goals). We carry out the adaptation as a linear interpolation between a background LM, and one or more of the LMs associated to the dialogue elements (concepts or goals) addressed by the user. The interpolation weights between those models are automatically estimated on each dialogue turn, using measures such as the posterior probabilities of concepts and goals, estimated as part of the inference procedure to determine the actions to be carried out. We propose two approaches to handle the LMs related to concepts and goals. Whereas in the first one we estimate a LM for each one of them, in the second one we apply several clustering strategies to group together those elements that share some common properties, and estimate a LM for each cluster. Our evaluation shows how the system can estimate a dynamic model adapted to each dialogue turn, which helps to significantly improve the performance of the speech recognition, which leads to an improvement in both the language understanding and the dialogue management tasks.</description><subject>Acoustic signal processing</subject><subject>Acoustics</subject><subject>Adaptation</subject><subject>Applied sciences</subject><subject>Artificial intelligence</subject><subject>Computer science; control theory; systems</subject><subject>Computer systems and distributed systems. User interface</subject><subject>Dialogue-based information</subject><subject>Dynamic adaptation</subject><subject>Dynamical systems</subject><subject>Dynamics</subject><subject>Estimates</subject><subject>Exact sciences and technology</subject><subject>Firm modelling</subject><subject>Fundamental areas of phenomenology (including applications)</subject><subject>Inference</subject><subject>Interpolation</subject><subject>Language models</subject><subject>Operational research and scientific management</subject><subject>Operational research. Management science</subject><subject>Physics</subject><subject>Semantic clustering</subject><subject>Software</subject><subject>Speech and sound recognition and synthesis. Linguistics</subject><subject>Speech recognition</subject><subject>Spoken dialogue system</subject><subject>Strategy</subject><issn>0957-4174</issn><issn>1873-6793</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2013</creationdate><recordtype>article</recordtype><recordid>eNqFkD2P1DAQhi0EEsvBH6Byg0ST3EzsjWOJBp34OHTSNVBbs_Zk8SqJFzsLun-Plz1RQjWFn_ed8SPEa4QWAfvrQ8vlF7UdYNfC0EJnn4gNDkY1vbHqqdiA3ZpGo9HPxYtSDgBoAMxGfLlf5PqdZXhYaI5eUqDjSmtMi0yjnGjZn2jPck6BpyJ3VDjI-hYiTWl_YhmXMeX5T-CleDbSVPjV47wS3z5--Hrzubm7_3R78_6u8VrD2gQNnbLAoHnrwbAZlbYWe9YEahsQaLSBdsgelbI7hMHumBD0qAC16dWVeHvpPeb048RldXMsnqd6LKdTcfVnCB3Uzv-jdbXuBzuoinYX1OdUSubRHXOcKT84BHd27A7u7NidHTsYXHVcQ28e-6l4msZMi4_lb7IzCNChqdy7C1cl8s_I2RUfefEcYma_upDiv9b8BkU3kL0</recordid><startdate>20130301</startdate><enddate>20130301</enddate><creator>Lucas-Cuesta, J.M.</creator><creator>Ferreiros, J.</creator><creator>Fernández-Martı´nez, F.</creator><creator>Echeverry, J.D.</creator><creator>Lutfi, S.</creator><general>Elsevier Ltd</general><general>Elsevier</general><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20130301</creationdate><title>On the dynamic adaptation of language models based on dialogue information</title><author>Lucas-Cuesta, J.M. ; Ferreiros, J. ; Fernández-Martı´nez, F. ; Echeverry, J.D. ; Lutfi, S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c440t-d402390e04e5c07e7f349916e4a035d10af9dab1ec1339b1089bea104f3014763</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2013</creationdate><topic>Acoustic signal processing</topic><topic>Acoustics</topic><topic>Adaptation</topic><topic>Applied sciences</topic><topic>Artificial intelligence</topic><topic>Computer science; control theory; systems</topic><topic>Computer systems and distributed systems. User interface</topic><topic>Dialogue-based information</topic><topic>Dynamic adaptation</topic><topic>Dynamical systems</topic><topic>Dynamics</topic><topic>Estimates</topic><topic>Exact sciences and technology</topic><topic>Firm modelling</topic><topic>Fundamental areas of phenomenology (including applications)</topic><topic>Inference</topic><topic>Interpolation</topic><topic>Language models</topic><topic>Operational research and scientific management</topic><topic>Operational research. Management science</topic><topic>Physics</topic><topic>Semantic clustering</topic><topic>Software</topic><topic>Speech and sound recognition and synthesis. Linguistics</topic><topic>Speech recognition</topic><topic>Spoken dialogue system</topic><topic>Strategy</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Lucas-Cuesta, J.M.</creatorcontrib><creatorcontrib>Ferreiros, J.</creatorcontrib><creatorcontrib>Fernández-Martı´nez, F.</creatorcontrib><creatorcontrib>Echeverry, J.D.</creatorcontrib><creatorcontrib>Lutfi, S.</creatorcontrib><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Expert systems with applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Lucas-Cuesta, J.M.</au><au>Ferreiros, J.</au><au>Fernández-Martı´nez, F.</au><au>Echeverry, J.D.</au><au>Lutfi, S.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>On the dynamic adaptation of language models based on dialogue information</atitle><jtitle>Expert systems with applications</jtitle><date>2013-03-01</date><risdate>2013</risdate><volume>40</volume><issue>4</issue><spage>1069</spage><epage>1085</epage><pages>1069-1085</pages><issn>0957-4174</issn><eissn>1873-6793</eissn><abstract>► We present an approach to adapt dynamically the language models used by a speech recognizer using dialogue-based information. ► On each dialogue turn, the system interpolates a static LM with several content-dependent models related to semantic and intention information. ► The system obtains the interpolation weights using the posterior probabilities of concepts and goals estimated by the dialogue manager. ► We evaluate two strategies to obtain the models (one LM for each element, and several clustering approaches to group dialogue elements). ► The evaluation shows a significant reduction of the error rates when adapting the LMs in a dialogue system used to control a Hi-Fi audio system. We present an approach to adapt dynamically the language models (LMs) used by a speech recognizer that is part of a spoken dialogue system. We have developed a grammar generation strategy that automatically adapts the LMs using the semantic information that the user provides (represented as dialogue concepts), together with the information regarding the intentions of the speaker (inferred by the dialogue manager, and represented as dialogue goals). We carry out the adaptation as a linear interpolation between a background LM, and one or more of the LMs associated to the dialogue elements (concepts or goals) addressed by the user. The interpolation weights between those models are automatically estimated on each dialogue turn, using measures such as the posterior probabilities of concepts and goals, estimated as part of the inference procedure to determine the actions to be carried out. We propose two approaches to handle the LMs related to concepts and goals. Whereas in the first one we estimate a LM for each one of them, in the second one we apply several clustering strategies to group together those elements that share some common properties, and estimate a LM for each cluster. Our evaluation shows how the system can estimate a dynamic model adapted to each dialogue turn, which helps to significantly improve the performance of the speech recognition, which leads to an improvement in both the language understanding and the dialogue management tasks.</abstract><cop>Amsterdam</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.eswa.2012.08.029</doi><tpages>17</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0957-4174
ispartof Expert systems with applications, 2013-03, Vol.40 (4), p.1069-1085
issn 0957-4174
1873-6793
language eng
recordid cdi_proquest_miscellaneous_1701020991
source Elsevier ScienceDirect Journals Complete
subjects Acoustic signal processing
Acoustics
Adaptation
Applied sciences
Artificial intelligence
Computer science
control theory
systems
Computer systems and distributed systems. User interface
Dialogue-based information
Dynamic adaptation
Dynamical systems
Dynamics
Estimates
Exact sciences and technology
Firm modelling
Fundamental areas of phenomenology (including applications)
Inference
Interpolation
Language models
Operational research and scientific management
Operational research. Management science
Physics
Semantic clustering
Software
Speech and sound recognition and synthesis. Linguistics
Speech recognition
Spoken dialogue system
Strategy
title On the dynamic adaptation of language models based on dialogue information
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T07%3A01%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=On%20the%20dynamic%20adaptation%20of%20language%20models%20based%20on%20dialogue%20information&rft.jtitle=Expert%20systems%20with%20applications&rft.au=Lucas-Cuesta,%20J.M.&rft.date=2013-03-01&rft.volume=40&rft.issue=4&rft.spage=1069&rft.epage=1085&rft.pages=1069-1085&rft.issn=0957-4174&rft.eissn=1873-6793&rft_id=info:doi/10.1016/j.eswa.2012.08.029&rft_dat=%3Cproquest_cross%3E1701020991%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1349468983&rft_id=info:pmid/&rft_els_id=S0957417412009906&rfr_iscdi=true