On the dynamic adaptation of language models based on dialogue information
► We present an approach to adapt dynamically the language models used by a speech recognizer using dialogue-based information. ► On each dialogue turn, the system interpolates a static LM with several content-dependent models related to semantic and intention information. ► The system obtains the i...
Gespeichert in:
Veröffentlicht in: | Expert systems with applications 2013-03, Vol.40 (4), p.1069-1085 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 1085 |
---|---|
container_issue | 4 |
container_start_page | 1069 |
container_title | Expert systems with applications |
container_volume | 40 |
creator | Lucas-Cuesta, J.M. Ferreiros, J. Fernández-Martı´nez, F. Echeverry, J.D. Lutfi, S. |
description | ► We present an approach to adapt dynamically the language models used by a speech recognizer using dialogue-based information. ► On each dialogue turn, the system interpolates a static LM with several content-dependent models related to semantic and intention information. ► The system obtains the interpolation weights using the posterior probabilities of concepts and goals estimated by the dialogue manager. ► We evaluate two strategies to obtain the models (one LM for each element, and several clustering approaches to group dialogue elements). ► The evaluation shows a significant reduction of the error rates when adapting the LMs in a dialogue system used to control a Hi-Fi audio system.
We present an approach to adapt dynamically the language models (LMs) used by a speech recognizer that is part of a spoken dialogue system. We have developed a grammar generation strategy that automatically adapts the LMs using the semantic information that the user provides (represented as dialogue concepts), together with the information regarding the intentions of the speaker (inferred by the dialogue manager, and represented as dialogue goals). We carry out the adaptation as a linear interpolation between a background LM, and one or more of the LMs associated to the dialogue elements (concepts or goals) addressed by the user. The interpolation weights between those models are automatically estimated on each dialogue turn, using measures such as the posterior probabilities of concepts and goals, estimated as part of the inference procedure to determine the actions to be carried out. We propose two approaches to handle the LMs related to concepts and goals. Whereas in the first one we estimate a LM for each one of them, in the second one we apply several clustering strategies to group together those elements that share some common properties, and estimate a LM for each cluster. Our evaluation shows how the system can estimate a dynamic model adapted to each dialogue turn, which helps to significantly improve the performance of the speech recognition, which leads to an improvement in both the language understanding and the dialogue management tasks. |
doi_str_mv | 10.1016/j.eswa.2012.08.029 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1701020991</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0957417412009906</els_id><sourcerecordid>1701020991</sourcerecordid><originalsourceid>FETCH-LOGICAL-c440t-d402390e04e5c07e7f349916e4a035d10af9dab1ec1339b1089bea104f3014763</originalsourceid><addsrcrecordid>eNqFkD2P1DAQhi0EEsvBH6Byg0ST3EzsjWOJBp34OHTSNVBbs_Zk8SqJFzsLun-Plz1RQjWFn_ed8SPEa4QWAfvrQ8vlF7UdYNfC0EJnn4gNDkY1vbHqqdiA3ZpGo9HPxYtSDgBoAMxGfLlf5PqdZXhYaI5eUqDjSmtMi0yjnGjZn2jPck6BpyJ3VDjI-hYiTWl_YhmXMeX5T-CleDbSVPjV47wS3z5--Hrzubm7_3R78_6u8VrD2gQNnbLAoHnrwbAZlbYWe9YEahsQaLSBdsgelbI7hMHumBD0qAC16dWVeHvpPeb048RldXMsnqd6LKdTcfVnCB3Uzv-jdbXuBzuoinYX1OdUSubRHXOcKT84BHd27A7u7NidHTsYXHVcQ28e-6l4msZMi4_lb7IzCNChqdy7C1cl8s_I2RUfefEcYma_upDiv9b8BkU3kL0</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1349468983</pqid></control><display><type>article</type><title>On the dynamic adaptation of language models based on dialogue information</title><source>Elsevier ScienceDirect Journals Complete</source><creator>Lucas-Cuesta, J.M. ; Ferreiros, J. ; Fernández-Martı´nez, F. ; Echeverry, J.D. ; Lutfi, S.</creator><creatorcontrib>Lucas-Cuesta, J.M. ; Ferreiros, J. ; Fernández-Martı´nez, F. ; Echeverry, J.D. ; Lutfi, S.</creatorcontrib><description>► We present an approach to adapt dynamically the language models used by a speech recognizer using dialogue-based information. ► On each dialogue turn, the system interpolates a static LM with several content-dependent models related to semantic and intention information. ► The system obtains the interpolation weights using the posterior probabilities of concepts and goals estimated by the dialogue manager. ► We evaluate two strategies to obtain the models (one LM for each element, and several clustering approaches to group dialogue elements). ► The evaluation shows a significant reduction of the error rates when adapting the LMs in a dialogue system used to control a Hi-Fi audio system.
We present an approach to adapt dynamically the language models (LMs) used by a speech recognizer that is part of a spoken dialogue system. We have developed a grammar generation strategy that automatically adapts the LMs using the semantic information that the user provides (represented as dialogue concepts), together with the information regarding the intentions of the speaker (inferred by the dialogue manager, and represented as dialogue goals). We carry out the adaptation as a linear interpolation between a background LM, and one or more of the LMs associated to the dialogue elements (concepts or goals) addressed by the user. The interpolation weights between those models are automatically estimated on each dialogue turn, using measures such as the posterior probabilities of concepts and goals, estimated as part of the inference procedure to determine the actions to be carried out. We propose two approaches to handle the LMs related to concepts and goals. Whereas in the first one we estimate a LM for each one of them, in the second one we apply several clustering strategies to group together those elements that share some common properties, and estimate a LM for each cluster. Our evaluation shows how the system can estimate a dynamic model adapted to each dialogue turn, which helps to significantly improve the performance of the speech recognition, which leads to an improvement in both the language understanding and the dialogue management tasks.</description><identifier>ISSN: 0957-4174</identifier><identifier>EISSN: 1873-6793</identifier><identifier>DOI: 10.1016/j.eswa.2012.08.029</identifier><language>eng</language><publisher>Amsterdam: Elsevier Ltd</publisher><subject>Acoustic signal processing ; Acoustics ; Adaptation ; Applied sciences ; Artificial intelligence ; Computer science; control theory; systems ; Computer systems and distributed systems. User interface ; Dialogue-based information ; Dynamic adaptation ; Dynamical systems ; Dynamics ; Estimates ; Exact sciences and technology ; Firm modelling ; Fundamental areas of phenomenology (including applications) ; Inference ; Interpolation ; Language models ; Operational research and scientific management ; Operational research. Management science ; Physics ; Semantic clustering ; Software ; Speech and sound recognition and synthesis. Linguistics ; Speech recognition ; Spoken dialogue system ; Strategy</subject><ispartof>Expert systems with applications, 2013-03, Vol.40 (4), p.1069-1085</ispartof><rights>2012 Elsevier Ltd</rights><rights>2014 INIST-CNRS</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c440t-d402390e04e5c07e7f349916e4a035d10af9dab1ec1339b1089bea104f3014763</citedby><cites>FETCH-LOGICAL-c440t-d402390e04e5c07e7f349916e4a035d10af9dab1ec1339b1089bea104f3014763</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.eswa.2012.08.029$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3550,27924,27925,45995</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=27100217$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Lucas-Cuesta, J.M.</creatorcontrib><creatorcontrib>Ferreiros, J.</creatorcontrib><creatorcontrib>Fernández-Martı´nez, F.</creatorcontrib><creatorcontrib>Echeverry, J.D.</creatorcontrib><creatorcontrib>Lutfi, S.</creatorcontrib><title>On the dynamic adaptation of language models based on dialogue information</title><title>Expert systems with applications</title><description>► We present an approach to adapt dynamically the language models used by a speech recognizer using dialogue-based information. ► On each dialogue turn, the system interpolates a static LM with several content-dependent models related to semantic and intention information. ► The system obtains the interpolation weights using the posterior probabilities of concepts and goals estimated by the dialogue manager. ► We evaluate two strategies to obtain the models (one LM for each element, and several clustering approaches to group dialogue elements). ► The evaluation shows a significant reduction of the error rates when adapting the LMs in a dialogue system used to control a Hi-Fi audio system.
We present an approach to adapt dynamically the language models (LMs) used by a speech recognizer that is part of a spoken dialogue system. We have developed a grammar generation strategy that automatically adapts the LMs using the semantic information that the user provides (represented as dialogue concepts), together with the information regarding the intentions of the speaker (inferred by the dialogue manager, and represented as dialogue goals). We carry out the adaptation as a linear interpolation between a background LM, and one or more of the LMs associated to the dialogue elements (concepts or goals) addressed by the user. The interpolation weights between those models are automatically estimated on each dialogue turn, using measures such as the posterior probabilities of concepts and goals, estimated as part of the inference procedure to determine the actions to be carried out. We propose two approaches to handle the LMs related to concepts and goals. Whereas in the first one we estimate a LM for each one of them, in the second one we apply several clustering strategies to group together those elements that share some common properties, and estimate a LM for each cluster. Our evaluation shows how the system can estimate a dynamic model adapted to each dialogue turn, which helps to significantly improve the performance of the speech recognition, which leads to an improvement in both the language understanding and the dialogue management tasks.</description><subject>Acoustic signal processing</subject><subject>Acoustics</subject><subject>Adaptation</subject><subject>Applied sciences</subject><subject>Artificial intelligence</subject><subject>Computer science; control theory; systems</subject><subject>Computer systems and distributed systems. User interface</subject><subject>Dialogue-based information</subject><subject>Dynamic adaptation</subject><subject>Dynamical systems</subject><subject>Dynamics</subject><subject>Estimates</subject><subject>Exact sciences and technology</subject><subject>Firm modelling</subject><subject>Fundamental areas of phenomenology (including applications)</subject><subject>Inference</subject><subject>Interpolation</subject><subject>Language models</subject><subject>Operational research and scientific management</subject><subject>Operational research. Management science</subject><subject>Physics</subject><subject>Semantic clustering</subject><subject>Software</subject><subject>Speech and sound recognition and synthesis. Linguistics</subject><subject>Speech recognition</subject><subject>Spoken dialogue system</subject><subject>Strategy</subject><issn>0957-4174</issn><issn>1873-6793</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2013</creationdate><recordtype>article</recordtype><recordid>eNqFkD2P1DAQhi0EEsvBH6Byg0ST3EzsjWOJBp34OHTSNVBbs_Zk8SqJFzsLun-Plz1RQjWFn_ed8SPEa4QWAfvrQ8vlF7UdYNfC0EJnn4gNDkY1vbHqqdiA3ZpGo9HPxYtSDgBoAMxGfLlf5PqdZXhYaI5eUqDjSmtMi0yjnGjZn2jPck6BpyJ3VDjI-hYiTWl_YhmXMeX5T-CleDbSVPjV47wS3z5--Hrzubm7_3R78_6u8VrD2gQNnbLAoHnrwbAZlbYWe9YEahsQaLSBdsgelbI7hMHumBD0qAC16dWVeHvpPeb048RldXMsnqd6LKdTcfVnCB3Uzv-jdbXuBzuoinYX1OdUSubRHXOcKT84BHd27A7u7NidHTsYXHVcQ28e-6l4msZMi4_lb7IzCNChqdy7C1cl8s_I2RUfefEcYma_upDiv9b8BkU3kL0</recordid><startdate>20130301</startdate><enddate>20130301</enddate><creator>Lucas-Cuesta, J.M.</creator><creator>Ferreiros, J.</creator><creator>Fernández-Martı´nez, F.</creator><creator>Echeverry, J.D.</creator><creator>Lutfi, S.</creator><general>Elsevier Ltd</general><general>Elsevier</general><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20130301</creationdate><title>On the dynamic adaptation of language models based on dialogue information</title><author>Lucas-Cuesta, J.M. ; Ferreiros, J. ; Fernández-Martı´nez, F. ; Echeverry, J.D. ; Lutfi, S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c440t-d402390e04e5c07e7f349916e4a035d10af9dab1ec1339b1089bea104f3014763</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2013</creationdate><topic>Acoustic signal processing</topic><topic>Acoustics</topic><topic>Adaptation</topic><topic>Applied sciences</topic><topic>Artificial intelligence</topic><topic>Computer science; control theory; systems</topic><topic>Computer systems and distributed systems. User interface</topic><topic>Dialogue-based information</topic><topic>Dynamic adaptation</topic><topic>Dynamical systems</topic><topic>Dynamics</topic><topic>Estimates</topic><topic>Exact sciences and technology</topic><topic>Firm modelling</topic><topic>Fundamental areas of phenomenology (including applications)</topic><topic>Inference</topic><topic>Interpolation</topic><topic>Language models</topic><topic>Operational research and scientific management</topic><topic>Operational research. Management science</topic><topic>Physics</topic><topic>Semantic clustering</topic><topic>Software</topic><topic>Speech and sound recognition and synthesis. Linguistics</topic><topic>Speech recognition</topic><topic>Spoken dialogue system</topic><topic>Strategy</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Lucas-Cuesta, J.M.</creatorcontrib><creatorcontrib>Ferreiros, J.</creatorcontrib><creatorcontrib>Fernández-Martı´nez, F.</creatorcontrib><creatorcontrib>Echeverry, J.D.</creatorcontrib><creatorcontrib>Lutfi, S.</creatorcontrib><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Expert systems with applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Lucas-Cuesta, J.M.</au><au>Ferreiros, J.</au><au>Fernández-Martı´nez, F.</au><au>Echeverry, J.D.</au><au>Lutfi, S.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>On the dynamic adaptation of language models based on dialogue information</atitle><jtitle>Expert systems with applications</jtitle><date>2013-03-01</date><risdate>2013</risdate><volume>40</volume><issue>4</issue><spage>1069</spage><epage>1085</epage><pages>1069-1085</pages><issn>0957-4174</issn><eissn>1873-6793</eissn><abstract>► We present an approach to adapt dynamically the language models used by a speech recognizer using dialogue-based information. ► On each dialogue turn, the system interpolates a static LM with several content-dependent models related to semantic and intention information. ► The system obtains the interpolation weights using the posterior probabilities of concepts and goals estimated by the dialogue manager. ► We evaluate two strategies to obtain the models (one LM for each element, and several clustering approaches to group dialogue elements). ► The evaluation shows a significant reduction of the error rates when adapting the LMs in a dialogue system used to control a Hi-Fi audio system.
We present an approach to adapt dynamically the language models (LMs) used by a speech recognizer that is part of a spoken dialogue system. We have developed a grammar generation strategy that automatically adapts the LMs using the semantic information that the user provides (represented as dialogue concepts), together with the information regarding the intentions of the speaker (inferred by the dialogue manager, and represented as dialogue goals). We carry out the adaptation as a linear interpolation between a background LM, and one or more of the LMs associated to the dialogue elements (concepts or goals) addressed by the user. The interpolation weights between those models are automatically estimated on each dialogue turn, using measures such as the posterior probabilities of concepts and goals, estimated as part of the inference procedure to determine the actions to be carried out. We propose two approaches to handle the LMs related to concepts and goals. Whereas in the first one we estimate a LM for each one of them, in the second one we apply several clustering strategies to group together those elements that share some common properties, and estimate a LM for each cluster. Our evaluation shows how the system can estimate a dynamic model adapted to each dialogue turn, which helps to significantly improve the performance of the speech recognition, which leads to an improvement in both the language understanding and the dialogue management tasks.</abstract><cop>Amsterdam</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.eswa.2012.08.029</doi><tpages>17</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0957-4174 |
ispartof | Expert systems with applications, 2013-03, Vol.40 (4), p.1069-1085 |
issn | 0957-4174 1873-6793 |
language | eng |
recordid | cdi_proquest_miscellaneous_1701020991 |
source | Elsevier ScienceDirect Journals Complete |
subjects | Acoustic signal processing Acoustics Adaptation Applied sciences Artificial intelligence Computer science control theory systems Computer systems and distributed systems. User interface Dialogue-based information Dynamic adaptation Dynamical systems Dynamics Estimates Exact sciences and technology Firm modelling Fundamental areas of phenomenology (including applications) Inference Interpolation Language models Operational research and scientific management Operational research. Management science Physics Semantic clustering Software Speech and sound recognition and synthesis. Linguistics Speech recognition Spoken dialogue system Strategy |
title | On the dynamic adaptation of language models based on dialogue information |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T07%3A01%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=On%20the%20dynamic%20adaptation%20of%20language%20models%20based%20on%20dialogue%20information&rft.jtitle=Expert%20systems%20with%20applications&rft.au=Lucas-Cuesta,%20J.M.&rft.date=2013-03-01&rft.volume=40&rft.issue=4&rft.spage=1069&rft.epage=1085&rft.pages=1069-1085&rft.issn=0957-4174&rft.eissn=1873-6793&rft_id=info:doi/10.1016/j.eswa.2012.08.029&rft_dat=%3Cproquest_cross%3E1701020991%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1349468983&rft_id=info:pmid/&rft_els_id=S0957417412009906&rfr_iscdi=true |