Spoken Language Understanding Strategies on the France Telecom 3000 Voice Agency Corpus
Telephone services are now deployed that allow users to react to telephone prompts in spoken natural language. These systems have limited domain semantics and dialogue strategies which are represented by finite state diagrams. Most of these systems adopt a sequential approach where the automatic spe...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Tagungsbericht |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | IV-12 |
---|---|
container_issue | |
container_start_page | IV-9 |
container_title | |
container_volume | 4 |
creator | Damnati, G. Bechet, F. De Mori, R. |
description | Telephone services are now deployed that allow users to react to telephone prompts in spoken natural language. These systems have limited domain semantics and dialogue strategies which are represented by finite state diagrams. Most of these systems adopt a sequential approach where the automatic speech recognition (ASR) process, the spoken language understanding (SLU) process and the dialogue management (DM) are separate processes. In the framework of the France Telecom 3000 voice service, we propose in this paper to study several strategies in order to integrate more closely these three processes: ASR, SLU, and DM. By means of a finite state machine paradigm encoding the different models used by these three levels we show how the search for the best sequence of dialogue states can be done simultaneously at the word, concept, interpretation and dialogue state levels. |
doi_str_mv | 10.1109/ICASSP.2007.367150 |
format | Conference Proceeding |
fullrecord | <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_4218024</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>4218024</ieee_id><sourcerecordid>4218024</sourcerecordid><originalsourceid>FETCH-LOGICAL-i175t-8057c142a02cd63e8fa348fa7e095857d6f6eea8c233d24faf036b649d33b2f73</originalsourceid><addsrcrecordid>eNpVj8tKw0AYhccbGGtfQDfzAon_XDKXZQlWhYBCWnVXppM_MdpOQpIu-vYN6MbNOXAOfJxDyB2DhDGwDy_ZoijeEg6gE6E0S-GMzK02THIpQXOjzknEhbYxs_B58a_T9pJELOUQKybtNbkZhm8AMFqaiHwUXfuDgeYu1AdXI12HEvthdKFsQk2LsXcj1g0OtA10_EK67F3wSFe4Q9_uqZhI9L1tpmhRY_BHmrV9dxhuyVXldgPO_3xG1svHVfYc569P05k8bphOx9hAqv001AH3pRJoKifkJBrBpibVpaoUojOeC1FyWbkKhNoqaUshtrzSYkbuf7kNIm66vtm7_riRnBngUpwAfOtV1Q</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Spoken Language Understanding Strategies on the France Telecom 3000 Voice Agency Corpus</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Damnati, G. ; Bechet, F. ; De Mori, R.</creator><creatorcontrib>Damnati, G. ; Bechet, F. ; De Mori, R.</creatorcontrib><description>Telephone services are now deployed that allow users to react to telephone prompts in spoken natural language. These systems have limited domain semantics and dialogue strategies which are represented by finite state diagrams. Most of these systems adopt a sequential approach where the automatic speech recognition (ASR) process, the spoken language understanding (SLU) process and the dialogue management (DM) are separate processes. In the framework of the France Telecom 3000 voice service, we propose in this paper to study several strategies in order to integrate more closely these three processes: ASR, SLU, and DM. By means of a finite state machine paradigm encoding the different models used by these three levels we show how the search for the best sequence of dialogue states can be done simultaneously at the word, concept, interpretation and dialogue state levels.</description><identifier>ISSN: 1520-6149</identifier><identifier>ISBN: 9781424407279</identifier><identifier>ISBN: 1424407273</identifier><identifier>EISSN: 2379-190X</identifier><identifier>EISBN: 9781424407286</identifier><identifier>EISBN: 1424407281</identifier><identifier>DOI: 10.1109/ICASSP.2007.367150</identifier><language>eng</language><publisher>IEEE</publisher><subject>Automata ; Automatic speech recognition ; Delta modulation ; Encoding ; Language Models ; Lattices ; Natural languages ; Predictive models ; Research and development ; Spoken Dialogue Systems ; Spoken Language Understanding ; Telecommunications ; Telephony</subject><ispartof>2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07, 2007, Vol.4, p.IV-9-IV-12</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/4218024$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2058,27925,54920</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/4218024$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Damnati, G.</creatorcontrib><creatorcontrib>Bechet, F.</creatorcontrib><creatorcontrib>De Mori, R.</creatorcontrib><title>Spoken Language Understanding Strategies on the France Telecom 3000 Voice Agency Corpus</title><title>2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07</title><addtitle>ICASSP</addtitle><description>Telephone services are now deployed that allow users to react to telephone prompts in spoken natural language. These systems have limited domain semantics and dialogue strategies which are represented by finite state diagrams. Most of these systems adopt a sequential approach where the automatic speech recognition (ASR) process, the spoken language understanding (SLU) process and the dialogue management (DM) are separate processes. In the framework of the France Telecom 3000 voice service, we propose in this paper to study several strategies in order to integrate more closely these three processes: ASR, SLU, and DM. By means of a finite state machine paradigm encoding the different models used by these three levels we show how the search for the best sequence of dialogue states can be done simultaneously at the word, concept, interpretation and dialogue state levels.</description><subject>Automata</subject><subject>Automatic speech recognition</subject><subject>Delta modulation</subject><subject>Encoding</subject><subject>Language Models</subject><subject>Lattices</subject><subject>Natural languages</subject><subject>Predictive models</subject><subject>Research and development</subject><subject>Spoken Dialogue Systems</subject><subject>Spoken Language Understanding</subject><subject>Telecommunications</subject><subject>Telephony</subject><issn>1520-6149</issn><issn>2379-190X</issn><isbn>9781424407279</isbn><isbn>1424407273</isbn><isbn>9781424407286</isbn><isbn>1424407281</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2007</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNpVj8tKw0AYhccbGGtfQDfzAon_XDKXZQlWhYBCWnVXppM_MdpOQpIu-vYN6MbNOXAOfJxDyB2DhDGwDy_ZoijeEg6gE6E0S-GMzK02THIpQXOjzknEhbYxs_B58a_T9pJELOUQKybtNbkZhm8AMFqaiHwUXfuDgeYu1AdXI12HEvthdKFsQk2LsXcj1g0OtA10_EK67F3wSFe4Q9_uqZhI9L1tpmhRY_BHmrV9dxhuyVXldgPO_3xG1svHVfYc569P05k8bphOx9hAqv001AH3pRJoKifkJBrBpibVpaoUojOeC1FyWbkKhNoqaUshtrzSYkbuf7kNIm66vtm7_riRnBngUpwAfOtV1Q</recordid><startdate>200704</startdate><enddate>200704</enddate><creator>Damnati, G.</creator><creator>Bechet, F.</creator><creator>De Mori, R.</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>200704</creationdate><title>Spoken Language Understanding Strategies on the France Telecom 3000 Voice Agency Corpus</title><author>Damnati, G. ; Bechet, F. ; De Mori, R.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i175t-8057c142a02cd63e8fa348fa7e095857d6f6eea8c233d24faf036b649d33b2f73</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2007</creationdate><topic>Automata</topic><topic>Automatic speech recognition</topic><topic>Delta modulation</topic><topic>Encoding</topic><topic>Language Models</topic><topic>Lattices</topic><topic>Natural languages</topic><topic>Predictive models</topic><topic>Research and development</topic><topic>Spoken Dialogue Systems</topic><topic>Spoken Language Understanding</topic><topic>Telecommunications</topic><topic>Telephony</topic><toplevel>online_resources</toplevel><creatorcontrib>Damnati, G.</creatorcontrib><creatorcontrib>Bechet, F.</creatorcontrib><creatorcontrib>De Mori, R.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Damnati, G.</au><au>Bechet, F.</au><au>De Mori, R.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Spoken Language Understanding Strategies on the France Telecom 3000 Voice Agency Corpus</atitle><btitle>2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07</btitle><stitle>ICASSP</stitle><date>2007-04</date><risdate>2007</risdate><volume>4</volume><spage>IV-9</spage><epage>IV-12</epage><pages>IV-9-IV-12</pages><issn>1520-6149</issn><eissn>2379-190X</eissn><isbn>9781424407279</isbn><isbn>1424407273</isbn><eisbn>9781424407286</eisbn><eisbn>1424407281</eisbn><abstract>Telephone services are now deployed that allow users to react to telephone prompts in spoken natural language. These systems have limited domain semantics and dialogue strategies which are represented by finite state diagrams. Most of these systems adopt a sequential approach where the automatic speech recognition (ASR) process, the spoken language understanding (SLU) process and the dialogue management (DM) are separate processes. In the framework of the France Telecom 3000 voice service, we propose in this paper to study several strategies in order to integrate more closely these three processes: ASR, SLU, and DM. By means of a finite state machine paradigm encoding the different models used by these three levels we show how the search for the best sequence of dialogue states can be done simultaneously at the word, concept, interpretation and dialogue state levels.</abstract><pub>IEEE</pub><doi>10.1109/ICASSP.2007.367150</doi></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1520-6149 |
ispartof | 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07, 2007, Vol.4, p.IV-9-IV-12 |
issn | 1520-6149 2379-190X |
language | eng |
recordid | cdi_ieee_primary_4218024 |
source | IEEE Electronic Library (IEL) Conference Proceedings |
subjects | Automata Automatic speech recognition Delta modulation Encoding Language Models Lattices Natural languages Predictive models Research and development Spoken Dialogue Systems Spoken Language Understanding Telecommunications Telephony |
title | Spoken Language Understanding Strategies on the France Telecom 3000 Voice Agency Corpus |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T08%3A48%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Spoken%20Language%20Understanding%20Strategies%20on%20the%20France%20Telecom%203000%20Voice%20Agency%20Corpus&rft.btitle=2007%20IEEE%20International%20Conference%20on%20Acoustics,%20Speech%20and%20Signal%20Processing%20-%20ICASSP%20'07&rft.au=Damnati,%20G.&rft.date=2007-04&rft.volume=4&rft.spage=IV-9&rft.epage=IV-12&rft.pages=IV-9-IV-12&rft.issn=1520-6149&rft.eissn=2379-190X&rft.isbn=9781424407279&rft.isbn_list=1424407273&rft_id=info:doi/10.1109/ICASSP.2007.367150&rft_dat=%3Cieee_6IE%3E4218024%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=9781424407286&rft.eisbn_list=1424407281&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=4218024&rfr_iscdi=true |