Spoken Language Understanding Strategies on the France Telecom 3000 Voice Agency Corpus

Telephone services are now deployed that allow users to react to telephone prompts in spoken natural language. These systems have limited domain semantics and dialogue strategies which are represented by finite state diagrams. Most of these systems adopt a sequential approach where the automatic spe...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Damnati, G., Bechet, F., De Mori, R.
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Automata Automatic speech recognition Delta modulation Encoding Language Models Lattices Natural languages Predictive models Research and development Spoken Dialogue Systems Spoken Language Understanding Telecommunications Telephony
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	IV-12
container_issue
container_start_page	IV-9
container_title
container_volume	4
creator	Damnati, G. Bechet, F. De Mori, R.
description	Telephone services are now deployed that allow users to react to telephone prompts in spoken natural language. These systems have limited domain semantics and dialogue strategies which are represented by finite state diagrams. Most of these systems adopt a sequential approach where the automatic speech recognition (ASR) process, the spoken language understanding (SLU) process and the dialogue management (DM) are separate processes. In the framework of the France Telecom 3000 voice service, we propose in this paper to study several strategies in order to integrate more closely these three processes: ASR, SLU, and DM. By means of a finite state machine paradigm encoding the different models used by these three levels we show how the search for the best sequence of dialogue states can be done simultaneously at the word, concept, interpretation and dialogue state levels.
doi_str_mv	10.1109/ICASSP.2007.367150
format	Conference Proceeding
fullrecord	<record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_4218024</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>4218024</ieee_id><sourcerecordid>4218024</sourcerecordid><originalsourceid>FETCH-LOGICAL-i175t-8057c142a02cd63e8fa348fa7e095857d6f6eea8c233d24faf036b649d33b2f73</originalsourceid><addsrcrecordid>eNpVj8tKw0AYhccbGGtfQDfzAon_XDKXZQlWhYBCWnVXppM_MdpOQpIu-vYN6MbNOXAOfJxDyB2DhDGwDy_ZoijeEg6gE6E0S-GMzK02THIpQXOjzknEhbYxs_B58a_T9pJELOUQKybtNbkZhm8AMFqaiHwUXfuDgeYu1AdXI12HEvthdKFsQk2LsXcj1g0OtA10_EK67F3wSFe4Q9_uqZhI9L1tpmhRY_BHmrV9dxhuyVXldgPO_3xG1svHVfYc569P05k8bphOx9hAqv001AH3pRJoKifkJBrBpibVpaoUojOeC1FyWbkKhNoqaUshtrzSYkbuf7kNIm66vtm7_riRnBngUpwAfOtV1Q</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Spoken Language Understanding Strategies on the France Telecom 3000 Voice Agency Corpus</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Damnati, G. ; Bechet, F. ; De Mori, R.</creator><creatorcontrib>Damnati, G. ; Bechet, F. ; De Mori, R.</creatorcontrib><description>Telephone services are now deployed that allow users to react to telephone prompts in spoken natural language. These systems have limited domain semantics and dialogue strategies which are represented by finite state diagrams. Most of these systems adopt a sequential approach where the automatic speech recognition (ASR) process, the spoken language understanding (SLU) process and the dialogue management (DM) are separate processes. In the framework of the France Telecom 3000 voice service, we propose in this paper to study several strategies in order to integrate more closely these three processes: ASR, SLU, and DM. By means of a finite state machine paradigm encoding the different models used by these three levels we show how the search for the best sequence of dialogue states can be done simultaneously at the word, concept, interpretation and dialogue state levels.</description><identifier>ISSN: 1520-6149</identifier><identifier>ISBN: 9781424407279</identifier><identifier>ISBN: 1424407273</identifier><identifier>EISSN: 2379-190X</identifier><identifier>EISBN: 9781424407286</identifier><identifier>EISBN: 1424407281</identifier><identifier>DOI: 10.1109/ICASSP.2007.367150</identifier><language>eng</language><publisher>IEEE</publisher><subject>Automata ; Automatic speech recognition ; Delta modulation ; Encoding ; Language Models ; Lattices ; Natural languages ; Predictive models ; Research and development ; Spoken Dialogue Systems ; Spoken Language Understanding ; Telecommunications ; Telephony</subject><ispartof>2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07, 2007, Vol.4, p.IV-9-IV-12</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/4218024$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2058,27925,54920</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/4218024$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Damnati, G.</creatorcontrib><creatorcontrib>Bechet, F.</creatorcontrib><creatorcontrib>De Mori, R.</creatorcontrib><title>Spoken Language Understanding Strategies on the France Telecom 3000 Voice Agency Corpus</title><title>2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07</title><addtitle>ICASSP</addtitle><description>Telephone services are now deployed that allow users to react to telephone prompts in spoken natural language. These systems have limited domain semantics and dialogue strategies which are represented by finite state diagrams. Most of these systems adopt a sequential approach where the automatic speech recognition (ASR) process, the spoken language understanding (SLU) process and the dialogue management (DM) are separate processes. In the framework of the France Telecom 3000 voice service, we propose in this paper to study several strategies in order to integrate more closely these three processes: ASR, SLU, and DM. By means of a finite state machine paradigm encoding the different models used by these three levels we show how the search for the best sequence of dialogue states can be done simultaneously at the word, concept, interpretation and dialogue state levels.</description><subject>Automata</subject><subject>Automatic speech recognition</subject><subject>Delta modulation</subject><subject>Encoding</subject><subject>Language Models</subject><subject>Lattices</subject><subject>Natural languages</subject><subject>Predictive models</subject><subject>Research and development</subject><subject>Spoken Dialogue Systems</subject><subject>Spoken Language Understanding</subject><subject>Telecommunications</subject><subject>Telephony</subject><issn>1520-6149</issn><issn>2379-190X</issn><isbn>9781424407279</isbn><isbn>1424407273</isbn><isbn>9781424407286</isbn><isbn>1424407281</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2007</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNpVj8tKw0AYhccbGGtfQDfzAon_XDKXZQlWhYBCWnVXppM_MdpOQpIu-vYN6MbNOXAOfJxDyB2DhDGwDy_ZoijeEg6gE6E0S-GMzK02THIpQXOjzknEhbYxs_B58a_T9pJELOUQKybtNbkZhm8AMFqaiHwUXfuDgeYu1AdXI12HEvthdKFsQk2LsXcj1g0OtA10_EK67F3wSFe4Q9_uqZhI9L1tpmhRY_BHmrV9dxhuyVXldgPO_3xG1svHVfYc569P05k8bphOx9hAqv001AH3pRJoKifkJBrBpibVpaoUojOeC1FyWbkKhNoqaUshtrzSYkbuf7kNIm66vtm7_riRnBngUpwAfOtV1Q</recordid><startdate>200704</startdate><enddate>200704</enddate><creator>Damnati, G.</creator><creator>Bechet, F.</creator><creator>De Mori, R.</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>200704</creationdate><title>Spoken Language Understanding Strategies on the France Telecom 3000 Voice Agency Corpus</title><author>Damnati, G. ; Bechet, F. ; De Mori, R.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i175t-8057c142a02cd63e8fa348fa7e095857d6f6eea8c233d24faf036b649d33b2f73</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2007</creationdate><topic>Automata</topic><topic>Automatic speech recognition</topic><topic>Delta modulation</topic><topic>Encoding</topic><topic>Language Models</topic><topic>Lattices</topic><topic>Natural languages</topic><topic>Predictive models</topic><topic>Research and development</topic><topic>Spoken Dialogue Systems</topic><topic>Spoken Language Understanding</topic><topic>Telecommunications</topic><topic>Telephony</topic><toplevel>online_resources</toplevel><creatorcontrib>Damnati, G.</creatorcontrib><creatorcontrib>Bechet, F.</creatorcontrib><creatorcontrib>De Mori, R.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Damnati, G.</au><au>Bechet, F.</au><au>De Mori, R.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Spoken Language Understanding Strategies on the France Telecom 3000 Voice Agency Corpus</atitle><btitle>2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07</btitle><stitle>ICASSP</stitle><date>2007-04</date><risdate>2007</risdate><volume>4</volume><spage>IV-9</spage><epage>IV-12</epage><pages>IV-9-IV-12</pages><issn>1520-6149</issn><eissn>2379-190X</eissn><isbn>9781424407279</isbn><isbn>1424407273</isbn><eisbn>9781424407286</eisbn><eisbn>1424407281</eisbn><abstract>Telephone services are now deployed that allow users to react to telephone prompts in spoken natural language. These systems have limited domain semantics and dialogue strategies which are represented by finite state diagrams. Most of these systems adopt a sequential approach where the automatic speech recognition (ASR) process, the spoken language understanding (SLU) process and the dialogue management (DM) are separate processes. In the framework of the France Telecom 3000 voice service, we propose in this paper to study several strategies in order to integrate more closely these three processes: ASR, SLU, and DM. By means of a finite state machine paradigm encoding the different models used by these three levels we show how the search for the best sequence of dialogue states can be done simultaneously at the word, concept, interpretation and dialogue state levels.</abstract><pub>IEEE</pub><doi>10.1109/ICASSP.2007.367150</doi></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 1520-6149
ispartof	2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07, 2007, Vol.4, p.IV-9-IV-12
issn	1520-6149 2379-190X
language	eng
recordid	cdi_ieee_primary_4218024
source	IEEE Electronic Library (IEL) Conference Proceedings
subjects	Automata Automatic speech recognition Delta modulation Encoding Language Models Lattices Natural languages Predictive models Research and development Spoken Dialogue Systems Spoken Language Understanding Telecommunications Telephony
title	Spoken Language Understanding Strategies on the France Telecom 3000 Voice Agency Corpus
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-27T08%3A48%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Spoken%20Language%20Understanding%20Strategies%20on%20the%20France%20Telecom%203000%20Voice%20Agency%20Corpus&rft.btitle=2007%20IEEE%20International%20Conference%20on%20Acoustics,%20Speech%20and%20Signal%20Processing%20-%20ICASSP%20'07&rft.au=Damnati,%20G.&rft.date=2007-04&rft.volume=4&rft.spage=IV-9&rft.epage=IV-12&rft.pages=IV-9-IV-12&rft.issn=1520-6149&rft.eissn=2379-190X&rft.isbn=9781424407279&rft.isbn_list=1424407273&rft_id=info:doi/10.1109/ICASSP.2007.367150&rft_dat=%3Cieee_6IE%3E4218024%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=9781424407286&rft.eisbn_list=1424407281&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=4218024&rfr_iscdi=true