Towards language portability in statistical speech translation

Speech translation has made significant advances over the last years. We believe that we can overcome today's limits of language and domain portable conversational speech translation systems by relying more radically on learning approaches and by the use of multiple layers of reduction and tran...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Waibel, A., Schultz, T., Vogel, S., Fugen, C., Honal, M., Kolss, M., Reichert, J., Stuker, S.
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Applied sciences Automatic speech recognition Cleaning Exact sciences and technology Information, signal and communications theory Interactive systems Laboratories Lattices Miscellaneous Natural languages Signal processing Speech enhancement Speech recognition Stochastic processes Surface-mount technology Telecommunications and information theory
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	765
container_issue
container_start_page	iii
container_title
container_volume	3
creator	Waibel, A. Schultz, T. Vogel, S. Fugen, C. Honal, M. Kolss, M. Reichert, J. Stuker, S.
description	Speech translation has made significant advances over the last years. We believe that we can overcome today's limits of language and domain portable conversational speech translation systems by relying more radically on learning approaches and by the use of multiple layers of reduction and transformation to extract the desired content in another language. Therefore, we cascade stochastic source-channel models that extract an underlying message from a corrupt observed output. The three models effectively translate: (1) speech to word lattices (automatic speech recognition, ASR); (2) ill-formed fragments of word strings into a compact well-formed sentence (Clean); (3) sentences in one language to sentences in another (machine translation, MT). We present results of our research efforts towards rapid language portability of all these components. The results on translation suggest that MT systems can be successfully constructed for any language pair by cascading multiple MT systems via English. Moreover, end-to-end performance can be improved, if the interlingua language is enriched with additional linguistic information that can be derived automatically and monolingually in a data-driven fashion.
doi_str_mv	10.1109/ICASSP.2004.1326657
format	Conference Proceeding
fullrecord	<record><control><sourceid>pascalfrancis_6IE</sourceid><recordid>TN_cdi_ieee_primary_1326657</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>1326657</ieee_id><sourcerecordid>17610478</sourcerecordid><originalsourceid>FETCH-LOGICAL-i957-548ca4881d7f5c7755a49e974e53c8051cb26e53c2e576ea7e4bab58e0998d333</originalsourceid><addsrcrecordid>eNpFkFtLw0AQhRcvYK39BX3Ji4-ps7fM7osgxRsUFNoH38pkM6krMQ3ZiPTfG6kgHJjhzGH4OELMJSykBH_zvLxbr18XCsAspFZFYfFETJRGn0sPb6di5tHBKO2MM-pMTKRVkBfS-AtxmdIHADg0biJuN_tv6quUNdTuvmjHWbfvBypjE4dDFtssDTTENMRATZY65vCeDT21qRntfXslzmtqEs_-5lRsHu43y6d89fI4Qq7y6C3m1rhAxjlZYW0DorVkPHs0bHVwYGUoVfG7K7ZYMCGbkkrrGLx3ldZ6Kq6PbztKI0g9AoSYtl0fP6k_bCUWEgy6MTc_5iIz_5-PDekfJ3JYXw</addsrcrecordid><sourcetype>Index Database</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Towards language portability in statistical speech translation</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Waibel, A. ; Schultz, T. ; Vogel, S. ; Fugen, C. ; Honal, M. ; Kolss, M. ; Reichert, J. ; Stuker, S.</creator><creatorcontrib>Waibel, A. ; Schultz, T. ; Vogel, S. ; Fugen, C. ; Honal, M. ; Kolss, M. ; Reichert, J. ; Stuker, S.</creatorcontrib><description>Speech translation has made significant advances over the last years. We believe that we can overcome today's limits of language and domain portable conversational speech translation systems by relying more radically on learning approaches and by the use of multiple layers of reduction and transformation to extract the desired content in another language. Therefore, we cascade stochastic source-channel models that extract an underlying message from a corrupt observed output. The three models effectively translate: (1) speech to word lattices (automatic speech recognition, ASR); (2) ill-formed fragments of word strings into a compact well-formed sentence (Clean); (3) sentences in one language to sentences in another (machine translation, MT). We present results of our research efforts towards rapid language portability of all these components. The results on translation suggest that MT systems can be successfully constructed for any language pair by cascading multiple MT systems via English. Moreover, end-to-end performance can be improved, if the interlingua language is enriched with additional linguistic information that can be derived automatically and monolingually in a data-driven fashion.</description><identifier>ISSN: 1520-6149</identifier><identifier>ISBN: 9780780384842</identifier><identifier>ISBN: 0780384849</identifier><identifier>EISSN: 2379-190X</identifier><identifier>DOI: 10.1109/ICASSP.2004.1326657</identifier><language>eng</language><publisher>Piscataway, N.J: IEEE</publisher><subject>Applied sciences ; Automatic speech recognition ; Cleaning ; Exact sciences and technology ; Information, signal and communications theory ; Interactive systems ; Laboratories ; Lattices ; Miscellaneous ; Natural languages ; Signal processing ; Speech enhancement ; Speech recognition ; Stochastic processes ; Surface-mount technology ; Telecommunications and information theory</subject><ispartof>2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2004, Vol.3, p.iii-765</ispartof><rights>2006 INIST-CNRS</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/1326657$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2058,4050,4051,27925,54920</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/1326657$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=17610478$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Waibel, A.</creatorcontrib><creatorcontrib>Schultz, T.</creatorcontrib><creatorcontrib>Vogel, S.</creatorcontrib><creatorcontrib>Fugen, C.</creatorcontrib><creatorcontrib>Honal, M.</creatorcontrib><creatorcontrib>Kolss, M.</creatorcontrib><creatorcontrib>Reichert, J.</creatorcontrib><creatorcontrib>Stuker, S.</creatorcontrib><title>Towards language portability in statistical speech translation</title><title>2004 IEEE International Conference on Acoustics, Speech, and Signal Processing</title><addtitle>ICASSP</addtitle><description>Speech translation has made significant advances over the last years. We believe that we can overcome today's limits of language and domain portable conversational speech translation systems by relying more radically on learning approaches and by the use of multiple layers of reduction and transformation to extract the desired content in another language. Therefore, we cascade stochastic source-channel models that extract an underlying message from a corrupt observed output. The three models effectively translate: (1) speech to word lattices (automatic speech recognition, ASR); (2) ill-formed fragments of word strings into a compact well-formed sentence (Clean); (3) sentences in one language to sentences in another (machine translation, MT). We present results of our research efforts towards rapid language portability of all these components. The results on translation suggest that MT systems can be successfully constructed for any language pair by cascading multiple MT systems via English. Moreover, end-to-end performance can be improved, if the interlingua language is enriched with additional linguistic information that can be derived automatically and monolingually in a data-driven fashion.</description><subject>Applied sciences</subject><subject>Automatic speech recognition</subject><subject>Cleaning</subject><subject>Exact sciences and technology</subject><subject>Information, signal and communications theory</subject><subject>Interactive systems</subject><subject>Laboratories</subject><subject>Lattices</subject><subject>Miscellaneous</subject><subject>Natural languages</subject><subject>Signal processing</subject><subject>Speech enhancement</subject><subject>Speech recognition</subject><subject>Stochastic processes</subject><subject>Surface-mount technology</subject><subject>Telecommunications and information theory</subject><issn>1520-6149</issn><issn>2379-190X</issn><isbn>9780780384842</isbn><isbn>0780384849</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2004</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNpFkFtLw0AQhRcvYK39BX3Ji4-ps7fM7osgxRsUFNoH38pkM6krMQ3ZiPTfG6kgHJjhzGH4OELMJSykBH_zvLxbr18XCsAspFZFYfFETJRGn0sPb6di5tHBKO2MM-pMTKRVkBfS-AtxmdIHADg0biJuN_tv6quUNdTuvmjHWbfvBypjE4dDFtssDTTENMRATZY65vCeDT21qRntfXslzmtqEs_-5lRsHu43y6d89fI4Qq7y6C3m1rhAxjlZYW0DorVkPHs0bHVwYGUoVfG7K7ZYMCGbkkrrGLx3ldZ6Kq6PbztKI0g9AoSYtl0fP6k_bCUWEgy6MTc_5iIz_5-PDekfJ3JYXw</recordid><startdate>2004</startdate><enddate>2004</enddate><creator>Waibel, A.</creator><creator>Schultz, T.</creator><creator>Vogel, S.</creator><creator>Fugen, C.</creator><creator>Honal, M.</creator><creator>Kolss, M.</creator><creator>Reichert, J.</creator><creator>Stuker, S.</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope><scope>IQODW</scope></search><sort><creationdate>2004</creationdate><title>Towards language portability in statistical speech translation</title><author>Waibel, A. ; Schultz, T. ; Vogel, S. ; Fugen, C. ; Honal, M. ; Kolss, M. ; Reichert, J. ; Stuker, S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i957-548ca4881d7f5c7755a49e974e53c8051cb26e53c2e576ea7e4bab58e0998d333</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2004</creationdate><topic>Applied sciences</topic><topic>Automatic speech recognition</topic><topic>Cleaning</topic><topic>Exact sciences and technology</topic><topic>Information, signal and communications theory</topic><topic>Interactive systems</topic><topic>Laboratories</topic><topic>Lattices</topic><topic>Miscellaneous</topic><topic>Natural languages</topic><topic>Signal processing</topic><topic>Speech enhancement</topic><topic>Speech recognition</topic><topic>Stochastic processes</topic><topic>Surface-mount technology</topic><topic>Telecommunications and information theory</topic><toplevel>online_resources</toplevel><creatorcontrib>Waibel, A.</creatorcontrib><creatorcontrib>Schultz, T.</creatorcontrib><creatorcontrib>Vogel, S.</creatorcontrib><creatorcontrib>Fugen, C.</creatorcontrib><creatorcontrib>Honal, M.</creatorcontrib><creatorcontrib>Kolss, M.</creatorcontrib><creatorcontrib>Reichert, J.</creatorcontrib><creatorcontrib>Stuker, S.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection><collection>Pascal-Francis</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Waibel, A.</au><au>Schultz, T.</au><au>Vogel, S.</au><au>Fugen, C.</au><au>Honal, M.</au><au>Kolss, M.</au><au>Reichert, J.</au><au>Stuker, S.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Towards language portability in statistical speech translation</atitle><btitle>2004 IEEE International Conference on Acoustics, Speech, and Signal Processing</btitle><stitle>ICASSP</stitle><date>2004</date><risdate>2004</risdate><volume>3</volume><spage>iii</spage><epage>765</epage><pages>iii-765</pages><issn>1520-6149</issn><eissn>2379-190X</eissn><isbn>9780780384842</isbn><isbn>0780384849</isbn><abstract>Speech translation has made significant advances over the last years. We believe that we can overcome today's limits of language and domain portable conversational speech translation systems by relying more radically on learning approaches and by the use of multiple layers of reduction and transformation to extract the desired content in another language. Therefore, we cascade stochastic source-channel models that extract an underlying message from a corrupt observed output. The three models effectively translate: (1) speech to word lattices (automatic speech recognition, ASR); (2) ill-formed fragments of word strings into a compact well-formed sentence (Clean); (3) sentences in one language to sentences in another (machine translation, MT). We present results of our research efforts towards rapid language portability of all these components. The results on translation suggest that MT systems can be successfully constructed for any language pair by cascading multiple MT systems via English. Moreover, end-to-end performance can be improved, if the interlingua language is enriched with additional linguistic information that can be derived automatically and monolingually in a data-driven fashion.</abstract><cop>Piscataway, N.J</cop><pub>IEEE</pub><doi>10.1109/ICASSP.2004.1326657</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 1520-6149
ispartof	2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2004, Vol.3, p.iii-765
issn	1520-6149 2379-190X
language	eng
recordid	cdi_ieee_primary_1326657
source	IEEE Electronic Library (IEL) Conference Proceedings
subjects	Applied sciences Automatic speech recognition Cleaning Exact sciences and technology Information, signal and communications theory Interactive systems Laboratories Lattices Miscellaneous Natural languages Signal processing Speech enhancement Speech recognition Stochastic processes Surface-mount technology Telecommunications and information theory
title	Towards language portability in statistical speech translation
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-30T23%3A41%3A35IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pascalfrancis_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Towards%20language%20portability%20in%20statistical%20speech%20translation&rft.btitle=2004%20IEEE%20International%20Conference%20on%20Acoustics,%20Speech,%20and%20Signal%20Processing&rft.au=Waibel,%20A.&rft.date=2004&rft.volume=3&rft.spage=iii&rft.epage=765&rft.pages=iii-765&rft.issn=1520-6149&rft.eissn=2379-190X&rft.isbn=9780780384842&rft.isbn_list=0780384849&rft_id=info:doi/10.1109/ICASSP.2004.1326657&rft_dat=%3Cpascalfrancis_6IE%3E17610478%3C/pascalfrancis_6IE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=1326657&rfr_iscdi=true