An experimental Japanese/English interpreting video phone system

We report on the architectural design issues and experiences gained while building and demonstrating an experimental interpreting video phone (IVP) system. The IVP system has been demonstrated in an Internet home shopping simulation simultaneously before live audiences in Japan and the US. An Americ...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Karaorman, M., Applebaum, T.H., Itoh, T., Endo, M., Ohno, Y., Hoshimi, M., Kamai, T., Matsui, K., Hata, K., Pearson, S., Junqua, J.-C.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1679 vol.3
container_issue
container_start_page 1676
container_title
container_volume 3
creator Karaorman, M.
Applebaum, T.H.
Itoh, T.
Endo, M.
Ohno, Y.
Hoshimi, M.
Kamai, T.
Matsui, K.
Hata, K.
Pearson, S.
Junqua, J.-C.
description We report on the architectural design issues and experiences gained while building and demonstrating an experimental interpreting video phone (IVP) system. The IVP system has been demonstrated in an Internet home shopping simulation simultaneously before live audiences in Japan and the US. An American shop assistant and a Japanese customer engaged in task directed dialogues using their native languages. In addition to their direct audio/visual contact by ISDN video phone, each participant heard a translation of the remote speaker's utterances in a synthetic voice in real time. Each site used a medium size vocabulary, a continuous speech recognition system and a text to speech synthesis (TTS) system for the local language. Recognition results were transmitted over the Internet to the remote site, where the corresponding translated sentence was spoken by TTS in the listener's native language. All of the speech and language processing software components of the system were independently developed proprietary technologies of the authors' laboratories which were integrated using commercially available hardware and communication media. Difficulties encountered in developing the system, the accommodations which were made, and other experiences gained through the process are reported.
doi_str_mv 10.1109/ICSLP.1996.607948
format Conference Proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_607948</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>607948</ieee_id><sourcerecordid>607948</sourcerecordid><originalsourceid>FETCH-LOGICAL-i147t-4526891954f4752ae75cd5562975cbce07f23421970b7290dc6fadeb71984f723</originalsourceid><addsrcrecordid>eNotj81Kw0AUhQdEUGsfQFfzAknn_2Z2llBtJWChui6T5E47kk5DJoh9ewP1bM63OnyHkCfOcs6ZXWzKXbXNubUmNwysKm7IA4OCSam1VndkntI3m6I0B27uycsyUvztcQgnjKPr6LvrXcSEi1U8dCEdaYgjDv2AY4gH-hNaPNP-eI5I0yWNeHokt951Cef_PSNfr6vPcp1VH2-bclllgSsYM6WFKSy3WnkFWjgE3bRaG2EnqBtk4IVUgltgNQjL2sZ412IN3BbKg5Az8nzdDYi47ydfN1z214_yD88VR0w</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>An experimental Japanese/English interpreting video phone system</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Karaorman, M. ; Applebaum, T.H. ; Itoh, T. ; Endo, M. ; Ohno, Y. ; Hoshimi, M. ; Kamai, T. ; Matsui, K. ; Hata, K. ; Pearson, S. ; Junqua, J.-C.</creator><creatorcontrib>Karaorman, M. ; Applebaum, T.H. ; Itoh, T. ; Endo, M. ; Ohno, Y. ; Hoshimi, M. ; Kamai, T. ; Matsui, K. ; Hata, K. ; Pearson, S. ; Junqua, J.-C.</creatorcontrib><description>We report on the architectural design issues and experiences gained while building and demonstrating an experimental interpreting video phone (IVP) system. The IVP system has been demonstrated in an Internet home shopping simulation simultaneously before live audiences in Japan and the US. An American shop assistant and a Japanese customer engaged in task directed dialogues using their native languages. In addition to their direct audio/visual contact by ISDN video phone, each participant heard a translation of the remote speaker's utterances in a synthetic voice in real time. Each site used a medium size vocabulary, a continuous speech recognition system and a text to speech synthesis (TTS) system for the local language. Recognition results were transmitted over the Internet to the remote site, where the corresponding translated sentence was spoken by TTS in the listener's native language. All of the speech and language processing software components of the system were independently developed proprietary technologies of the authors' laboratories which were integrated using commercially available hardware and communication media. Difficulties encountered in developing the system, the accommodations which were made, and other experiences gained through the process are reported.</description><identifier>ISBN: 0780335554</identifier><identifier>ISBN: 9780780335554</identifier><identifier>DOI: 10.1109/ICSLP.1996.607948</identifier><language>eng</language><publisher>IEEE</publisher><subject>Buildings ; Communication system software ; Internet ; ISDN ; Natural languages ; Speech processing ; Speech recognition ; Speech synthesis ; Videophone systems ; Vocabulary</subject><ispartof>Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, 1996, Vol.3, p.1676-1679 vol.3</ispartof><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/607948$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2058,4050,4051,27925,54920</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/607948$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Karaorman, M.</creatorcontrib><creatorcontrib>Applebaum, T.H.</creatorcontrib><creatorcontrib>Itoh, T.</creatorcontrib><creatorcontrib>Endo, M.</creatorcontrib><creatorcontrib>Ohno, Y.</creatorcontrib><creatorcontrib>Hoshimi, M.</creatorcontrib><creatorcontrib>Kamai, T.</creatorcontrib><creatorcontrib>Matsui, K.</creatorcontrib><creatorcontrib>Hata, K.</creatorcontrib><creatorcontrib>Pearson, S.</creatorcontrib><creatorcontrib>Junqua, J.-C.</creatorcontrib><title>An experimental Japanese/English interpreting video phone system</title><title>Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96</title><addtitle>ICSLP</addtitle><description>We report on the architectural design issues and experiences gained while building and demonstrating an experimental interpreting video phone (IVP) system. The IVP system has been demonstrated in an Internet home shopping simulation simultaneously before live audiences in Japan and the US. An American shop assistant and a Japanese customer engaged in task directed dialogues using their native languages. In addition to their direct audio/visual contact by ISDN video phone, each participant heard a translation of the remote speaker's utterances in a synthetic voice in real time. Each site used a medium size vocabulary, a continuous speech recognition system and a text to speech synthesis (TTS) system for the local language. Recognition results were transmitted over the Internet to the remote site, where the corresponding translated sentence was spoken by TTS in the listener's native language. All of the speech and language processing software components of the system were independently developed proprietary technologies of the authors' laboratories which were integrated using commercially available hardware and communication media. Difficulties encountered in developing the system, the accommodations which were made, and other experiences gained through the process are reported.</description><subject>Buildings</subject><subject>Communication system software</subject><subject>Internet</subject><subject>ISDN</subject><subject>Natural languages</subject><subject>Speech processing</subject><subject>Speech recognition</subject><subject>Speech synthesis</subject><subject>Videophone systems</subject><subject>Vocabulary</subject><isbn>0780335554</isbn><isbn>9780780335554</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>1996</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNotj81Kw0AUhQdEUGsfQFfzAknn_2Z2llBtJWChui6T5E47kk5DJoh9ewP1bM63OnyHkCfOcs6ZXWzKXbXNubUmNwysKm7IA4OCSam1VndkntI3m6I0B27uycsyUvztcQgnjKPr6LvrXcSEi1U8dCEdaYgjDv2AY4gH-hNaPNP-eI5I0yWNeHokt951Cef_PSNfr6vPcp1VH2-bclllgSsYM6WFKSy3WnkFWjgE3bRaG2EnqBtk4IVUgltgNQjL2sZ412IN3BbKg5Az8nzdDYi47ydfN1z214_yD88VR0w</recordid><startdate>1996</startdate><enddate>1996</enddate><creator>Karaorman, M.</creator><creator>Applebaum, T.H.</creator><creator>Itoh, T.</creator><creator>Endo, M.</creator><creator>Ohno, Y.</creator><creator>Hoshimi, M.</creator><creator>Kamai, T.</creator><creator>Matsui, K.</creator><creator>Hata, K.</creator><creator>Pearson, S.</creator><creator>Junqua, J.-C.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>1996</creationdate><title>An experimental Japanese/English interpreting video phone system</title><author>Karaorman, M. ; Applebaum, T.H. ; Itoh, T. ; Endo, M. ; Ohno, Y. ; Hoshimi, M. ; Kamai, T. ; Matsui, K. ; Hata, K. ; Pearson, S. ; Junqua, J.-C.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i147t-4526891954f4752ae75cd5562975cbce07f23421970b7290dc6fadeb71984f723</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>1996</creationdate><topic>Buildings</topic><topic>Communication system software</topic><topic>Internet</topic><topic>ISDN</topic><topic>Natural languages</topic><topic>Speech processing</topic><topic>Speech recognition</topic><topic>Speech synthesis</topic><topic>Videophone systems</topic><topic>Vocabulary</topic><toplevel>online_resources</toplevel><creatorcontrib>Karaorman, M.</creatorcontrib><creatorcontrib>Applebaum, T.H.</creatorcontrib><creatorcontrib>Itoh, T.</creatorcontrib><creatorcontrib>Endo, M.</creatorcontrib><creatorcontrib>Ohno, Y.</creatorcontrib><creatorcontrib>Hoshimi, M.</creatorcontrib><creatorcontrib>Kamai, T.</creatorcontrib><creatorcontrib>Matsui, K.</creatorcontrib><creatorcontrib>Hata, K.</creatorcontrib><creatorcontrib>Pearson, S.</creatorcontrib><creatorcontrib>Junqua, J.-C.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Karaorman, M.</au><au>Applebaum, T.H.</au><au>Itoh, T.</au><au>Endo, M.</au><au>Ohno, Y.</au><au>Hoshimi, M.</au><au>Kamai, T.</au><au>Matsui, K.</au><au>Hata, K.</au><au>Pearson, S.</au><au>Junqua, J.-C.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>An experimental Japanese/English interpreting video phone system</atitle><btitle>Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96</btitle><stitle>ICSLP</stitle><date>1996</date><risdate>1996</risdate><volume>3</volume><spage>1676</spage><epage>1679 vol.3</epage><pages>1676-1679 vol.3</pages><isbn>0780335554</isbn><isbn>9780780335554</isbn><abstract>We report on the architectural design issues and experiences gained while building and demonstrating an experimental interpreting video phone (IVP) system. The IVP system has been demonstrated in an Internet home shopping simulation simultaneously before live audiences in Japan and the US. An American shop assistant and a Japanese customer engaged in task directed dialogues using their native languages. In addition to their direct audio/visual contact by ISDN video phone, each participant heard a translation of the remote speaker's utterances in a synthetic voice in real time. Each site used a medium size vocabulary, a continuous speech recognition system and a text to speech synthesis (TTS) system for the local language. Recognition results were transmitted over the Internet to the remote site, where the corresponding translated sentence was spoken by TTS in the listener's native language. All of the speech and language processing software components of the system were independently developed proprietary technologies of the authors' laboratories which were integrated using commercially available hardware and communication media. Difficulties encountered in developing the system, the accommodations which were made, and other experiences gained through the process are reported.</abstract><pub>IEEE</pub><doi>10.1109/ICSLP.1996.607948</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier ISBN: 0780335554
ispartof Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, 1996, Vol.3, p.1676-1679 vol.3
issn
language eng
recordid cdi_ieee_primary_607948
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Buildings
Communication system software
Internet
ISDN
Natural languages
Speech processing
Speech recognition
Speech synthesis
Videophone systems
Vocabulary
title An experimental Japanese/English interpreting video phone system
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T16%3A55%3A57IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=An%20experimental%20Japanese/English%20interpreting%20video%20phone%20system&rft.btitle=Proceeding%20of%20Fourth%20International%20Conference%20on%20Spoken%20Language%20Processing.%20ICSLP%20'96&rft.au=Karaorman,%20M.&rft.date=1996&rft.volume=3&rft.spage=1676&rft.epage=1679%20vol.3&rft.pages=1676-1679%20vol.3&rft.isbn=0780335554&rft.isbn_list=9780780335554&rft_id=info:doi/10.1109/ICSLP.1996.607948&rft_dat=%3Cieee_6IE%3E607948%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=607948&rfr_iscdi=true