An experimental Japanese/English interpreting video phone system
We report on the architectural design issues and experiences gained while building and demonstrating an experimental interpreting video phone (IVP) system. The IVP system has been demonstrated in an Internet home shopping simulation simultaneously before live audiences in Japan and the US. An Americ...
Gespeichert in:
Hauptverfasser: | , , , , , , , , , , |
---|---|
Format: | Tagungsbericht |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 1679 vol.3 |
---|---|
container_issue | |
container_start_page | 1676 |
container_title | |
container_volume | 3 |
creator | Karaorman, M. Applebaum, T.H. Itoh, T. Endo, M. Ohno, Y. Hoshimi, M. Kamai, T. Matsui, K. Hata, K. Pearson, S. Junqua, J.-C. |
description | We report on the architectural design issues and experiences gained while building and demonstrating an experimental interpreting video phone (IVP) system. The IVP system has been demonstrated in an Internet home shopping simulation simultaneously before live audiences in Japan and the US. An American shop assistant and a Japanese customer engaged in task directed dialogues using their native languages. In addition to their direct audio/visual contact by ISDN video phone, each participant heard a translation of the remote speaker's utterances in a synthetic voice in real time. Each site used a medium size vocabulary, a continuous speech recognition system and a text to speech synthesis (TTS) system for the local language. Recognition results were transmitted over the Internet to the remote site, where the corresponding translated sentence was spoken by TTS in the listener's native language. All of the speech and language processing software components of the system were independently developed proprietary technologies of the authors' laboratories which were integrated using commercially available hardware and communication media. Difficulties encountered in developing the system, the accommodations which were made, and other experiences gained through the process are reported. |
doi_str_mv | 10.1109/ICSLP.1996.607948 |
format | Conference Proceeding |
fullrecord | <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_607948</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>607948</ieee_id><sourcerecordid>607948</sourcerecordid><originalsourceid>FETCH-LOGICAL-i147t-4526891954f4752ae75cd5562975cbce07f23421970b7290dc6fadeb71984f723</originalsourceid><addsrcrecordid>eNotj81Kw0AUhQdEUGsfQFfzAknn_2Z2llBtJWChui6T5E47kk5DJoh9ewP1bM63OnyHkCfOcs6ZXWzKXbXNubUmNwysKm7IA4OCSam1VndkntI3m6I0B27uycsyUvztcQgnjKPr6LvrXcSEi1U8dCEdaYgjDv2AY4gH-hNaPNP-eI5I0yWNeHokt951Cef_PSNfr6vPcp1VH2-bclllgSsYM6WFKSy3WnkFWjgE3bRaG2EnqBtk4IVUgltgNQjL2sZ412IN3BbKg5Az8nzdDYi47ydfN1z214_yD88VR0w</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>An experimental Japanese/English interpreting video phone system</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Karaorman, M. ; Applebaum, T.H. ; Itoh, T. ; Endo, M. ; Ohno, Y. ; Hoshimi, M. ; Kamai, T. ; Matsui, K. ; Hata, K. ; Pearson, S. ; Junqua, J.-C.</creator><creatorcontrib>Karaorman, M. ; Applebaum, T.H. ; Itoh, T. ; Endo, M. ; Ohno, Y. ; Hoshimi, M. ; Kamai, T. ; Matsui, K. ; Hata, K. ; Pearson, S. ; Junqua, J.-C.</creatorcontrib><description>We report on the architectural design issues and experiences gained while building and demonstrating an experimental interpreting video phone (IVP) system. The IVP system has been demonstrated in an Internet home shopping simulation simultaneously before live audiences in Japan and the US. An American shop assistant and a Japanese customer engaged in task directed dialogues using their native languages. In addition to their direct audio/visual contact by ISDN video phone, each participant heard a translation of the remote speaker's utterances in a synthetic voice in real time. Each site used a medium size vocabulary, a continuous speech recognition system and a text to speech synthesis (TTS) system for the local language. Recognition results were transmitted over the Internet to the remote site, where the corresponding translated sentence was spoken by TTS in the listener's native language. All of the speech and language processing software components of the system were independently developed proprietary technologies of the authors' laboratories which were integrated using commercially available hardware and communication media. Difficulties encountered in developing the system, the accommodations which were made, and other experiences gained through the process are reported.</description><identifier>ISBN: 0780335554</identifier><identifier>ISBN: 9780780335554</identifier><identifier>DOI: 10.1109/ICSLP.1996.607948</identifier><language>eng</language><publisher>IEEE</publisher><subject>Buildings ; Communication system software ; Internet ; ISDN ; Natural languages ; Speech processing ; Speech recognition ; Speech synthesis ; Videophone systems ; Vocabulary</subject><ispartof>Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, 1996, Vol.3, p.1676-1679 vol.3</ispartof><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/607948$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2058,4050,4051,27925,54920</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/607948$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Karaorman, M.</creatorcontrib><creatorcontrib>Applebaum, T.H.</creatorcontrib><creatorcontrib>Itoh, T.</creatorcontrib><creatorcontrib>Endo, M.</creatorcontrib><creatorcontrib>Ohno, Y.</creatorcontrib><creatorcontrib>Hoshimi, M.</creatorcontrib><creatorcontrib>Kamai, T.</creatorcontrib><creatorcontrib>Matsui, K.</creatorcontrib><creatorcontrib>Hata, K.</creatorcontrib><creatorcontrib>Pearson, S.</creatorcontrib><creatorcontrib>Junqua, J.-C.</creatorcontrib><title>An experimental Japanese/English interpreting video phone system</title><title>Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96</title><addtitle>ICSLP</addtitle><description>We report on the architectural design issues and experiences gained while building and demonstrating an experimental interpreting video phone (IVP) system. The IVP system has been demonstrated in an Internet home shopping simulation simultaneously before live audiences in Japan and the US. An American shop assistant and a Japanese customer engaged in task directed dialogues using their native languages. In addition to their direct audio/visual contact by ISDN video phone, each participant heard a translation of the remote speaker's utterances in a synthetic voice in real time. Each site used a medium size vocabulary, a continuous speech recognition system and a text to speech synthesis (TTS) system for the local language. Recognition results were transmitted over the Internet to the remote site, where the corresponding translated sentence was spoken by TTS in the listener's native language. All of the speech and language processing software components of the system were independently developed proprietary technologies of the authors' laboratories which were integrated using commercially available hardware and communication media. Difficulties encountered in developing the system, the accommodations which were made, and other experiences gained through the process are reported.</description><subject>Buildings</subject><subject>Communication system software</subject><subject>Internet</subject><subject>ISDN</subject><subject>Natural languages</subject><subject>Speech processing</subject><subject>Speech recognition</subject><subject>Speech synthesis</subject><subject>Videophone systems</subject><subject>Vocabulary</subject><isbn>0780335554</isbn><isbn>9780780335554</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>1996</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNotj81Kw0AUhQdEUGsfQFfzAknn_2Z2llBtJWChui6T5E47kk5DJoh9ewP1bM63OnyHkCfOcs6ZXWzKXbXNubUmNwysKm7IA4OCSam1VndkntI3m6I0B27uycsyUvztcQgnjKPr6LvrXcSEi1U8dCEdaYgjDv2AY4gH-hNaPNP-eI5I0yWNeHokt951Cef_PSNfr6vPcp1VH2-bclllgSsYM6WFKSy3WnkFWjgE3bRaG2EnqBtk4IVUgltgNQjL2sZ412IN3BbKg5Az8nzdDYi47ydfN1z214_yD88VR0w</recordid><startdate>1996</startdate><enddate>1996</enddate><creator>Karaorman, M.</creator><creator>Applebaum, T.H.</creator><creator>Itoh, T.</creator><creator>Endo, M.</creator><creator>Ohno, Y.</creator><creator>Hoshimi, M.</creator><creator>Kamai, T.</creator><creator>Matsui, K.</creator><creator>Hata, K.</creator><creator>Pearson, S.</creator><creator>Junqua, J.-C.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>1996</creationdate><title>An experimental Japanese/English interpreting video phone system</title><author>Karaorman, M. ; Applebaum, T.H. ; Itoh, T. ; Endo, M. ; Ohno, Y. ; Hoshimi, M. ; Kamai, T. ; Matsui, K. ; Hata, K. ; Pearson, S. ; Junqua, J.-C.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i147t-4526891954f4752ae75cd5562975cbce07f23421970b7290dc6fadeb71984f723</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>1996</creationdate><topic>Buildings</topic><topic>Communication system software</topic><topic>Internet</topic><topic>ISDN</topic><topic>Natural languages</topic><topic>Speech processing</topic><topic>Speech recognition</topic><topic>Speech synthesis</topic><topic>Videophone systems</topic><topic>Vocabulary</topic><toplevel>online_resources</toplevel><creatorcontrib>Karaorman, M.</creatorcontrib><creatorcontrib>Applebaum, T.H.</creatorcontrib><creatorcontrib>Itoh, T.</creatorcontrib><creatorcontrib>Endo, M.</creatorcontrib><creatorcontrib>Ohno, Y.</creatorcontrib><creatorcontrib>Hoshimi, M.</creatorcontrib><creatorcontrib>Kamai, T.</creatorcontrib><creatorcontrib>Matsui, K.</creatorcontrib><creatorcontrib>Hata, K.</creatorcontrib><creatorcontrib>Pearson, S.</creatorcontrib><creatorcontrib>Junqua, J.-C.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Karaorman, M.</au><au>Applebaum, T.H.</au><au>Itoh, T.</au><au>Endo, M.</au><au>Ohno, Y.</au><au>Hoshimi, M.</au><au>Kamai, T.</au><au>Matsui, K.</au><au>Hata, K.</au><au>Pearson, S.</au><au>Junqua, J.-C.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>An experimental Japanese/English interpreting video phone system</atitle><btitle>Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96</btitle><stitle>ICSLP</stitle><date>1996</date><risdate>1996</risdate><volume>3</volume><spage>1676</spage><epage>1679 vol.3</epage><pages>1676-1679 vol.3</pages><isbn>0780335554</isbn><isbn>9780780335554</isbn><abstract>We report on the architectural design issues and experiences gained while building and demonstrating an experimental interpreting video phone (IVP) system. The IVP system has been demonstrated in an Internet home shopping simulation simultaneously before live audiences in Japan and the US. An American shop assistant and a Japanese customer engaged in task directed dialogues using their native languages. In addition to their direct audio/visual contact by ISDN video phone, each participant heard a translation of the remote speaker's utterances in a synthetic voice in real time. Each site used a medium size vocabulary, a continuous speech recognition system and a text to speech synthesis (TTS) system for the local language. Recognition results were transmitted over the Internet to the remote site, where the corresponding translated sentence was spoken by TTS in the listener's native language. All of the speech and language processing software components of the system were independently developed proprietary technologies of the authors' laboratories which were integrated using commercially available hardware and communication media. Difficulties encountered in developing the system, the accommodations which were made, and other experiences gained through the process are reported.</abstract><pub>IEEE</pub><doi>10.1109/ICSLP.1996.607948</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISBN: 0780335554 |
ispartof | Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, 1996, Vol.3, p.1676-1679 vol.3 |
issn | |
language | eng |
recordid | cdi_ieee_primary_607948 |
source | IEEE Electronic Library (IEL) Conference Proceedings |
subjects | Buildings Communication system software Internet ISDN Natural languages Speech processing Speech recognition Speech synthesis Videophone systems Vocabulary |
title | An experimental Japanese/English interpreting video phone system |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T16%3A55%3A57IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=An%20experimental%20Japanese/English%20interpreting%20video%20phone%20system&rft.btitle=Proceeding%20of%20Fourth%20International%20Conference%20on%20Spoken%20Language%20Processing.%20ICSLP%20'96&rft.au=Karaorman,%20M.&rft.date=1996&rft.volume=3&rft.spage=1676&rft.epage=1679%20vol.3&rft.pages=1676-1679%20vol.3&rft.isbn=0780335554&rft.isbn_list=9780780335554&rft_id=info:doi/10.1109/ICSLP.1996.607948&rft_dat=%3Cieee_6IE%3E607948%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=607948&rfr_iscdi=true |