Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary using limited training data

This correspondence presents the first known results of complete recognition of continuous Mandarin speech for the Chinese language with very large vocabulary but very limited training data. Various acoustic and linguistic processing techniques were developed, and a prototype system of a continuous...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on speech and audio processing 1997-03, Vol.5 (2), p.195-200
Hauptverfasser: WANG, H.-M, HO, T.-H, YANG, R.-C, SHEN, J.-L, BAI, B.-R, HONG, J.-C, CHEN, W.-P, YU, T.-L, LEE, L.-S
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 200
container_issue 2
container_start_page 195
container_title IEEE transactions on speech and audio processing
container_volume 5
creator WANG, H.-M
HO, T.-H
YANG, R.-C
SHEN, J.-L
BAI, B.-R
HONG, J.-C
CHEN, W.-P
YU, T.-L
LEE, L.-S
description This correspondence presents the first known results of complete recognition of continuous Mandarin speech for the Chinese language with very large vocabulary but very limited training data. Various acoustic and linguistic processing techniques were developed, and a prototype system of a continuous speech Mandarin dictation machine has been successfully implemented. The best recognition accuracy achieved is 92.2% for finally decoded Chinese characters.
doi_str_mv 10.1109/89.554782
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_miscellaneous_28210211</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>554782</ieee_id><sourcerecordid>28210211</sourcerecordid><originalsourceid>FETCH-LOGICAL-c306t-26f780c9b2bcd3e0c1b8ec52af59fe8af609fccef6c575604109d9618ace211f3</originalsourceid><addsrcrecordid>eNo9kLtPwzAQxiMEEuUxsDJ5QEgMAdupHWdEFS-piAXmyL2cW6PULrZT1IW_HVetOt3rd5_uvqK4YvSeMdo8qOZeiHGt-FExYkKokleiOs45lVUpZS1Pi7MYvymlitXjUfE38ctVjwlJQPBzZ5P1jnhDwLtk3eCHSN6163SwjsQVIiyI8YFMFtZhRNJrNx_0HMmvTQuyxrDJrZDrtQc9G3K-IUO0bk56u7QJO5KCtm7b6HTSF8WJ0X3Ey308L76enz4nr-X04-Vt8jgtoaIylVyaWlFoZnwGXYUU2EwhCK6NaAwqbSRtDAAaCaIWko6zFV0jmdKAnDFTnRe3O91V8D8DxtQubQTs8_mYX2y54oxmMoN3OxCCjzGgaVfBLvMXLaPt1uFWNe3O4cze7EV1BN2boB3YeFjgQolKNhm73mEWEQ_TvcY_x7mGHw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>28210211</pqid></control><display><type>article</type><title>Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary using limited training data</title><source>IEEE Electronic Library (IEL)</source><creator>WANG, H.-M ; HO, T.-H ; YANG, R.-C ; SHEN, J.-L ; BAI, B.-R ; HONG, J.-C ; CHEN, W.-P ; YU, T.-L ; LEE, L.-S</creator><creatorcontrib>WANG, H.-M ; HO, T.-H ; YANG, R.-C ; SHEN, J.-L ; BAI, B.-R ; HONG, J.-C ; CHEN, W.-P ; YU, T.-L ; LEE, L.-S</creatorcontrib><description>This correspondence presents the first known results of complete recognition of continuous Mandarin speech for the Chinese language with very large vocabulary but very limited training data. Various acoustic and linguistic processing techniques were developed, and a prototype system of a continuous speech Mandarin dictation machine has been successfully implemented. The best recognition accuracy achieved is 92.2% for finally decoded Chinese characters.</description><identifier>ISSN: 1063-6676</identifier><identifier>EISSN: 1558-2353</identifier><identifier>DOI: 10.1109/89.554782</identifier><identifier>CODEN: IESPEJ</identifier><language>eng</language><publisher>New York, NY: IEEE</publisher><subject>Applied sciences ; Character recognition ; Computer science ; Decoding ; Exact sciences and technology ; Hidden Markov models ; Information, signal and communications theory ; Natural languages ; Prototypes ; Signal processing ; Speech processing ; Speech recognition ; Telecommunications and information theory ; Training data ; Vocabulary</subject><ispartof>IEEE transactions on speech and audio processing, 1997-03, Vol.5 (2), p.195-200</ispartof><rights>1997 INIST-CNRS</rights><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c306t-26f780c9b2bcd3e0c1b8ec52af59fe8af609fccef6c575604109d9618ace211f3</citedby><cites>FETCH-LOGICAL-c306t-26f780c9b2bcd3e0c1b8ec52af59fe8af609fccef6c575604109d9618ace211f3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/554782$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27923,27924,54757</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/554782$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=2585369$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>WANG, H.-M</creatorcontrib><creatorcontrib>HO, T.-H</creatorcontrib><creatorcontrib>YANG, R.-C</creatorcontrib><creatorcontrib>SHEN, J.-L</creatorcontrib><creatorcontrib>BAI, B.-R</creatorcontrib><creatorcontrib>HONG, J.-C</creatorcontrib><creatorcontrib>CHEN, W.-P</creatorcontrib><creatorcontrib>YU, T.-L</creatorcontrib><creatorcontrib>LEE, L.-S</creatorcontrib><title>Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary using limited training data</title><title>IEEE transactions on speech and audio processing</title><addtitle>T-SAP</addtitle><description>This correspondence presents the first known results of complete recognition of continuous Mandarin speech for the Chinese language with very large vocabulary but very limited training data. Various acoustic and linguistic processing techniques were developed, and a prototype system of a continuous speech Mandarin dictation machine has been successfully implemented. The best recognition accuracy achieved is 92.2% for finally decoded Chinese characters.</description><subject>Applied sciences</subject><subject>Character recognition</subject><subject>Computer science</subject><subject>Decoding</subject><subject>Exact sciences and technology</subject><subject>Hidden Markov models</subject><subject>Information, signal and communications theory</subject><subject>Natural languages</subject><subject>Prototypes</subject><subject>Signal processing</subject><subject>Speech processing</subject><subject>Speech recognition</subject><subject>Telecommunications and information theory</subject><subject>Training data</subject><subject>Vocabulary</subject><issn>1063-6676</issn><issn>1558-2353</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>1997</creationdate><recordtype>article</recordtype><recordid>eNo9kLtPwzAQxiMEEuUxsDJ5QEgMAdupHWdEFS-piAXmyL2cW6PULrZT1IW_HVetOt3rd5_uvqK4YvSeMdo8qOZeiHGt-FExYkKokleiOs45lVUpZS1Pi7MYvymlitXjUfE38ctVjwlJQPBzZ5P1jnhDwLtk3eCHSN6163SwjsQVIiyI8YFMFtZhRNJrNx_0HMmvTQuyxrDJrZDrtQc9G3K-IUO0bk56u7QJO5KCtm7b6HTSF8WJ0X3Ey308L76enz4nr-X04-Vt8jgtoaIylVyaWlFoZnwGXYUU2EwhCK6NaAwqbSRtDAAaCaIWko6zFV0jmdKAnDFTnRe3O91V8D8DxtQubQTs8_mYX2y54oxmMoN3OxCCjzGgaVfBLvMXLaPt1uFWNe3O4cze7EV1BN2boB3YeFjgQolKNhm73mEWEQ_TvcY_x7mGHw</recordid><startdate>19970301</startdate><enddate>19970301</enddate><creator>WANG, H.-M</creator><creator>HO, T.-H</creator><creator>YANG, R.-C</creator><creator>SHEN, J.-L</creator><creator>BAI, B.-R</creator><creator>HONG, J.-C</creator><creator>CHEN, W.-P</creator><creator>YU, T.-L</creator><creator>LEE, L.-S</creator><general>IEEE</general><general>Institute of Electrical and Electronics Engineers</general><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>19970301</creationdate><title>Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary using limited training data</title><author>WANG, H.-M ; HO, T.-H ; YANG, R.-C ; SHEN, J.-L ; BAI, B.-R ; HONG, J.-C ; CHEN, W.-P ; YU, T.-L ; LEE, L.-S</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c306t-26f780c9b2bcd3e0c1b8ec52af59fe8af609fccef6c575604109d9618ace211f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>1997</creationdate><topic>Applied sciences</topic><topic>Character recognition</topic><topic>Computer science</topic><topic>Decoding</topic><topic>Exact sciences and technology</topic><topic>Hidden Markov models</topic><topic>Information, signal and communications theory</topic><topic>Natural languages</topic><topic>Prototypes</topic><topic>Signal processing</topic><topic>Speech processing</topic><topic>Speech recognition</topic><topic>Telecommunications and information theory</topic><topic>Training data</topic><topic>Vocabulary</topic><toplevel>online_resources</toplevel><creatorcontrib>WANG, H.-M</creatorcontrib><creatorcontrib>HO, T.-H</creatorcontrib><creatorcontrib>YANG, R.-C</creatorcontrib><creatorcontrib>SHEN, J.-L</creatorcontrib><creatorcontrib>BAI, B.-R</creatorcontrib><creatorcontrib>HONG, J.-C</creatorcontrib><creatorcontrib>CHEN, W.-P</creatorcontrib><creatorcontrib>YU, T.-L</creatorcontrib><creatorcontrib>LEE, L.-S</creatorcontrib><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on speech and audio processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>WANG, H.-M</au><au>HO, T.-H</au><au>YANG, R.-C</au><au>SHEN, J.-L</au><au>BAI, B.-R</au><au>HONG, J.-C</au><au>CHEN, W.-P</au><au>YU, T.-L</au><au>LEE, L.-S</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary using limited training data</atitle><jtitle>IEEE transactions on speech and audio processing</jtitle><stitle>T-SAP</stitle><date>1997-03-01</date><risdate>1997</risdate><volume>5</volume><issue>2</issue><spage>195</spage><epage>200</epage><pages>195-200</pages><issn>1063-6676</issn><eissn>1558-2353</eissn><coden>IESPEJ</coden><abstract>This correspondence presents the first known results of complete recognition of continuous Mandarin speech for the Chinese language with very large vocabulary but very limited training data. Various acoustic and linguistic processing techniques were developed, and a prototype system of a continuous speech Mandarin dictation machine has been successfully implemented. The best recognition accuracy achieved is 92.2% for finally decoded Chinese characters.</abstract><cop>New York, NY</cop><pub>IEEE</pub><doi>10.1109/89.554782</doi><tpages>6</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1063-6676
ispartof IEEE transactions on speech and audio processing, 1997-03, Vol.5 (2), p.195-200
issn 1063-6676
1558-2353
language eng
recordid cdi_proquest_miscellaneous_28210211
source IEEE Electronic Library (IEL)
subjects Applied sciences
Character recognition
Computer science
Decoding
Exact sciences and technology
Hidden Markov models
Information, signal and communications theory
Natural languages
Prototypes
Signal processing
Speech processing
Speech recognition
Telecommunications and information theory
Training data
Vocabulary
title Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary using limited training data
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T17%3A15%3A01IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Complete%20recognition%20of%20continuous%20Mandarin%20speech%20for%20Chinese%20language%20with%20very%20large%20vocabulary%20using%20limited%20training%20data&rft.jtitle=IEEE%20transactions%20on%20speech%20and%20audio%20processing&rft.au=WANG,%20H.-M&rft.date=1997-03-01&rft.volume=5&rft.issue=2&rft.spage=195&rft.epage=200&rft.pages=195-200&rft.issn=1063-6676&rft.eissn=1558-2353&rft.coden=IESPEJ&rft_id=info:doi/10.1109/89.554782&rft_dat=%3Cproquest_RIE%3E28210211%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=28210211&rft_id=info:pmid/&rft_ieee_id=554782&rfr_iscdi=true