Deriving articulatory representations from speech with various excitation modes

A new approach is described which estimates vocal tract shape sequences for speech consisting of voiceless speech and periods of silence as well as voiced speech. This method, based on the use of articulatory codebooks, has proved successful in identifying the place position of stops and fricatives....

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Richards, H.B., Mason, J.S., Hunt, M.J., Bridle, J.S.
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Background noise Bandwidth Character generation Noise measurement Parameter estimation Resonance Shape Speech analysis Speech synthesis State estimation
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	1236 vol.2
container_issue
container_start_page	1233
container_title
container_volume	2
creator	Richards, H.B. Mason, J.S. Hunt, M.J. Bridle, J.S.
description	A new approach is described which estimates vocal tract shape sequences for speech consisting of voiceless speech and periods of silence as well as voiced speech. This method, based on the use of articulatory codebooks, has proved successful in identifying the place position of stops and fricatives. Secondly, the authors focus on voiced speech in particular. A fast analysis-by-synthesis scheme, which gives continuously-valued area estimates, has been developed. Savings in computation of 50:1 have been achieved by using an MLP to perform the synthesis in this method. The technique also allows a more complex dynamic model to be used.
doi_str_mv	10.1109/ICSLP.1996.607831
format	Conference Proceeding
fullrecord	<record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_607831</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>607831</ieee_id><sourcerecordid>607831</sourcerecordid><originalsourceid>FETCH-LOGICAL-i147t-60398555393a719a8a6ae1c5db2c11c82446adda96d5fe2f1cd241b91f4bccd53</originalsourceid><addsrcrecordid>eNotj11LwzAYhQMiqHM_QK_yB1rzNh9tLqV-bFCYoF6PNHnrIms7kmy6f2-hnpvn5vBwDiF3wHIAph_W9XvzloPWKlesrDhckJuJjHMppbgiyxi_2RQhoQR1TTZPGPzJD1_UhOTtcW_SGM404CFgxCGZ5Mch0i6MPY0HRLujPz7t6MkEPx4jxV_r5xLtR4fxllx2Zh9x-c8F-Xx5_qhXWbN5XdePTeZBlClTjOtqWsQ1NyVoUxllEKx0bWEBbFUIoYxzRisnOyw6sK4Q0GroRGutk3xB7mevR8TtIfjehPN2vsz_ANvNTuc</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Deriving articulatory representations from speech with various excitation modes</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Richards, H.B. ; Mason, J.S. ; Hunt, M.J. ; Bridle, J.S.</creator><creatorcontrib>Richards, H.B. ; Mason, J.S. ; Hunt, M.J. ; Bridle, J.S.</creatorcontrib><description>A new approach is described which estimates vocal tract shape sequences for speech consisting of voiceless speech and periods of silence as well as voiced speech. This method, based on the use of articulatory codebooks, has proved successful in identifying the place position of stops and fricatives. Secondly, the authors focus on voiced speech in particular. A fast analysis-by-synthesis scheme, which gives continuously-valued area estimates, has been developed. Savings in computation of 50:1 have been achieved by using an MLP to perform the synthesis in this method. The technique also allows a more complex dynamic model to be used.</description><identifier>ISBN: 0780335554</identifier><identifier>ISBN: 9780780335554</identifier><identifier>DOI: 10.1109/ICSLP.1996.607831</identifier><language>eng</language><publisher>IEEE</publisher><subject>Background noise ; Bandwidth ; Character generation ; Noise measurement ; Parameter estimation ; Resonance ; Shape ; Speech analysis ; Speech synthesis ; State estimation</subject><ispartof>Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, 1996, Vol.2, p.1233-1236 vol.2</ispartof><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/607831$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,2052,4036,4037,27902,54895</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/607831$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Richards, H.B.</creatorcontrib><creatorcontrib>Mason, J.S.</creatorcontrib><creatorcontrib>Hunt, M.J.</creatorcontrib><creatorcontrib>Bridle, J.S.</creatorcontrib><title>Deriving articulatory representations from speech with various excitation modes</title><title>Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96</title><addtitle>ICSLP</addtitle><description>A new approach is described which estimates vocal tract shape sequences for speech consisting of voiceless speech and periods of silence as well as voiced speech. This method, based on the use of articulatory codebooks, has proved successful in identifying the place position of stops and fricatives. Secondly, the authors focus on voiced speech in particular. A fast analysis-by-synthesis scheme, which gives continuously-valued area estimates, has been developed. Savings in computation of 50:1 have been achieved by using an MLP to perform the synthesis in this method. The technique also allows a more complex dynamic model to be used.</description><subject>Background noise</subject><subject>Bandwidth</subject><subject>Character generation</subject><subject>Noise measurement</subject><subject>Parameter estimation</subject><subject>Resonance</subject><subject>Shape</subject><subject>Speech analysis</subject><subject>Speech synthesis</subject><subject>State estimation</subject><isbn>0780335554</isbn><isbn>9780780335554</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>1996</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNotj11LwzAYhQMiqHM_QK_yB1rzNh9tLqV-bFCYoF6PNHnrIms7kmy6f2-hnpvn5vBwDiF3wHIAph_W9XvzloPWKlesrDhckJuJjHMppbgiyxi_2RQhoQR1TTZPGPzJD1_UhOTtcW_SGM404CFgxCGZ5Mch0i6MPY0HRLujPz7t6MkEPx4jxV_r5xLtR4fxllx2Zh9x-c8F-Xx5_qhXWbN5XdePTeZBlClTjOtqWsQ1NyVoUxllEKx0bWEBbFUIoYxzRisnOyw6sK4Q0GroRGutk3xB7mevR8TtIfjehPN2vsz_ANvNTuc</recordid><startdate>1996</startdate><enddate>1996</enddate><creator>Richards, H.B.</creator><creator>Mason, J.S.</creator><creator>Hunt, M.J.</creator><creator>Bridle, J.S.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>1996</creationdate><title>Deriving articulatory representations from speech with various excitation modes</title><author>Richards, H.B. ; Mason, J.S. ; Hunt, M.J. ; Bridle, J.S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i147t-60398555393a719a8a6ae1c5db2c11c82446adda96d5fe2f1cd241b91f4bccd53</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>1996</creationdate><topic>Background noise</topic><topic>Bandwidth</topic><topic>Character generation</topic><topic>Noise measurement</topic><topic>Parameter estimation</topic><topic>Resonance</topic><topic>Shape</topic><topic>Speech analysis</topic><topic>Speech synthesis</topic><topic>State estimation</topic><toplevel>online_resources</toplevel><creatorcontrib>Richards, H.B.</creatorcontrib><creatorcontrib>Mason, J.S.</creatorcontrib><creatorcontrib>Hunt, M.J.</creatorcontrib><creatorcontrib>Bridle, J.S.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Richards, H.B.</au><au>Mason, J.S.</au><au>Hunt, M.J.</au><au>Bridle, J.S.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Deriving articulatory representations from speech with various excitation modes</atitle><btitle>Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96</btitle><stitle>ICSLP</stitle><date>1996</date><risdate>1996</risdate><volume>2</volume><spage>1233</spage><epage>1236 vol.2</epage><pages>1233-1236 vol.2</pages><isbn>0780335554</isbn><isbn>9780780335554</isbn><abstract>A new approach is described which estimates vocal tract shape sequences for speech consisting of voiceless speech and periods of silence as well as voiced speech. This method, based on the use of articulatory codebooks, has proved successful in identifying the place position of stops and fricatives. Secondly, the authors focus on voiced speech in particular. A fast analysis-by-synthesis scheme, which gives continuously-valued area estimates, has been developed. Savings in computation of 50:1 have been achieved by using an MLP to perform the synthesis in this method. The technique also allows a more complex dynamic model to be used.</abstract><pub>IEEE</pub><doi>10.1109/ICSLP.1996.607831</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISBN: 0780335554
ispartof	Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, 1996, Vol.2, p.1233-1236 vol.2
issn
language	eng
recordid	cdi_ieee_primary_607831
source	IEEE Electronic Library (IEL) Conference Proceedings
subjects	Background noise Bandwidth Character generation Noise measurement Parameter estimation Resonance Shape Speech analysis Speech synthesis State estimation
title	Deriving articulatory representations from speech with various excitation modes
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-21T21%3A56%3A48IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Deriving%20articulatory%20representations%20from%20speech%20with%20various%20excitation%20modes&rft.btitle=Proceeding%20of%20Fourth%20International%20Conference%20on%20Spoken%20Language%20Processing.%20ICSLP%20'96&rft.au=Richards,%20H.B.&rft.date=1996&rft.volume=2&rft.spage=1233&rft.epage=1236%20vol.2&rft.pages=1233-1236%20vol.2&rft.isbn=0780335554&rft.isbn_list=9780780335554&rft_id=info:doi/10.1109/ICSLP.1996.607831&rft_dat=%3Cieee_6IE%3E607831%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=607831&rfr_iscdi=true