Deriving articulatory representations from speech with various excitation modes
A new approach is described which estimates vocal tract shape sequences for speech consisting of voiceless speech and periods of silence as well as voiced speech. This method, based on the use of articulatory codebooks, has proved successful in identifying the place position of stops and fricatives....
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Tagungsbericht |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 1236 vol.2 |
---|---|
container_issue | |
container_start_page | 1233 |
container_title | |
container_volume | 2 |
creator | Richards, H.B. Mason, J.S. Hunt, M.J. Bridle, J.S. |
description | A new approach is described which estimates vocal tract shape sequences for speech consisting of voiceless speech and periods of silence as well as voiced speech. This method, based on the use of articulatory codebooks, has proved successful in identifying the place position of stops and fricatives. Secondly, the authors focus on voiced speech in particular. A fast analysis-by-synthesis scheme, which gives continuously-valued area estimates, has been developed. Savings in computation of 50:1 have been achieved by using an MLP to perform the synthesis in this method. The technique also allows a more complex dynamic model to be used. |
doi_str_mv | 10.1109/ICSLP.1996.607831 |
format | Conference Proceeding |
fullrecord | <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_607831</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>607831</ieee_id><sourcerecordid>607831</sourcerecordid><originalsourceid>FETCH-LOGICAL-i147t-60398555393a719a8a6ae1c5db2c11c82446adda96d5fe2f1cd241b91f4bccd53</originalsourceid><addsrcrecordid>eNotj11LwzAYhQMiqHM_QK_yB1rzNh9tLqV-bFCYoF6PNHnrIms7kmy6f2-hnpvn5vBwDiF3wHIAph_W9XvzloPWKlesrDhckJuJjHMppbgiyxi_2RQhoQR1TTZPGPzJD1_UhOTtcW_SGM404CFgxCGZ5Mch0i6MPY0HRLujPz7t6MkEPx4jxV_r5xLtR4fxllx2Zh9x-c8F-Xx5_qhXWbN5XdePTeZBlClTjOtqWsQ1NyVoUxllEKx0bWEBbFUIoYxzRisnOyw6sK4Q0GroRGutk3xB7mevR8TtIfjehPN2vsz_ANvNTuc</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Deriving articulatory representations from speech with various excitation modes</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Richards, H.B. ; Mason, J.S. ; Hunt, M.J. ; Bridle, J.S.</creator><creatorcontrib>Richards, H.B. ; Mason, J.S. ; Hunt, M.J. ; Bridle, J.S.</creatorcontrib><description>A new approach is described which estimates vocal tract shape sequences for speech consisting of voiceless speech and periods of silence as well as voiced speech. This method, based on the use of articulatory codebooks, has proved successful in identifying the place position of stops and fricatives. Secondly, the authors focus on voiced speech in particular. A fast analysis-by-synthesis scheme, which gives continuously-valued area estimates, has been developed. Savings in computation of 50:1 have been achieved by using an MLP to perform the synthesis in this method. The technique also allows a more complex dynamic model to be used.</description><identifier>ISBN: 0780335554</identifier><identifier>ISBN: 9780780335554</identifier><identifier>DOI: 10.1109/ICSLP.1996.607831</identifier><language>eng</language><publisher>IEEE</publisher><subject>Background noise ; Bandwidth ; Character generation ; Noise measurement ; Parameter estimation ; Resonance ; Shape ; Speech analysis ; Speech synthesis ; State estimation</subject><ispartof>Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, 1996, Vol.2, p.1233-1236 vol.2</ispartof><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/607831$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,2052,4036,4037,27902,54895</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/607831$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Richards, H.B.</creatorcontrib><creatorcontrib>Mason, J.S.</creatorcontrib><creatorcontrib>Hunt, M.J.</creatorcontrib><creatorcontrib>Bridle, J.S.</creatorcontrib><title>Deriving articulatory representations from speech with various excitation modes</title><title>Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96</title><addtitle>ICSLP</addtitle><description>A new approach is described which estimates vocal tract shape sequences for speech consisting of voiceless speech and periods of silence as well as voiced speech. This method, based on the use of articulatory codebooks, has proved successful in identifying the place position of stops and fricatives. Secondly, the authors focus on voiced speech in particular. A fast analysis-by-synthesis scheme, which gives continuously-valued area estimates, has been developed. Savings in computation of 50:1 have been achieved by using an MLP to perform the synthesis in this method. The technique also allows a more complex dynamic model to be used.</description><subject>Background noise</subject><subject>Bandwidth</subject><subject>Character generation</subject><subject>Noise measurement</subject><subject>Parameter estimation</subject><subject>Resonance</subject><subject>Shape</subject><subject>Speech analysis</subject><subject>Speech synthesis</subject><subject>State estimation</subject><isbn>0780335554</isbn><isbn>9780780335554</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>1996</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNotj11LwzAYhQMiqHM_QK_yB1rzNh9tLqV-bFCYoF6PNHnrIms7kmy6f2-hnpvn5vBwDiF3wHIAph_W9XvzloPWKlesrDhckJuJjHMppbgiyxi_2RQhoQR1TTZPGPzJD1_UhOTtcW_SGM404CFgxCGZ5Mch0i6MPY0HRLujPz7t6MkEPx4jxV_r5xLtR4fxllx2Zh9x-c8F-Xx5_qhXWbN5XdePTeZBlClTjOtqWsQ1NyVoUxllEKx0bWEBbFUIoYxzRisnOyw6sK4Q0GroRGutk3xB7mevR8TtIfjehPN2vsz_ANvNTuc</recordid><startdate>1996</startdate><enddate>1996</enddate><creator>Richards, H.B.</creator><creator>Mason, J.S.</creator><creator>Hunt, M.J.</creator><creator>Bridle, J.S.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>1996</creationdate><title>Deriving articulatory representations from speech with various excitation modes</title><author>Richards, H.B. ; Mason, J.S. ; Hunt, M.J. ; Bridle, J.S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i147t-60398555393a719a8a6ae1c5db2c11c82446adda96d5fe2f1cd241b91f4bccd53</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>1996</creationdate><topic>Background noise</topic><topic>Bandwidth</topic><topic>Character generation</topic><topic>Noise measurement</topic><topic>Parameter estimation</topic><topic>Resonance</topic><topic>Shape</topic><topic>Speech analysis</topic><topic>Speech synthesis</topic><topic>State estimation</topic><toplevel>online_resources</toplevel><creatorcontrib>Richards, H.B.</creatorcontrib><creatorcontrib>Mason, J.S.</creatorcontrib><creatorcontrib>Hunt, M.J.</creatorcontrib><creatorcontrib>Bridle, J.S.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Richards, H.B.</au><au>Mason, J.S.</au><au>Hunt, M.J.</au><au>Bridle, J.S.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Deriving articulatory representations from speech with various excitation modes</atitle><btitle>Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96</btitle><stitle>ICSLP</stitle><date>1996</date><risdate>1996</risdate><volume>2</volume><spage>1233</spage><epage>1236 vol.2</epage><pages>1233-1236 vol.2</pages><isbn>0780335554</isbn><isbn>9780780335554</isbn><abstract>A new approach is described which estimates vocal tract shape sequences for speech consisting of voiceless speech and periods of silence as well as voiced speech. This method, based on the use of articulatory codebooks, has proved successful in identifying the place position of stops and fricatives. Secondly, the authors focus on voiced speech in particular. A fast analysis-by-synthesis scheme, which gives continuously-valued area estimates, has been developed. Savings in computation of 50:1 have been achieved by using an MLP to perform the synthesis in this method. The technique also allows a more complex dynamic model to be used.</abstract><pub>IEEE</pub><doi>10.1109/ICSLP.1996.607831</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISBN: 0780335554 |
ispartof | Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96, 1996, Vol.2, p.1233-1236 vol.2 |
issn | |
language | eng |
recordid | cdi_ieee_primary_607831 |
source | IEEE Electronic Library (IEL) Conference Proceedings |
subjects | Background noise Bandwidth Character generation Noise measurement Parameter estimation Resonance Shape Speech analysis Speech synthesis State estimation |
title | Deriving articulatory representations from speech with various excitation modes |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-21T21%3A56%3A48IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Deriving%20articulatory%20representations%20from%20speech%20with%20various%20excitation%20modes&rft.btitle=Proceeding%20of%20Fourth%20International%20Conference%20on%20Spoken%20Language%20Processing.%20ICSLP%20'96&rft.au=Richards,%20H.B.&rft.date=1996&rft.volume=2&rft.spage=1233&rft.epage=1236%20vol.2&rft.pages=1233-1236%20vol.2&rft.isbn=0780335554&rft.isbn_list=9780780335554&rft_id=info:doi/10.1109/ICSLP.1996.607831&rft_dat=%3Cieee_6IE%3E607831%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=607831&rfr_iscdi=true |