Optimal Phonemic Environmental Range to Select VCV Instances for VCV Speech Synthesis by Rule

We proposed two selection methods, 1) selection method by using phonemic environmental resemblance score (PER method), and 2) selection method by searching minimal connective distortion path (MLD method), for small scale speech synthesis system. PER method requires phonemic environmental information...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Denki Gakkai ronbunshi. C, Erekutoronikusu, joho kogaku, shisutemu Information and Systems, 2003, Vol.123(3), pp.467-474
Hauptverfasser:	Shimizu, Tadaaki, Kimoto, Masaya, Yoshimura, Hiroki, Namiki, Toshie, Isu, Naoki, Sugata, Kazuhiro
Format:	Artikel
Sprache:	eng
Schlagworte:	connective distortion LSP analysis phonemic environment synthesis unit selection VCV Speech synthesis
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	474
container_issue	3
container_start_page	467
container_title	Denki Gakkai ronbunshi. C, Erekutoronikusu, joho kogaku, shisutemu
container_volume	123
creator	Shimizu, Tadaaki Kimoto, Masaya Yoshimura, Hiroki Namiki, Toshie Isu, Naoki Sugata, Kazuhiro
description	We proposed two selection methods, 1) selection method by using phonemic environmental resemblance score (PER method), and 2) selection method by searching minimal connective distortion path (MLD method), for small scale speech synthesis system. PER method requires phonemic environmental information for each VCV instance in a VCV unit dictionary. This paper investigated experimentally to what extent we can reduce the phonemic environmental information with keeping high quality of synthesized speech. We verified that two phonemes frontward and one phoneme rearward range to a current VCV instance is enough to synthesize similar quality of speech as five phonemes frontward and five phonemes rearward. This result gives an experimental basis on minimizing a size of VCV unit dictionary.
doi_str_mv	10.1541/ieejeiss.123.467
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_1433876233</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3076300641</sourcerecordid><originalsourceid>FETCH-LOGICAL-c2447-70c799f7d727135d0c77a2578ca471821ce2ee1c2cba6d7818defaf0520af3213</originalsourceid><addsrcrecordid>eNpVkEFrAjEQhUNpoWK99xjoeW0myZp4LKKtIFi09VZCjLO6smZtshb8902rFXqZYR7vm2EeIffAupBLeCwRt1jG2AUuurKnrkgLhNSZhjy_Ji0mdJ5JzuGWdGIsl4xxqYUCaJGP6b4pd7air5va4650dOi_ylD7HfomyTPr10ibms6xQtfQxWBBxz421juMtKjDrzLfI7oNnR99s8FYRro80tmhwjtyU9gqYufc2-R9NHwbvGST6fN48DTJHJdSZYo51e8XaqW4ApGv0qgsz5V2VirQHBxyRHDcLW1vpTToFRa2YDlnthAcRJs8nPbuQ_15wNiYbX0IPp00IIXQqseFSC52crlQxxiwMPuQfg9HA8z85Gj-cjQpR5NyTMjohGzTy2u8ADY0pavwPyDONYEXg9vYYNCLb112gK8</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1433876233</pqid></control><display><type>article</type><title>Optimal Phonemic Environmental Range to Select VCV Instances for VCV Speech Synthesis by Rule</title><source>EZB-FREE-00999 freely available EZB journals</source><creator>Shimizu, Tadaaki ; Kimoto, Masaya ; Yoshimura, Hiroki ; Namiki, Toshie ; Isu, Naoki ; Sugata, Kazuhiro</creator><creatorcontrib>Shimizu, Tadaaki ; Kimoto, Masaya ; Yoshimura, Hiroki ; Namiki, Toshie ; Isu, Naoki ; Sugata, Kazuhiro</creatorcontrib><description>We proposed two selection methods, 1) selection method by using phonemic environmental resemblance score (PER method), and 2) selection method by searching minimal connective distortion path (MLD method), for small scale speech synthesis system. PER method requires phonemic environmental information for each VCV instance in a VCV unit dictionary. This paper investigated experimentally to what extent we can reduce the phonemic environmental information with keeping high quality of synthesized speech. We verified that two phonemes frontward and one phoneme rearward range to a current VCV instance is enough to synthesize similar quality of speech as five phonemes frontward and five phonemes rearward. This result gives an experimental basis on minimizing a size of VCV unit dictionary.</description><identifier>ISSN: 0385-4221</identifier><identifier>EISSN: 1348-8155</identifier><identifier>DOI: 10.1541/ieejeiss.123.467</identifier><language>eng</language><publisher>Tokyo: The Institute of Electrical Engineers of Japan</publisher><subject>connective distortion ; LSP analysis ; phonemic environment ; synthesis unit selection ; VCV Speech synthesis</subject><ispartof>IEEJ Transactions on Electronics, Information and Systems, 2003, Vol.123(3), pp.467-474</ispartof><rights>2003 by the Institute of Electrical Engineers of Japan</rights><rights>Copyright Japan Science and Technology Agency 2003</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c2447-70c799f7d727135d0c77a2578ca471821ce2ee1c2cba6d7818defaf0520af3213</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,4021,27921,27922,27923</link.rule.ids></links><search><creatorcontrib>Shimizu, Tadaaki</creatorcontrib><creatorcontrib>Kimoto, Masaya</creatorcontrib><creatorcontrib>Yoshimura, Hiroki</creatorcontrib><creatorcontrib>Namiki, Toshie</creatorcontrib><creatorcontrib>Isu, Naoki</creatorcontrib><creatorcontrib>Sugata, Kazuhiro</creatorcontrib><title>Optimal Phonemic Environmental Range to Select VCV Instances for VCV Speech Synthesis by Rule</title><title>Denki Gakkai ronbunshi. C, Erekutoronikusu, joho kogaku, shisutemu</title><addtitle>IEEJ Trans. EIS</addtitle><description>We proposed two selection methods, 1) selection method by using phonemic environmental resemblance score (PER method), and 2) selection method by searching minimal connective distortion path (MLD method), for small scale speech synthesis system. PER method requires phonemic environmental information for each VCV instance in a VCV unit dictionary. This paper investigated experimentally to what extent we can reduce the phonemic environmental information with keeping high quality of synthesized speech. We verified that two phonemes frontward and one phoneme rearward range to a current VCV instance is enough to synthesize similar quality of speech as five phonemes frontward and five phonemes rearward. This result gives an experimental basis on minimizing a size of VCV unit dictionary.</description><subject>connective distortion</subject><subject>LSP analysis</subject><subject>phonemic environment</subject><subject>synthesis unit selection</subject><subject>VCV Speech synthesis</subject><issn>0385-4221</issn><issn>1348-8155</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2003</creationdate><recordtype>article</recordtype><recordid>eNpVkEFrAjEQhUNpoWK99xjoeW0myZp4LKKtIFi09VZCjLO6smZtshb8902rFXqZYR7vm2EeIffAupBLeCwRt1jG2AUuurKnrkgLhNSZhjy_Ji0mdJ5JzuGWdGIsl4xxqYUCaJGP6b4pd7air5va4650dOi_ylD7HfomyTPr10ibms6xQtfQxWBBxz421juMtKjDrzLfI7oNnR99s8FYRro80tmhwjtyU9gqYufc2-R9NHwbvGST6fN48DTJHJdSZYo51e8XaqW4ApGv0qgsz5V2VirQHBxyRHDcLW1vpTToFRa2YDlnthAcRJs8nPbuQ_15wNiYbX0IPp00IIXQqseFSC52crlQxxiwMPuQfg9HA8z85Gj-cjQpR5NyTMjohGzTy2u8ADY0pavwPyDONYEXg9vYYNCLb112gK8</recordid><startdate>2003</startdate><enddate>2003</enddate><creator>Shimizu, Tadaaki</creator><creator>Kimoto, Masaya</creator><creator>Yoshimura, Hiroki</creator><creator>Namiki, Toshie</creator><creator>Isu, Naoki</creator><creator>Sugata, Kazuhiro</creator><general>The Institute of Electrical Engineers of Japan</general><general>Japan Science and Technology Agency</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>2003</creationdate><title>Optimal Phonemic Environmental Range to Select VCV Instances for VCV Speech Synthesis by Rule</title><author>Shimizu, Tadaaki ; Kimoto, Masaya ; Yoshimura, Hiroki ; Namiki, Toshie ; Isu, Naoki ; Sugata, Kazuhiro</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c2447-70c799f7d727135d0c77a2578ca471821ce2ee1c2cba6d7818defaf0520af3213</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2003</creationdate><topic>connective distortion</topic><topic>LSP analysis</topic><topic>phonemic environment</topic><topic>synthesis unit selection</topic><topic>VCV Speech synthesis</topic><toplevel>online_resources</toplevel><creatorcontrib>Shimizu, Tadaaki</creatorcontrib><creatorcontrib>Kimoto, Masaya</creatorcontrib><creatorcontrib>Yoshimura, Hiroki</creatorcontrib><creatorcontrib>Namiki, Toshie</creatorcontrib><creatorcontrib>Isu, Naoki</creatorcontrib><creatorcontrib>Sugata, Kazuhiro</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Denki Gakkai ronbunshi. C, Erekutoronikusu, joho kogaku, shisutemu</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Shimizu, Tadaaki</au><au>Kimoto, Masaya</au><au>Yoshimura, Hiroki</au><au>Namiki, Toshie</au><au>Isu, Naoki</au><au>Sugata, Kazuhiro</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Optimal Phonemic Environmental Range to Select VCV Instances for VCV Speech Synthesis by Rule</atitle><jtitle>Denki Gakkai ronbunshi. C, Erekutoronikusu, joho kogaku, shisutemu</jtitle><addtitle>IEEJ Trans. EIS</addtitle><date>2003</date><risdate>2003</risdate><volume>123</volume><issue>3</issue><spage>467</spage><epage>474</epage><pages>467-474</pages><issn>0385-4221</issn><eissn>1348-8155</eissn><abstract>We proposed two selection methods, 1) selection method by using phonemic environmental resemblance score (PER method), and 2) selection method by searching minimal connective distortion path (MLD method), for small scale speech synthesis system. PER method requires phonemic environmental information for each VCV instance in a VCV unit dictionary. This paper investigated experimentally to what extent we can reduce the phonemic environmental information with keeping high quality of synthesized speech. We verified that two phonemes frontward and one phoneme rearward range to a current VCV instance is enough to synthesize similar quality of speech as five phonemes frontward and five phonemes rearward. This result gives an experimental basis on minimizing a size of VCV unit dictionary.</abstract><cop>Tokyo</cop><pub>The Institute of Electrical Engineers of Japan</pub><doi>10.1541/ieejeiss.123.467</doi><tpages>8</tpages><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 0385-4221
ispartof	IEEJ Transactions on Electronics, Information and Systems, 2003, Vol.123(3), pp.467-474
issn	0385-4221 1348-8155
language	eng
recordid	cdi_proquest_journals_1433876233
source	EZB-FREE-00999 freely available EZB journals
subjects	connective distortion LSP analysis phonemic environment synthesis unit selection VCV Speech synthesis
title	Optimal Phonemic Environmental Range to Select VCV Instances for VCV Speech Synthesis by Rule
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-13T12%3A18%3A07IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Optimal%20Phonemic%20Environmental%20Range%20to%20Select%20VCV%20Instances%20for%20VCV%20Speech%20Synthesis%20by%20Rule&rft.jtitle=Denki%20Gakkai%20ronbunshi.%20C,%20Erekutoronikusu,%20joho%20kogaku,%20shisutemu&rft.au=Shimizu,%20Tadaaki&rft.date=2003&rft.volume=123&rft.issue=3&rft.spage=467&rft.epage=474&rft.pages=467-474&rft.issn=0385-4221&rft.eissn=1348-8155&rft_id=info:doi/10.1541/ieejeiss.123.467&rft_dat=%3Cproquest_cross%3E3076300641%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1433876233&rft_id=info:pmid/&rfr_iscdi=true