Optimal Phonemic Environmental Range to Select VCV Instances for VCV Speech Synthesis by Rule

We proposed two selection methods for small-scale speech synthesis systems: 1) selection using a phonemic environmental resemblance score (PER method), and 2) selection by searching for the path of minimal connective distortion (MLD method). The PER method requires phonemic environmental information...

Detailed description

Bibliographic details
Published in: Denki Gakkai ronbunshi. C, Erekutoronikusu, joho kogaku, shisutemu Information and Systems, 2003, Vol.123(3), pp.467-474
Main authors: Shimizu, Tadaaki; Kimoto, Masaya; Yoshimura, Hiroki; Namiki, Toshie; Isu, Naoki; Sugata, Kazuhiro
Format: Article
Language: eng
Subjects:
Online access: Full text
container_end_page 474
container_issue 3
container_start_page 467
container_title Denki Gakkai ronbunshi. C, Erekutoronikusu, joho kogaku, shisutemu
container_volume 123
creator Shimizu, Tadaaki
Kimoto, Masaya
Yoshimura, Hiroki
Namiki, Toshie
Isu, Naoki
Sugata, Kazuhiro
description We proposed two selection methods for small-scale speech synthesis systems: 1) selection using a phonemic environmental resemblance score (PER method), and 2) selection by searching for the path of minimal connective distortion (MLD method). The PER method requires phonemic environmental information for each VCV instance in the VCV unit dictionary. This paper investigates experimentally how far that phonemic environmental information can be reduced while keeping the quality of the synthesized speech high. We verified that a range of two phonemes before and one phoneme after the current VCV instance is enough to synthesize speech of similar quality to that obtained with a range of five phonemes before and five phonemes after. This result gives an experimental basis for minimizing the size of the VCV unit dictionary.
doi_str_mv 10.1541/ieejeiss.123.467
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_1433876233</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3076300641</sourcerecordid><originalsourceid>FETCH-LOGICAL-c2447-70c799f7d727135d0c77a2578ca471821ce2ee1c2cba6d7818defaf0520af3213</originalsourceid><addsrcrecordid>eNpVkEFrAjEQhUNpoWK99xjoeW0myZp4LKKtIFi09VZCjLO6smZtshb8902rFXqZYR7vm2EeIffAupBLeCwRt1jG2AUuurKnrkgLhNSZhjy_Ji0mdJ5JzuGWdGIsl4xxqYUCaJGP6b4pd7air5va4650dOi_ylD7HfomyTPr10ibms6xQtfQxWBBxz421juMtKjDrzLfI7oNnR99s8FYRro80tmhwjtyU9gqYufc2-R9NHwbvGST6fN48DTJHJdSZYo51e8XaqW4ApGv0qgsz5V2VirQHBxyRHDcLW1vpTToFRa2YDlnthAcRJs8nPbuQ_15wNiYbX0IPp00IIXQqseFSC52crlQxxiwMPuQfg9HA8z85Gj-cjQpR5NyTMjohGzTy2u8ADY0pavwPyDONYEXg9vYYNCLb112gK8</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1433876233</pqid></control><display><type>article</type><title>Optimal Phonemic Environmental Range to Select VCV Instances for VCV Speech Synthesis by Rule</title><source>EZB-FREE-00999 freely available EZB journals</source><creator>Shimizu, Tadaaki ; Kimoto, Masaya ; Yoshimura, Hiroki ; Namiki, Toshie ; Isu, Naoki ; Sugata, Kazuhiro</creator><creatorcontrib>Shimizu, Tadaaki ; Kimoto, Masaya ; Yoshimura, Hiroki ; Namiki, Toshie ; Isu, Naoki ; Sugata, Kazuhiro</creatorcontrib><description>We proposed two selection methods, 1) selection method by using phonemic environmental resemblance score (PER method), and 2) selection method by searching minimal connective distortion path (MLD method), for small scale speech synthesis system. PER method requires phonemic environmental information for each VCV instance in a VCV unit dictionary. This paper investigated experimentally to what extent we can reduce the phonemic environmental information with keeping high quality of synthesized speech. 
We verified that two phonemes frontward and one phoneme rearward range to a current VCV instance is enough to synthesize similar quality of speech as five phonemes frontward and five phonemes rearward. This result gives an experimental basis on minimizing a size of VCV unit dictionary.</description><identifier>ISSN: 0385-4221</identifier><identifier>EISSN: 1348-8155</identifier><identifier>DOI: 10.1541/ieejeiss.123.467</identifier><language>eng</language><publisher>Tokyo: The Institute of Electrical Engineers of Japan</publisher><subject>connective distortion ; LSP analysis ; phonemic environment ; synthesis unit selection ; VCV Speech synthesis</subject><ispartof>IEEJ Transactions on Electronics, Information and Systems, 2003, Vol.123(3), pp.467-474</ispartof><rights>2003 by the Institute of Electrical Engineers of Japan</rights><rights>Copyright Japan Science and Technology Agency 2003</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c2447-70c799f7d727135d0c77a2578ca471821ce2ee1c2cba6d7818defaf0520af3213</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,4021,27921,27922,27923</link.rule.ids></links><search><creatorcontrib>Shimizu, Tadaaki</creatorcontrib><creatorcontrib>Kimoto, Masaya</creatorcontrib><creatorcontrib>Yoshimura, Hiroki</creatorcontrib><creatorcontrib>Namiki, Toshie</creatorcontrib><creatorcontrib>Isu, Naoki</creatorcontrib><creatorcontrib>Sugata, Kazuhiro</creatorcontrib><title>Optimal Phonemic Environmental Range to Select VCV Instances for VCV Speech Synthesis by Rule</title><title>Denki Gakkai ronbunshi. C, Erekutoronikusu, joho kogaku, shisutemu</title><addtitle>IEEJ Trans. 
EIS</addtitle><description>We proposed two selection methods, 1) selection method by using phonemic environmental resemblance score (PER method), and 2) selection method by searching minimal connective distortion path (MLD method), for small scale speech synthesis system. PER method requires phonemic environmental information for each VCV instance in a VCV unit dictionary. This paper investigated experimentally to what extent we can reduce the phonemic environmental information with keeping high quality of synthesized speech. We verified that two phonemes frontward and one phoneme rearward range to a current VCV instance is enough to synthesize similar quality of speech as five phonemes frontward and five phonemes rearward. This result gives an experimental basis on minimizing a size of VCV unit dictionary.</description><subject>connective distortion</subject><subject>LSP analysis</subject><subject>phonemic environment</subject><subject>synthesis unit selection</subject><subject>VCV Speech synthesis</subject><issn>0385-4221</issn><issn>1348-8155</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2003</creationdate><recordtype>article</recordtype><recordid>eNpVkEFrAjEQhUNpoWK99xjoeW0myZp4LKKtIFi09VZCjLO6smZtshb8902rFXqZYR7vm2EeIffAupBLeCwRt1jG2AUuurKnrkgLhNSZhjy_Ji0mdJ5JzuGWdGIsl4xxqYUCaJGP6b4pd7air5va4650dOi_ylD7HfomyTPr10ibms6xQtfQxWBBxz421juMtKjDrzLfI7oNnR99s8FYRro80tmhwjtyU9gqYufc2-R9NHwbvGST6fN48DTJHJdSZYo51e8XaqW4ApGv0qgsz5V2VirQHBxyRHDcLW1vpTToFRa2YDlnthAcRJs8nPbuQ_15wNiYbX0IPp00IIXQqseFSC52crlQxxiwMPuQfg9HA8z85Gj-cjQpR5NyTMjohGzTy2u8ADY0pavwPyDONYEXg9vYYNCLb112gK8</recordid><startdate>2003</startdate><enddate>2003</enddate><creator>Shimizu, Tadaaki</creator><creator>Kimoto, Masaya</creator><creator>Yoshimura, Hiroki</creator><creator>Namiki, Toshie</creator><creator>Isu, Naoki</creator><creator>Sugata, Kazuhiro</creator><general>The Institute of Electrical Engineers of Japan</general><general>Japan Science and Technology 
Agency</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>2003</creationdate><title>Optimal Phonemic Environmental Range to Select VCV Instances for VCV Speech Synthesis by Rule</title><author>Shimizu, Tadaaki ; Kimoto, Masaya ; Yoshimura, Hiroki ; Namiki, Toshie ; Isu, Naoki ; Sugata, Kazuhiro</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c2447-70c799f7d727135d0c77a2578ca471821ce2ee1c2cba6d7818defaf0520af3213</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2003</creationdate><topic>connective distortion</topic><topic>LSP analysis</topic><topic>phonemic environment</topic><topic>synthesis unit selection</topic><topic>VCV Speech synthesis</topic><toplevel>online_resources</toplevel><creatorcontrib>Shimizu, Tadaaki</creatorcontrib><creatorcontrib>Kimoto, Masaya</creatorcontrib><creatorcontrib>Yoshimura, Hiroki</creatorcontrib><creatorcontrib>Namiki, Toshie</creatorcontrib><creatorcontrib>Isu, Naoki</creatorcontrib><creatorcontrib>Sugata, Kazuhiro</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Denki Gakkai ronbunshi. 
C, Erekutoronikusu, joho kogaku, shisutemu</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Shimizu, Tadaaki</au><au>Kimoto, Masaya</au><au>Yoshimura, Hiroki</au><au>Namiki, Toshie</au><au>Isu, Naoki</au><au>Sugata, Kazuhiro</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Optimal Phonemic Environmental Range to Select VCV Instances for VCV Speech Synthesis by Rule</atitle><jtitle>Denki Gakkai ronbunshi. C, Erekutoronikusu, joho kogaku, shisutemu</jtitle><addtitle>IEEJ Trans. EIS</addtitle><date>2003</date><risdate>2003</risdate><volume>123</volume><issue>3</issue><spage>467</spage><epage>474</epage><pages>467-474</pages><issn>0385-4221</issn><eissn>1348-8155</eissn><abstract>We proposed two selection methods, 1) selection method by using phonemic environmental resemblance score (PER method), and 2) selection method by searching minimal connective distortion path (MLD method), for small scale speech synthesis system. PER method requires phonemic environmental information for each VCV instance in a VCV unit dictionary. This paper investigated experimentally to what extent we can reduce the phonemic environmental information with keeping high quality of synthesized speech. We verified that two phonemes frontward and one phoneme rearward range to a current VCV instance is enough to synthesize similar quality of speech as five phonemes frontward and five phonemes rearward. This result gives an experimental basis on minimizing a size of VCV unit dictionary.</abstract><cop>Tokyo</cop><pub>The Institute of Electrical Engineers of Japan</pub><doi>10.1541/ieejeiss.123.467</doi><tpages>8</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0385-4221
ispartof IEEJ Transactions on Electronics, Information and Systems, 2003, Vol.123(3), pp.467-474
issn 0385-4221
1348-8155
language eng
recordid cdi_proquest_journals_1433876233
source EZB-FREE-00999 freely available EZB journals
subjects connective distortion
LSP analysis
phonemic environment
synthesis unit selection
VCV Speech synthesis
title Optimal Phonemic Environmental Range to Select VCV Instances for VCV Speech Synthesis by Rule
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-13T12%3A18%3A07IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Optimal%20Phonemic%20Environmental%20Range%20to%20Select%20VCV%20Instances%20for%20VCV%20Speech%20Synthesis%20by%20Rule&rft.jtitle=Denki%20Gakkai%20ronbunshi.%20C,%20Erekutoronikusu,%20joho%20kogaku,%20shisutemu&rft.au=Shimizu,%20Tadaaki&rft.date=2003&rft.volume=123&rft.issue=3&rft.spage=467&rft.epage=474&rft.pages=467-474&rft.issn=0385-4221&rft.eissn=1348-8155&rft_id=info:doi/10.1541/ieejeiss.123.467&rft_dat=%3Cproquest_cross%3E3076300641%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1433876233&rft_id=info:pmid/&rfr_iscdi=true