VOICE CONVERSION DEVICE, VOICE CONVERSION LEARNING DEVICE, IMAGE GENERATION DEVICE, IMAGE GENERATION LEARNING DEVICE, VOICE CONVERSION METHOD, VOICE CONVERSION LEARNING METHOD, IMAGE GENERATION METHOD, IMAGE GENERATION LEARNING METHOD, AND COMPUTER PROGRAM

This voice conversion device comprises a language information extraction unit that extracts language information corresponding to speech content from a voice signal of a conversion source, an appearance feature extraction unit that extracts an appearance feature representing a feature of a person�...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: KAMEOKA Hirokazu, KANEKO Takuhiro, TANAKA Ko, Valero Puche Aaron, OISHI Yasunori
Format: Patent
Sprache:eng ; fre ; jpn
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator KAMEOKA Hirokazu
KANEKO Takuhiro
TANAKA Ko
Valero Puche Aaron
OISHI Yasunori
description This voice conversion device comprises a language information extraction unit that extracts language information corresponding to speech content from a voice signal of a conversion source, an appearance feature extraction unit that extracts an appearance feature representing a feature of a person's face from a captured image in which the person is imaged, and a converted voice generation unit that generates a post-conversion voice on the basis of the language information and the appearance feature. Ce dispositif de conversion vocale comprend une unité d'extraction d'informations linguistiques qui extrait des informations linguistiques correspondant à un contenu vocal à partir d'un signal vocal d'une source de conversion, une unité d'extraction de caractéristique d'aspect qui extrait une caractéristique d'aspect représentant une caractéristique du visage d'une personne à partir d'une image capturée sur laquelle la personne est représentée, et une unité de génération de voix convertie qui génère une voix post-conversion sur la base des informations linguistiques et de la caractéristique d'aspect. 変換元の音声信号から発話内容に相当する言語情報を抽出する言語情報抽出部と、人物が撮像された撮像画像から人物の顔立ちの特徴を表す容貌特徴を抽出する容貌特徴抽出部と、言語情報と、容貌特徴とに基づいて、変換後の音声を生成する変換音声生成部と、を備える音声変換装置。
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_WO2021045194A1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>WO2021045194A1</sourcerecordid><originalsourceid>FETCH-epo_espacenet_WO2021045194A13</originalsourceid><addsrcrecordid>eNrjZGQI8_d0dlVw9vcLcw0K9vT3U3BxDQOK6ChgSPi4Ogb5efq5w1V4-jq6uyq4u_q5BjmGIGvFkMDQimG4r2uIh78LPlthKjAMxymBodXRzwVotG9AaIhrkEJAkL97kKMvDwNrWmJOcSovlOZmUHZzDXH20E0tyI9PLS5ITE7NSy2JD_c3MjAyNDAxNbQ0cTQ0Jk4VANyWXno</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>VOICE CONVERSION DEVICE, VOICE CONVERSION LEARNING DEVICE, IMAGE GENERATION DEVICE, IMAGE GENERATION LEARNING DEVICE, VOICE CONVERSION METHOD, VOICE CONVERSION LEARNING METHOD, IMAGE GENERATION METHOD, IMAGE GENERATION LEARNING METHOD, AND COMPUTER PROGRAM</title><source>esp@cenet</source><creator>KAMEOKA Hirokazu ; KANEKO Takuhiro ; TANAKA Ko ; Valero Puche Aaron ; OISHI Yasunori</creator><creatorcontrib>KAMEOKA Hirokazu ; KANEKO Takuhiro ; TANAKA Ko ; Valero Puche Aaron ; OISHI Yasunori</creatorcontrib><description>This voice conversion device comprises a language information extraction unit that extracts language information corresponding to speech content from a voice signal of a conversion source, an appearance feature extraction unit that extracts an appearance feature representing a feature of a person's face from a captured image in which the person is imaged, and a converted voice generation unit that generates a post-conversion voice on the basis of the language information and the appearance feature. Ce dispositif de conversion vocale comprend une unité d'extraction d'informations linguistiques qui extrait des informations linguistiques correspondant à un contenu vocal à partir d'un signal vocal d'une source de conversion, une unité d'extraction de caractéristique d'aspect qui extrait une caractéristique d'aspect représentant une caractéristique du visage d'une personne à partir d'une image capturée sur laquelle la personne est représentée, et une unité de génération de voix convertie qui génère une voix post-conversion sur la base des informations linguistiques et de la caractéristique d'aspect. 変換元の音声信号から発話内容に相当する言語情報を抽出する言語情報抽出部と、人物が撮像された撮像画像から人物の顔立ちの特徴を表す容貌特徴を抽出する容貌特徴抽出部と、言語情報と、容貌特徴とに基づいて、変換後の音声を生成する変換音声生成部と、を備える音声変換装置。</description><language>eng ; fre ; jpn</language><subject>ACOUSTICS ; CALCULATING ; COMPUTING ; COUNTING ; IMAGE DATA PROCESSING OR GENERATION, IN GENERAL ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION</subject><creationdate>2021</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20210311&amp;DB=EPODOC&amp;CC=WO&amp;NR=2021045194A1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20210311&amp;DB=EPODOC&amp;CC=WO&amp;NR=2021045194A1$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>KAMEOKA Hirokazu</creatorcontrib><creatorcontrib>KANEKO Takuhiro</creatorcontrib><creatorcontrib>TANAKA Ko</creatorcontrib><creatorcontrib>Valero Puche Aaron</creatorcontrib><creatorcontrib>OISHI Yasunori</creatorcontrib><title>VOICE CONVERSION DEVICE, VOICE CONVERSION LEARNING DEVICE, IMAGE GENERATION DEVICE, IMAGE GENERATION LEARNING DEVICE, VOICE CONVERSION METHOD, VOICE CONVERSION LEARNING METHOD, IMAGE GENERATION METHOD, IMAGE GENERATION LEARNING METHOD, AND COMPUTER PROGRAM</title><description>This voice conversion device comprises a language information extraction unit that extracts language information corresponding to speech content from a voice signal of a conversion source, an appearance feature extraction unit that extracts an appearance feature representing a feature of a person's face from a captured image in which the person is imaged, and a converted voice generation unit that generates a post-conversion voice on the basis of the language information and the appearance feature. Ce dispositif de conversion vocale comprend une unité d'extraction d'informations linguistiques qui extrait des informations linguistiques correspondant à un contenu vocal à partir d'un signal vocal d'une source de conversion, une unité d'extraction de caractéristique d'aspect qui extrait une caractéristique d'aspect représentant une caractéristique du visage d'une personne à partir d'une image capturée sur laquelle la personne est représentée, et une unité de génération de voix convertie qui génère une voix post-conversion sur la base des informations linguistiques et de la caractéristique d'aspect. 変換元の音声信号から発話内容に相当する言語情報を抽出する言語情報抽出部と、人物が撮像された撮像画像から人物の顔立ちの特徴を表す容貌特徴を抽出する容貌特徴抽出部と、言語情報と、容貌特徴とに基づいて、変換後の音声を生成する変換音声生成部と、を備える音声変換装置。</description><subject>ACOUSTICS</subject><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>IMAGE DATA PROCESSING OR GENERATION, IN GENERAL</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2021</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZGQI8_d0dlVw9vcLcw0K9vT3U3BxDQOK6ChgSPi4Ogb5efq5w1V4-jq6uyq4u_q5BjmGIGvFkMDQimG4r2uIh78LPlthKjAMxymBodXRzwVotG9AaIhrkEJAkL97kKMvDwNrWmJOcSovlOZmUHZzDXH20E0tyI9PLS5ITE7NSy2JD_c3MjAyNDAxNbQ0cTQ0Jk4VANyWXno</recordid><startdate>20210311</startdate><enddate>20210311</enddate><creator>KAMEOKA Hirokazu</creator><creator>KANEKO Takuhiro</creator><creator>TANAKA Ko</creator><creator>Valero Puche Aaron</creator><creator>OISHI Yasunori</creator><scope>EVB</scope></search><sort><creationdate>20210311</creationdate><title>VOICE CONVERSION DEVICE, VOICE CONVERSION LEARNING DEVICE, IMAGE GENERATION DEVICE, IMAGE GENERATION LEARNING DEVICE, VOICE CONVERSION METHOD, VOICE CONVERSION LEARNING METHOD, IMAGE GENERATION METHOD, IMAGE GENERATION LEARNING METHOD, AND COMPUTER PROGRAM</title><author>KAMEOKA Hirokazu ; KANEKO Takuhiro ; TANAKA Ko ; Valero Puche Aaron ; OISHI Yasunori</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_WO2021045194A13</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng ; fre ; jpn</language><creationdate>2021</creationdate><topic>ACOUSTICS</topic><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>IMAGE DATA PROCESSING OR GENERATION, IN GENERAL</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><toplevel>online_resources</toplevel><creatorcontrib>KAMEOKA Hirokazu</creatorcontrib><creatorcontrib>KANEKO Takuhiro</creatorcontrib><creatorcontrib>TANAKA Ko</creatorcontrib><creatorcontrib>Valero Puche Aaron</creatorcontrib><creatorcontrib>OISHI Yasunori</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>KAMEOKA Hirokazu</au><au>KANEKO Takuhiro</au><au>TANAKA Ko</au><au>Valero Puche Aaron</au><au>OISHI Yasunori</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>VOICE CONVERSION DEVICE, VOICE CONVERSION LEARNING DEVICE, IMAGE GENERATION DEVICE, IMAGE GENERATION LEARNING DEVICE, VOICE CONVERSION METHOD, VOICE CONVERSION LEARNING METHOD, IMAGE GENERATION METHOD, IMAGE GENERATION LEARNING METHOD, AND COMPUTER PROGRAM</title><date>2021-03-11</date><risdate>2021</risdate><abstract>This voice conversion device comprises a language information extraction unit that extracts language information corresponding to speech content from a voice signal of a conversion source, an appearance feature extraction unit that extracts an appearance feature representing a feature of a person's face from a captured image in which the person is imaged, and a converted voice generation unit that generates a post-conversion voice on the basis of the language information and the appearance feature. Ce dispositif de conversion vocale comprend une unité d'extraction d'informations linguistiques qui extrait des informations linguistiques correspondant à un contenu vocal à partir d'un signal vocal d'une source de conversion, une unité d'extraction de caractéristique d'aspect qui extrait une caractéristique d'aspect représentant une caractéristique du visage d'une personne à partir d'une image capturée sur laquelle la personne est représentée, et une unité de génération de voix convertie qui génère une voix post-conversion sur la base des informations linguistiques et de la caractéristique d'aspect. 変換元の音声信号から発話内容に相当する言語情報を抽出する言語情報抽出部と、人物が撮像された撮像画像から人物の顔立ちの特徴を表す容貌特徴を抽出する容貌特徴抽出部と、言語情報と、容貌特徴とに基づいて、変換後の音声を生成する変換音声生成部と、を備える音声変換装置。</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng ; fre ; jpn
recordid cdi_epo_espacenet_WO2021045194A1
source esp@cenet
subjects ACOUSTICS
CALCULATING
COMPUTING
COUNTING
IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
MUSICAL INSTRUMENTS
PHYSICS
SPEECH ANALYSIS OR SYNTHESIS
SPEECH OR AUDIO CODING OR DECODING
SPEECH OR VOICE PROCESSING
SPEECH RECOGNITION
title VOICE CONVERSION DEVICE, VOICE CONVERSION LEARNING DEVICE, IMAGE GENERATION DEVICE, IMAGE GENERATION LEARNING DEVICE, VOICE CONVERSION METHOD, VOICE CONVERSION LEARNING METHOD, IMAGE GENERATION METHOD, IMAGE GENERATION LEARNING METHOD, AND COMPUTER PROGRAM
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-25T03%3A40%3A47IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=KAMEOKA%20Hirokazu&rft.date=2021-03-11&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EWO2021045194A1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true