ACOUSTIC MODEL CONDITIONING FOR SOUND CHARACTERISTIC

To provide a speech processing system and a method improving inaccuracy in speech recognition capability due to a rare environmental condition.SOLUTION: A method captures a segment of speech voice having a key phrase voice and speech production continuing immediately afterward, has an encoder calcul...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	ZIZU GOWAYYED, KEYVAN MOHAJER
Format:	Patent
Sprache:	eng ; jpn
Schlagworte:	ACOUSTICS MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	ZIZU GOWAYYED KEYVAN MOHAJER
description	To provide a speech processing system and a method improving inaccuracy in speech recognition capability due to a rare environmental condition.SOLUTION: A method captures a segment of speech voice having a key phrase voice and speech production continuing immediately afterward, has an encoder calculate sound embedding by using a segment corresponding to the key phrase, and estimates phenom from an utterance sound signal by using a model which is a sound model for voice recognition conditioned to sound embedding as input.SELECTED DRAWING: Figure 4A 【課題】珍しい環境条件による、スピーチ認識能力の不正確さを改善したスピーチ処理システム及び方法を提供する。【解決手段】方法は、キーフレーズ音声と、そのすぐ後に続く発話と、を有するスピーチ音声のセグメントをキャプチャし、エンコーダが、キーフレーズに対応するセグメントを用いてサウンド埋め込みを計算し、音声認識のための音響モデルが、入力としてのサウンド埋め込みに対して条件付けされたモデルを用いて、発話音声信号からの音素を推定する。【選択図】図４Ａ
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_JP2021184087A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>JP2021184087A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_JP2021184087A3</originalsourceid><addsrcrecordid>eNrjZDBxdPYPDQ7xdFbw9Xdx9VFw9vdz8Qzx9Pfz9HNXcPMPUgj2D_VzUXD2cAxydA5xDfIEqeVhYE1LzClO5YXS3AxKbq4hzh66qQX58anFBYnJqXmpJfFeAUYGRoaGFiYGFuaOxkQpAgAknSf7</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>ACOUSTIC MODEL CONDITIONING FOR SOUND CHARACTERISTIC</title><source>esp@cenet</source><creator>ZIZU GOWAYYED ; KEYVAN MOHAJER</creator><creatorcontrib>ZIZU GOWAYYED ; KEYVAN MOHAJER</creatorcontrib><description>To provide a speech processing system and a method improving inaccuracy in speech recognition capability due to a rare environmental condition.SOLUTION: A method captures a segment of speech voice having a key phrase voice and speech production continuing immediately afterward, has an encoder calculate sound embedding by using a segment corresponding to the key phrase, and estimates phenom from an utterance sound signal by using a model which is a sound model for voice recognition conditioned to sound embedding as input.SELECTED DRAWING: Figure 4A 【課題】珍しい環境条件による、スピーチ認識能力の不正確さを改善したスピーチ処理システム及び方法を提供する。【解決手段】方法は、キーフレーズ音声と、そのすぐ後に続く発話と、を有するスピーチ音声のセグメントをキャプチャし、エンコーダが、キーフレーズに対応するセグメントを用いてサウンド埋め込みを計算し、音声認識のための音響モデルが、入力としてのサウンド埋め込みに対して条件付けされたモデルを用いて、発話音声信号からの音素を推定する。【選択図】図４Ａ</description><language>eng ; jpn</language><subject>ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION</subject><creationdate>2021</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20211202&DB=EPODOC&CC=JP&NR=2021184087A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76516</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20211202&DB=EPODOC&CC=JP&NR=2021184087A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>ZIZU GOWAYYED</creatorcontrib><creatorcontrib>KEYVAN MOHAJER</creatorcontrib><title>ACOUSTIC MODEL CONDITIONING FOR SOUND CHARACTERISTIC</title><description>To provide a speech processing system and a method improving inaccuracy in speech recognition capability due to a rare environmental condition.SOLUTION: A method captures a segment of speech voice having a key phrase voice and speech production continuing immediately afterward, has an encoder calculate sound embedding by using a segment corresponding to the key phrase, and estimates phenom from an utterance sound signal by using a model which is a sound model for voice recognition conditioned to sound embedding as input.SELECTED DRAWING: Figure 4A 【課題】珍しい環境条件による、スピーチ認識能力の不正確さを改善したスピーチ処理システム及び方法を提供する。【解決手段】方法は、キーフレーズ音声と、そのすぐ後に続く発話と、を有するスピーチ音声のセグメントをキャプチャし、エンコーダが、キーフレーズに対応するセグメントを用いてサウンド埋め込みを計算し、音声認識のための音響モデルが、入力としてのサウンド埋め込みに対して条件付けされたモデルを用いて、発話音声信号からの音素を推定する。【選択図】図４Ａ</description><subject>ACOUSTICS</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2021</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZDBxdPYPDQ7xdFbw9Xdx9VFw9vdz8Qzx9Pfz9HNXcPMPUgj2D_VzUXD2cAxydA5xDfIEqeVhYE1LzClO5YXS3AxKbq4hzh66qQX58anFBYnJqXmpJfFeAUYGRoaGFiYGFuaOxkQpAgAknSf7</recordid><startdate>20211202</startdate><enddate>20211202</enddate><creator>ZIZU GOWAYYED</creator><creator>KEYVAN MOHAJER</creator><scope>EVB</scope></search><sort><creationdate>20211202</creationdate><title>ACOUSTIC MODEL CONDITIONING FOR SOUND CHARACTERISTIC</title><author>ZIZU GOWAYYED ; KEYVAN MOHAJER</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_JP2021184087A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng ; jpn</language><creationdate>2021</creationdate><topic>ACOUSTICS</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><toplevel>online_resources</toplevel><creatorcontrib>ZIZU GOWAYYED</creatorcontrib><creatorcontrib>KEYVAN MOHAJER</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>ZIZU GOWAYYED</au><au>KEYVAN MOHAJER</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>ACOUSTIC MODEL CONDITIONING FOR SOUND CHARACTERISTIC</title><date>2021-12-02</date><risdate>2021</risdate><abstract>To provide a speech processing system and a method improving inaccuracy in speech recognition capability due to a rare environmental condition.SOLUTION: A method captures a segment of speech voice having a key phrase voice and speech production continuing immediately afterward, has an encoder calculate sound embedding by using a segment corresponding to the key phrase, and estimates phenom from an utterance sound signal by using a model which is a sound model for voice recognition conditioned to sound embedding as input.SELECTED DRAWING: Figure 4A 【課題】珍しい環境条件による、スピーチ認識能力の不正確さを改善したスピーチ処理システム及び方法を提供する。【解決手段】方法は、キーフレーズ音声と、そのすぐ後に続く発話と、を有するスピーチ音声のセグメントをキャプチャし、エンコーダが、キーフレーズに対応するセグメントを用いてサウンド埋め込みを計算し、音声認識のための音響モデルが、入力としてのサウンド埋め込みに対して条件付けされたモデルを用いて、発話音声信号からの音素を推定する。【選択図】図４Ａ</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier
ispartof
issn
language	eng ; jpn
recordid	cdi_epo_espacenet_JP2021184087A
source	esp@cenet
subjects	ACOUSTICS MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
title	ACOUSTIC MODEL CONDITIONING FOR SOUND CHARACTERISTIC
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-14T09%3A24%3A10IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=ZIZU%20GOWAYYED&rft.date=2021-12-02&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EJP2021184087A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true