ACOUSTIC MODEL CONDITIONING FOR SOUND CHARACTERISTIC

To provide a speech processing system and a method improving inaccuracy in speech recognition capability due to a rare environmental condition.SOLUTION: A method captures a segment of speech voice having a key phrase voice and speech production continuing immediately afterward, has an encoder calcul...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: ZIZU GOWAYYED, KEYVAN MOHAJER
Format: Patent
Sprache:eng ; jpn
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator ZIZU GOWAYYED
KEYVAN MOHAJER
description To provide a speech processing system and a method improving inaccuracy in speech recognition capability due to a rare environmental condition.SOLUTION: A method captures a segment of speech voice having a key phrase voice and speech production continuing immediately afterward, has an encoder calculate sound embedding by using a segment corresponding to the key phrase, and estimates phenom from an utterance sound signal by using a model which is a sound model for voice recognition conditioned to sound embedding as input.SELECTED DRAWING: Figure 4A 【課題】珍しい環境条件による、スピーチ認識能力の不正確さを改善したスピーチ処理システム及び方法を提供する。【解決手段】方法は、キーフレーズ音声と、そのすぐ後に続く発話と、を有するスピーチ音声のセグメントをキャプチャし、エンコーダが、キーフレーズに対応するセグメントを用いてサウンド埋め込みを計算し、音声認識のための音響モデルが、入力としてのサウンド埋め込みに対して条件付けされたモデルを用いて、発話音声信号からの音素を推定する。【選択図】図4A
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_JP2021184087A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>JP2021184087A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_JP2021184087A3</originalsourceid><addsrcrecordid>eNrjZDBxdPYPDQ7xdFbw9Xdx9VFw9vdz8Qzx9Pfz9HNXcPMPUgj2D_VzUXD2cAxydA5xDfIEqeVhYE1LzClO5YXS3AxKbq4hzh66qQX58anFBYnJqXmpJfFeAUYGRoaGFiYGFuaOxkQpAgAknSf7</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>ACOUSTIC MODEL CONDITIONING FOR SOUND CHARACTERISTIC</title><source>esp@cenet</source><creator>ZIZU GOWAYYED ; KEYVAN MOHAJER</creator><creatorcontrib>ZIZU GOWAYYED ; KEYVAN MOHAJER</creatorcontrib><description>To provide a speech processing system and a method improving inaccuracy in speech recognition capability due to a rare environmental condition.SOLUTION: A method captures a segment of speech voice having a key phrase voice and speech production continuing immediately afterward, has an encoder calculate sound embedding by using a segment corresponding to the key phrase, and estimates phenom from an utterance sound signal by using a model which is a sound model for voice recognition conditioned to sound embedding as input.SELECTED DRAWING: Figure 4A 【課題】珍しい環境条件による、スピーチ認識能力の不正確さを改善したスピーチ処理システム及び方法を提供する。【解決手段】方法は、キーフレーズ音声と、そのすぐ後に続く発話と、を有するスピーチ音声のセグメントをキャプチャし、エンコーダが、キーフレーズに対応するセグメントを用いてサウンド埋め込みを計算し、音声認識のための音響モデルが、入力としてのサウンド埋め込みに対して条件付けされたモデルを用いて、発話音声信号からの音素を推定する。【選択図】図4A</description><language>eng ; jpn</language><subject>ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION</subject><creationdate>2021</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20211202&amp;DB=EPODOC&amp;CC=JP&amp;NR=2021184087A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76516</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20211202&amp;DB=EPODOC&amp;CC=JP&amp;NR=2021184087A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>ZIZU GOWAYYED</creatorcontrib><creatorcontrib>KEYVAN MOHAJER</creatorcontrib><title>ACOUSTIC MODEL CONDITIONING FOR SOUND CHARACTERISTIC</title><description>To provide a speech processing system and a method improving inaccuracy in speech recognition capability due to a rare environmental condition.SOLUTION: A method captures a segment of speech voice having a key phrase voice and speech production continuing immediately afterward, has an encoder calculate sound embedding by using a segment corresponding to the key phrase, and estimates phenom from an utterance sound signal by using a model which is a sound model for voice recognition conditioned to sound embedding as input.SELECTED DRAWING: Figure 4A 【課題】珍しい環境条件による、スピーチ認識能力の不正確さを改善したスピーチ処理システム及び方法を提供する。【解決手段】方法は、キーフレーズ音声と、そのすぐ後に続く発話と、を有するスピーチ音声のセグメントをキャプチャし、エンコーダが、キーフレーズに対応するセグメントを用いてサウンド埋め込みを計算し、音声認識のための音響モデルが、入力としてのサウンド埋め込みに対して条件付けされたモデルを用いて、発話音声信号からの音素を推定する。【選択図】図4A</description><subject>ACOUSTICS</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2021</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZDBxdPYPDQ7xdFbw9Xdx9VFw9vdz8Qzx9Pfz9HNXcPMPUgj2D_VzUXD2cAxydA5xDfIEqeVhYE1LzClO5YXS3AxKbq4hzh66qQX58anFBYnJqXmpJfFeAUYGRoaGFiYGFuaOxkQpAgAknSf7</recordid><startdate>20211202</startdate><enddate>20211202</enddate><creator>ZIZU GOWAYYED</creator><creator>KEYVAN MOHAJER</creator><scope>EVB</scope></search><sort><creationdate>20211202</creationdate><title>ACOUSTIC MODEL CONDITIONING FOR SOUND CHARACTERISTIC</title><author>ZIZU GOWAYYED ; KEYVAN MOHAJER</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_JP2021184087A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng ; jpn</language><creationdate>2021</creationdate><topic>ACOUSTICS</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><toplevel>online_resources</toplevel><creatorcontrib>ZIZU GOWAYYED</creatorcontrib><creatorcontrib>KEYVAN MOHAJER</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>ZIZU GOWAYYED</au><au>KEYVAN MOHAJER</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>ACOUSTIC MODEL CONDITIONING FOR SOUND CHARACTERISTIC</title><date>2021-12-02</date><risdate>2021</risdate><abstract>To provide a speech processing system and a method improving inaccuracy in speech recognition capability due to a rare environmental condition.SOLUTION: A method captures a segment of speech voice having a key phrase voice and speech production continuing immediately afterward, has an encoder calculate sound embedding by using a segment corresponding to the key phrase, and estimates phenom from an utterance sound signal by using a model which is a sound model for voice recognition conditioned to sound embedding as input.SELECTED DRAWING: Figure 4A 【課題】珍しい環境条件による、スピーチ認識能力の不正確さを改善したスピーチ処理システム及び方法を提供する。【解決手段】方法は、キーフレーズ音声と、そのすぐ後に続く発話と、を有するスピーチ音声のセグメントをキャプチャし、エンコーダが、キーフレーズに対応するセグメントを用いてサウンド埋め込みを計算し、音声認識のための音響モデルが、入力としてのサウンド埋め込みに対して条件付けされたモデルを用いて、発話音声信号からの音素を推定する。【選択図】図4A</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng ; jpn
recordid cdi_epo_espacenet_JP2021184087A
source esp@cenet
subjects ACOUSTICS
MUSICAL INSTRUMENTS
PHYSICS
SPEECH ANALYSIS OR SYNTHESIS
SPEECH OR AUDIO CODING OR DECODING
SPEECH OR VOICE PROCESSING
SPEECH RECOGNITION
title ACOUSTIC MODEL CONDITIONING FOR SOUND CHARACTERISTIC
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-14T09%3A24%3A10IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=ZIZU%20GOWAYYED&rft.date=2021-12-02&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EJP2021184087A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true