VOICE RECOGNITION METHOD, DEVICE, APPARATUS, AND COMPUTER READABLE STORAGE MEDIUM

To provide a voice recognition method based on double decoding in which acoustic variety of one acoustic model covers a disadvantage that there are few acoustic paths of other acoustic models, two decoding paths are independent from each other, a decoding space is expanded, and accordingly the accur...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	SHAO JUNYAO, JIA LEI, PENG XINGYUAN
Format:	Patent
Sprache:	eng ; jpn
Schlagworte:	ACOUSTICS MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	SHAO JUNYAO JIA LEI PENG XINGYUAN
description	To provide a voice recognition method based on double decoding in which acoustic variety of one acoustic model covers a disadvantage that there are few acoustic paths of other acoustic models, two decoding paths are independent from each other, a decoding space is expanded, and accordingly the accuracy of voice recognition can be improved.SOLUTION: The voice recognition method includes: for an inputted voice signal, obtaining first acoustic decoded information by a first acoustic model, and obtaining second acoustic decoded information by a second acoustic model generated by joint modeling of sound and language; and determining a first set of candidate recognition results based on the first acoustic decoded information, determining a second set of candidate recognition results based on the second acoustic decoded information, and determining a final recognition result for the voice signal based on these two sets of candidate recognition results.SELECTED DRAWING: Figure 2 【課題】１つの音響モデルの音響多様性により他の音響モデルの音響経路が少ないという欠点を補い、２つの復号化経路が互いに独立しており、復号化空間が拡張され、それにより音声認識の正確率が向上されることができる二重復号化に基づく音声認識方法を提供すう。【解決手段】音声認識方法は、入力された音声信号に対して、第１の音響モデルにより第１の音響復号化情報を取得し、音響と言語の組合せモデリングにより生成された第２の音響モデルにより第２の音響復号化情報を取得するステップと、第１の音響復号化情報に基づいて第１組の候補認識結果を確定し、第２の音響復号化情報に基づいて第２組の候補認識結果を確定し、これら２組の候補認識結果に基づいて音声信号に対する最終的な認識結果を確定するステップを含む。【選択図】図２
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_JP2021033255A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>JP2021033255A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_JP2021033255A3</originalsourceid><addsrcrecordid>eNrjZAgM8_d0dlUIcnX2d_fzDPH091PwdQ3x8HfRUXBxDQNK6Sg4BgQ4BjmGhAYDmX4uCs7-vgGhIa5BQD2OLo5OPq4KwSH-QY7urkCNLp6hvjwMrGmJOcWpvFCam0HJzTXE2UM3tSA_PrW4IDE5NS-1JN4rwMjAyNDA2NjI1NTRmChFANR2L18</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>VOICE RECOGNITION METHOD, DEVICE, APPARATUS, AND COMPUTER READABLE STORAGE MEDIUM</title><source>esp@cenet</source><creator>SHAO JUNYAO ; JIA LEI ; PENG XINGYUAN</creator><creatorcontrib>SHAO JUNYAO ; JIA LEI ; PENG XINGYUAN</creatorcontrib><description>To provide a voice recognition method based on double decoding in which acoustic variety of one acoustic model covers a disadvantage that there are few acoustic paths of other acoustic models, two decoding paths are independent from each other, a decoding space is expanded, and accordingly the accuracy of voice recognition can be improved.SOLUTION: The voice recognition method includes: for an inputted voice signal, obtaining first acoustic decoded information by a first acoustic model, and obtaining second acoustic decoded information by a second acoustic model generated by joint modeling of sound and language; and determining a first set of candidate recognition results based on the first acoustic decoded information, determining a second set of candidate recognition results based on the second acoustic decoded information, and determining a final recognition result for the voice signal based on these two sets of candidate recognition results.SELECTED DRAWING: Figure 2 【課題】１つの音響モデルの音響多様性により他の音響モデルの音響経路が少ないという欠点を補い、２つの復号化経路が互いに独立しており、復号化空間が拡張され、それにより音声認識の正確率が向上されることができる二重復号化に基づく音声認識方法を提供すう。【解決手段】音声認識方法は、入力された音声信号に対して、第１の音響モデルにより第１の音響復号化情報を取得し、音響と言語の組合せモデリングにより生成された第２の音響モデルにより第２の音響復号化情報を取得するステップと、第１の音響復号化情報に基づいて第１組の候補認識結果を確定し、第２の音響復号化情報に基づいて第２組の候補認識結果を確定し、これら２組の候補認識結果に基づいて音声信号に対する最終的な認識結果を確定するステップを含む。【選択図】図２</description><language>eng ; jpn</language><subject>ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION</subject><creationdate>2021</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20210301&DB=EPODOC&CC=JP&NR=2021033255A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20210301&DB=EPODOC&CC=JP&NR=2021033255A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>SHAO JUNYAO</creatorcontrib><creatorcontrib>JIA LEI</creatorcontrib><creatorcontrib>PENG XINGYUAN</creatorcontrib><title>VOICE RECOGNITION METHOD, DEVICE, APPARATUS, AND COMPUTER READABLE STORAGE MEDIUM</title><description>To provide a voice recognition method based on double decoding in which acoustic variety of one acoustic model covers a disadvantage that there are few acoustic paths of other acoustic models, two decoding paths are independent from each other, a decoding space is expanded, and accordingly the accuracy of voice recognition can be improved.SOLUTION: The voice recognition method includes: for an inputted voice signal, obtaining first acoustic decoded information by a first acoustic model, and obtaining second acoustic decoded information by a second acoustic model generated by joint modeling of sound and language; and determining a first set of candidate recognition results based on the first acoustic decoded information, determining a second set of candidate recognition results based on the second acoustic decoded information, and determining a final recognition result for the voice signal based on these two sets of candidate recognition results.SELECTED DRAWING: Figure 2 【課題】１つの音響モデルの音響多様性により他の音響モデルの音響経路が少ないという欠点を補い、２つの復号化経路が互いに独立しており、復号化空間が拡張され、それにより音声認識の正確率が向上されることができる二重復号化に基づく音声認識方法を提供すう。【解決手段】音声認識方法は、入力された音声信号に対して、第１の音響モデルにより第１の音響復号化情報を取得し、音響と言語の組合せモデリングにより生成された第２の音響モデルにより第２の音響復号化情報を取得するステップと、第１の音響復号化情報に基づいて第１組の候補認識結果を確定し、第２の音響復号化情報に基づいて第２組の候補認識結果を確定し、これら２組の候補認識結果に基づいて音声信号に対する最終的な認識結果を確定するステップを含む。【選択図】図２</description><subject>ACOUSTICS</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2021</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZAgM8_d0dlUIcnX2d_fzDPH091PwdQ3x8HfRUXBxDQNK6Sg4BgQ4BjmGhAYDmX4uCs7-vgGhIa5BQD2OLo5OPq4KwSH-QY7urkCNLp6hvjwMrGmJOcWpvFCam0HJzTXE2UM3tSA_PrW4IDE5NS-1JN4rwMjAyNDA2NjI1NTRmChFANR2L18</recordid><startdate>20210301</startdate><enddate>20210301</enddate><creator>SHAO JUNYAO</creator><creator>JIA LEI</creator><creator>PENG XINGYUAN</creator><scope>EVB</scope></search><sort><creationdate>20210301</creationdate><title>VOICE RECOGNITION METHOD, DEVICE, APPARATUS, AND COMPUTER READABLE STORAGE MEDIUM</title><author>SHAO JUNYAO ; JIA LEI ; PENG XINGYUAN</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_JP2021033255A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng ; jpn</language><creationdate>2021</creationdate><topic>ACOUSTICS</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><toplevel>online_resources</toplevel><creatorcontrib>SHAO JUNYAO</creatorcontrib><creatorcontrib>JIA LEI</creatorcontrib><creatorcontrib>PENG XINGYUAN</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>SHAO JUNYAO</au><au>JIA LEI</au><au>PENG XINGYUAN</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>VOICE RECOGNITION METHOD, DEVICE, APPARATUS, AND COMPUTER READABLE STORAGE MEDIUM</title><date>2021-03-01</date><risdate>2021</risdate><abstract>To provide a voice recognition method based on double decoding in which acoustic variety of one acoustic model covers a disadvantage that there are few acoustic paths of other acoustic models, two decoding paths are independent from each other, a decoding space is expanded, and accordingly the accuracy of voice recognition can be improved.SOLUTION: The voice recognition method includes: for an inputted voice signal, obtaining first acoustic decoded information by a first acoustic model, and obtaining second acoustic decoded information by a second acoustic model generated by joint modeling of sound and language; and determining a first set of candidate recognition results based on the first acoustic decoded information, determining a second set of candidate recognition results based on the second acoustic decoded information, and determining a final recognition result for the voice signal based on these two sets of candidate recognition results.SELECTED DRAWING: Figure 2 【課題】１つの音響モデルの音響多様性により他の音響モデルの音響経路が少ないという欠点を補い、２つの復号化経路が互いに独立しており、復号化空間が拡張され、それにより音声認識の正確率が向上されることができる二重復号化に基づく音声認識方法を提供すう。【解決手段】音声認識方法は、入力された音声信号に対して、第１の音響モデルにより第１の音響復号化情報を取得し、音響と言語の組合せモデリングにより生成された第２の音響モデルにより第２の音響復号化情報を取得するステップと、第１の音響復号化情報に基づいて第１組の候補認識結果を確定し、第２の音響復号化情報に基づいて第２組の候補認識結果を確定し、これら２組の候補認識結果に基づいて音声信号に対する最終的な認識結果を確定するステップを含む。【選択図】図２</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier
ispartof
issn
language	eng ; jpn
recordid	cdi_epo_espacenet_JP2021033255A
source	esp@cenet
subjects	ACOUSTICS MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
title	VOICE RECOGNITION METHOD, DEVICE, APPARATUS, AND COMPUTER READABLE STORAGE MEDIUM
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-20T05%3A58%3A57IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=SHAO%20JUNYAO&rft.date=2021-03-01&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EJP2021033255A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true