VOICE RECOGNITION METHOD, DEVICE, APPARATUS, AND COMPUTER READABLE STORAGE MEDIUM

To provide a voice recognition method based on double decoding in which acoustic variety of one acoustic model covers a disadvantage that there are few acoustic paths of other acoustic models, two decoding paths are independent from each other, a decoding space is expanded, and accordingly the accur...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: SHAO JUNYAO, JIA LEI, PENG XINGYUAN
Format: Patent
Sprache:eng ; jpn
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator SHAO JUNYAO
JIA LEI
PENG XINGYUAN
description To provide a voice recognition method based on double decoding in which acoustic variety of one acoustic model covers a disadvantage that there are few acoustic paths of other acoustic models, two decoding paths are independent from each other, a decoding space is expanded, and accordingly the accuracy of voice recognition can be improved.SOLUTION: The voice recognition method includes: for an inputted voice signal, obtaining first acoustic decoded information by a first acoustic model, and obtaining second acoustic decoded information by a second acoustic model generated by joint modeling of sound and language; and determining a first set of candidate recognition results based on the first acoustic decoded information, determining a second set of candidate recognition results based on the second acoustic decoded information, and determining a final recognition result for the voice signal based on these two sets of candidate recognition results.SELECTED DRAWING: Figure 2 【課題】1つの音響モデルの音響多様性により他の音響モデルの音響経路が少ないという欠点を補い、2つの復号化経路が互いに独立しており、復号化空間が拡張され、それにより音声認識の正確率が向上されることができる二重復号化に基づく音声認識方法を提供すう。【解決手段】音声認識方法は、入力された音声信号に対して、第1の音響モデルにより第1の音響復号化情報を取得し、音響と言語の組合せモデリングにより生成された第2の音響モデルにより第2の音響復号化情報を取得するステップと、第1の音響復号化情報に基づいて第1組の候補認識結果を確定し、第2の音響復号化情報に基づいて第2組の候補認識結果を確定し、これら2組の候補認識結果に基づいて音声信号に対する最終的な認識結果を確定するステップを含む。【選択図】図2
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_JP2021033255A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>JP2021033255A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_JP2021033255A3</originalsourceid><addsrcrecordid>eNrjZAgM8_d0dlUIcnX2d_fzDPH091PwdQ3x8HfRUXBxDQNK6Sg4BgQ4BjmGhAYDmX4uCs7-vgGhIa5BQD2OLo5OPq4KwSH-QY7urkCNLp6hvjwMrGmJOcWpvFCam0HJzTXE2UM3tSA_PrW4IDE5NS-1JN4rwMjAyNDA2NjI1NTRmChFANR2L18</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>VOICE RECOGNITION METHOD, DEVICE, APPARATUS, AND COMPUTER READABLE STORAGE MEDIUM</title><source>esp@cenet</source><creator>SHAO JUNYAO ; JIA LEI ; PENG XINGYUAN</creator><creatorcontrib>SHAO JUNYAO ; JIA LEI ; PENG XINGYUAN</creatorcontrib><description>To provide a voice recognition method based on double decoding in which acoustic variety of one acoustic model covers a disadvantage that there are few acoustic paths of other acoustic models, two decoding paths are independent from each other, a decoding space is expanded, and accordingly the accuracy of voice recognition can be improved.SOLUTION: The voice recognition method includes: for an inputted voice signal, obtaining first acoustic decoded information by a first acoustic model, and obtaining second acoustic decoded information by a second acoustic model generated by joint modeling of sound and language; and determining a first set of candidate recognition results based on the first acoustic decoded information, determining a second set of candidate recognition results based on the second acoustic decoded information, and determining a final recognition result for the voice signal based on these two sets of candidate recognition results.SELECTED DRAWING: Figure 2 【課題】1つの音響モデルの音響多様性により他の音響モデルの音響経路が少ないという欠点を補い、2つの復号化経路が互いに独立しており、復号化空間が拡張され、それにより音声認識の正確率が向上されることができる二重復号化に基づく音声認識方法を提供すう。【解決手段】音声認識方法は、入力された音声信号に対して、第1の音響モデルにより第1の音響復号化情報を取得し、音響と言語の組合せモデリングにより生成された第2の音響モデルにより第2の音響復号化情報を取得するステップと、第1の音響復号化情報に基づいて第1組の候補認識結果を確定し、第2の音響復号化情報に基づいて第2組の候補認識結果を確定し、これら2組の候補認識結果に基づいて音声信号に対する最終的な認識結果を確定するステップを含む。【選択図】図2</description><language>eng ; jpn</language><subject>ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION</subject><creationdate>2021</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20210301&amp;DB=EPODOC&amp;CC=JP&amp;NR=2021033255A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20210301&amp;DB=EPODOC&amp;CC=JP&amp;NR=2021033255A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>SHAO JUNYAO</creatorcontrib><creatorcontrib>JIA LEI</creatorcontrib><creatorcontrib>PENG XINGYUAN</creatorcontrib><title>VOICE RECOGNITION METHOD, DEVICE, APPARATUS, AND COMPUTER READABLE STORAGE MEDIUM</title><description>To provide a voice recognition method based on double decoding in which acoustic variety of one acoustic model covers a disadvantage that there are few acoustic paths of other acoustic models, two decoding paths are independent from each other, a decoding space is expanded, and accordingly the accuracy of voice recognition can be improved.SOLUTION: The voice recognition method includes: for an inputted voice signal, obtaining first acoustic decoded information by a first acoustic model, and obtaining second acoustic decoded information by a second acoustic model generated by joint modeling of sound and language; and determining a first set of candidate recognition results based on the first acoustic decoded information, determining a second set of candidate recognition results based on the second acoustic decoded information, and determining a final recognition result for the voice signal based on these two sets of candidate recognition results.SELECTED DRAWING: Figure 2 【課題】1つの音響モデルの音響多様性により他の音響モデルの音響経路が少ないという欠点を補い、2つの復号化経路が互いに独立しており、復号化空間が拡張され、それにより音声認識の正確率が向上されることができる二重復号化に基づく音声認識方法を提供すう。【解決手段】音声認識方法は、入力された音声信号に対して、第1の音響モデルにより第1の音響復号化情報を取得し、音響と言語の組合せモデリングにより生成された第2の音響モデルにより第2の音響復号化情報を取得するステップと、第1の音響復号化情報に基づいて第1組の候補認識結果を確定し、第2の音響復号化情報に基づいて第2組の候補認識結果を確定し、これら2組の候補認識結果に基づいて音声信号に対する最終的な認識結果を確定するステップを含む。【選択図】図2</description><subject>ACOUSTICS</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2021</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZAgM8_d0dlUIcnX2d_fzDPH091PwdQ3x8HfRUXBxDQNK6Sg4BgQ4BjmGhAYDmX4uCs7-vgGhIa5BQD2OLo5OPq4KwSH-QY7urkCNLp6hvjwMrGmJOcWpvFCam0HJzTXE2UM3tSA_PrW4IDE5NS-1JN4rwMjAyNDA2NjI1NTRmChFANR2L18</recordid><startdate>20210301</startdate><enddate>20210301</enddate><creator>SHAO JUNYAO</creator><creator>JIA LEI</creator><creator>PENG XINGYUAN</creator><scope>EVB</scope></search><sort><creationdate>20210301</creationdate><title>VOICE RECOGNITION METHOD, DEVICE, APPARATUS, AND COMPUTER READABLE STORAGE MEDIUM</title><author>SHAO JUNYAO ; JIA LEI ; PENG XINGYUAN</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_JP2021033255A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng ; jpn</language><creationdate>2021</creationdate><topic>ACOUSTICS</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><toplevel>online_resources</toplevel><creatorcontrib>SHAO JUNYAO</creatorcontrib><creatorcontrib>JIA LEI</creatorcontrib><creatorcontrib>PENG XINGYUAN</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>SHAO JUNYAO</au><au>JIA LEI</au><au>PENG XINGYUAN</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>VOICE RECOGNITION METHOD, DEVICE, APPARATUS, AND COMPUTER READABLE STORAGE MEDIUM</title><date>2021-03-01</date><risdate>2021</risdate><abstract>To provide a voice recognition method based on double decoding in which acoustic variety of one acoustic model covers a disadvantage that there are few acoustic paths of other acoustic models, two decoding paths are independent from each other, a decoding space is expanded, and accordingly the accuracy of voice recognition can be improved.SOLUTION: The voice recognition method includes: for an inputted voice signal, obtaining first acoustic decoded information by a first acoustic model, and obtaining second acoustic decoded information by a second acoustic model generated by joint modeling of sound and language; and determining a first set of candidate recognition results based on the first acoustic decoded information, determining a second set of candidate recognition results based on the second acoustic decoded information, and determining a final recognition result for the voice signal based on these two sets of candidate recognition results.SELECTED DRAWING: Figure 2 【課題】1つの音響モデルの音響多様性により他の音響モデルの音響経路が少ないという欠点を補い、2つの復号化経路が互いに独立しており、復号化空間が拡張され、それにより音声認識の正確率が向上されることができる二重復号化に基づく音声認識方法を提供すう。【解決手段】音声認識方法は、入力された音声信号に対して、第1の音響モデルにより第1の音響復号化情報を取得し、音響と言語の組合せモデリングにより生成された第2の音響モデルにより第2の音響復号化情報を取得するステップと、第1の音響復号化情報に基づいて第1組の候補認識結果を確定し、第2の音響復号化情報に基づいて第2組の候補認識結果を確定し、これら2組の候補認識結果に基づいて音声信号に対する最終的な認識結果を確定するステップを含む。【選択図】図2</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng ; jpn
recordid cdi_epo_espacenet_JP2021033255A
source esp@cenet
subjects ACOUSTICS
MUSICAL INSTRUMENTS
PHYSICS
SPEECH ANALYSIS OR SYNTHESIS
SPEECH OR AUDIO CODING OR DECODING
SPEECH OR VOICE PROCESSING
SPEECH RECOGNITION
title VOICE RECOGNITION METHOD, DEVICE, APPARATUS, AND COMPUTER READABLE STORAGE MEDIUM
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-20T05%3A58%3A57IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=SHAO%20JUNYAO&rft.date=2021-03-01&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EJP2021033255A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true