Decoded graph optimization method and device and storage medium

The invention provides a decoding graph optimization method and device and a storage medium, and relates to the speech recognition technology. The method comprises the steps of training an acoustic model and constructing a language model, generating an initial decoding graph according to the trained...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: LEI YANQIANG, BAN ZHIHUA
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator LEI YANQIANG
BAN ZHIHUA
description The invention provides a decoding graph optimization method and device and a storage medium, and relates to the speech recognition technology. The method comprises the steps of training an acoustic model and constructing a language model, generating an initial decoding graph according to the trained acoustic model and language model, adjusting the weight of the initial decoding graph to maximize the target function value, obtaining an optimized decoding graph, and representing the occurrence probability of the annotation word sequence corresponding to the preset voice signal under the conditions of the acoustic model and the initial decoding graph by the target function. According to the invention, a decoded picture with excellent performance can be obtained. 本申请提供一种解码图优化方法、装置及存储介质,涉及语音识别技术。此方法包括:训练声学模型和构建语言模型;根据训练好的声学模型和语言模型,生成初始解码图;通过调整初始解码图的权重,使得目标函数值最大,得到优化后的解码图,目标函数表征将预设语音信号在声学模型和初始解码图的条件下,与预设语音信号对应的标注词序列的发生概率。通过本申请可以得到性能较优的解码图。
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN112466293A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN112466293A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN112466293A3</originalsourceid><addsrcrecordid>eNrjZLB3SU3OT0lNUUgvSizIUMgvKMnMzaxKLMnMz1PITS3JyE9RSMxLUUhJLctMTgUzi0vyixLTU4GyKZmluTwMrGmJOcWpvFCam0HRzTXE2UM3tSA_PrW4IDE5NS-1JN7Zz9DQyMTMzMjS2NGYGDUAiGww2A</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Decoded graph optimization method and device and storage medium</title><source>esp@cenet</source><creator>LEI YANQIANG ; BAN ZHIHUA</creator><creatorcontrib>LEI YANQIANG ; BAN ZHIHUA</creatorcontrib><description>The invention provides a decoding graph optimization method and device and a storage medium, and relates to the speech recognition technology. The method comprises the steps of training an acoustic model and constructing a language model, generating an initial decoding graph according to the trained acoustic model and language model, adjusting the weight of the initial decoding graph to maximize the target function value, obtaining an optimized decoding graph, and representing the occurrence probability of the annotation word sequence corresponding to the preset voice signal under the conditions of the acoustic model and the initial decoding graph by the target function. According to the invention, a decoded picture with excellent performance can be obtained. 本申请提供一种解码图优化方法、装置及存储介质,涉及语音识别技术。此方法包括:训练声学模型和构建语言模型;根据训练好的声学模型和语言模型,生成初始解码图;通过调整初始解码图的权重,使得目标函数值最大,得到优化后的解码图,目标函数表征将预设语音信号在声学模型和初始解码图的条件下,与预设语音信号对应的标注词序列的发生概率。通过本申请可以得到性能较优的解码图。</description><language>chi ; eng</language><subject>ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION</subject><creationdate>2021</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20210309&amp;DB=EPODOC&amp;CC=CN&amp;NR=112466293A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20210309&amp;DB=EPODOC&amp;CC=CN&amp;NR=112466293A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>LEI YANQIANG</creatorcontrib><creatorcontrib>BAN ZHIHUA</creatorcontrib><title>Decoded graph optimization method and device and storage medium</title><description>The invention provides a decoding graph optimization method and device and a storage medium, and relates to the speech recognition technology. The method comprises the steps of training an acoustic model and constructing a language model, generating an initial decoding graph according to the trained acoustic model and language model, adjusting the weight of the initial decoding graph to maximize the target function value, obtaining an optimized decoding graph, and representing the occurrence probability of the annotation word sequence corresponding to the preset voice signal under the conditions of the acoustic model and the initial decoding graph by the target function. According to the invention, a decoded picture with excellent performance can be obtained. 本申请提供一种解码图优化方法、装置及存储介质,涉及语音识别技术。此方法包括:训练声学模型和构建语言模型;根据训练好的声学模型和语言模型,生成初始解码图;通过调整初始解码图的权重,使得目标函数值最大,得到优化后的解码图,目标函数表征将预设语音信号在声学模型和初始解码图的条件下,与预设语音信号对应的标注词序列的发生概率。通过本申请可以得到性能较优的解码图。</description><subject>ACOUSTICS</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2021</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZLB3SU3OT0lNUUgvSizIUMgvKMnMzaxKLMnMz1PITS3JyE9RSMxLUUhJLctMTgUzi0vyixLTU4GyKZmluTwMrGmJOcWpvFCam0HRzTXE2UM3tSA_PrW4IDE5NS-1JN7Zz9DQyMTMzMjS2NGYGDUAiGww2A</recordid><startdate>20210309</startdate><enddate>20210309</enddate><creator>LEI YANQIANG</creator><creator>BAN ZHIHUA</creator><scope>EVB</scope></search><sort><creationdate>20210309</creationdate><title>Decoded graph optimization method and device and storage medium</title><author>LEI YANQIANG ; BAN ZHIHUA</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN112466293A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2021</creationdate><topic>ACOUSTICS</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><toplevel>online_resources</toplevel><creatorcontrib>LEI YANQIANG</creatorcontrib><creatorcontrib>BAN ZHIHUA</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>LEI YANQIANG</au><au>BAN ZHIHUA</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Decoded graph optimization method and device and storage medium</title><date>2021-03-09</date><risdate>2021</risdate><abstract>The invention provides a decoding graph optimization method and device and a storage medium, and relates to the speech recognition technology. The method comprises the steps of training an acoustic model and constructing a language model, generating an initial decoding graph according to the trained acoustic model and language model, adjusting the weight of the initial decoding graph to maximize the target function value, obtaining an optimized decoding graph, and representing the occurrence probability of the annotation word sequence corresponding to the preset voice signal under the conditions of the acoustic model and the initial decoding graph by the target function. According to the invention, a decoded picture with excellent performance can be obtained. 本申请提供一种解码图优化方法、装置及存储介质,涉及语音识别技术。此方法包括:训练声学模型和构建语言模型;根据训练好的声学模型和语言模型,生成初始解码图;通过调整初始解码图的权重,使得目标函数值最大,得到优化后的解码图,目标函数表征将预设语音信号在声学模型和初始解码图的条件下,与预设语音信号对应的标注词序列的发生概率。通过本申请可以得到性能较优的解码图。</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language chi ; eng
recordid cdi_epo_espacenet_CN112466293A
source esp@cenet
subjects ACOUSTICS
MUSICAL INSTRUMENTS
PHYSICS
SPEECH ANALYSIS OR SYNTHESIS
SPEECH OR AUDIO CODING OR DECODING
SPEECH OR VOICE PROCESSING
SPEECH RECOGNITION
title Decoded graph optimization method and device and storage medium
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-19T11%3A33%3A20IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=LEI%20YANQIANG&rft.date=2021-03-09&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN112466293A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true