Decoded graph optimization method and device and storage medium
The invention provides a decoding graph optimization method and device and a storage medium, and relates to the speech recognition technology. The method comprises the steps of training an acoustic model and constructing a language model, generating an initial decoding graph according to the trained...
Main authors: | LEI YANQIANG, BAN ZHIHUA |
---|---|
Format: | Patent |
Language: | chi ; eng |
Subjects: | |
Online access: | Order full text |
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | LEI YANQIANG ; BAN ZHIHUA |
description | The invention provides a decoding graph optimization method, a device, and a storage medium, and relates to speech recognition technology. The method comprises: training an acoustic model and constructing a language model; generating an initial decoding graph from the trained acoustic model and language model; and adjusting the weights of the initial decoding graph to maximize the value of an objective function, thereby obtaining an optimized decoding graph. The objective function represents the probability that, given the acoustic model and the initial decoding graph, the preset speech signal produces its corresponding annotated word sequence. The invention can thereby obtain a decoding graph with better performance.
本申请提供一种解码图优化方法、装置及存储介质,涉及语音识别技术。此方法包括:训练声学模型和构建语言模型;根据训练好的声学模型和语言模型,生成初始解码图;通过调整初始解码图的权重,使得目标函数值最大,得到优化后的解码图,目标函数表征将预设语音信号在声学模型和初始解码图的条件下,与预设语音信号对应的标注词序列的发生概率。通过本申请可以得到性能较优的解码图。 |
format | Patent |
fullrecord | <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN112466293A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN112466293A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN112466293A3</originalsourceid><addsrcrecordid>eNrjZLB3SU3OT0lNUUgvSizIUMgvKMnMzaxKLMnMz1PITS3JyE9RSMxLUUhJLctMTgUzi0vyixLTU4GyKZmluTwMrGmJOcWpvFCam0HRzTXE2UM3tSA_PrW4IDE5NS-1JN7Zz9DQyMTMzMjS2NGYGDUAiGww2A</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Decoded graph optimization method and device and storage medium</title><source>esp@cenet</source><creator>LEI YANQIANG ; BAN ZHIHUA</creator><creatorcontrib>LEI YANQIANG ; BAN ZHIHUA</creatorcontrib><description>The invention provides a decoding graph optimization method and device and a storage medium, and relates to the speech recognition technology. The method comprises the steps of training an acoustic model and constructing a language model, generating an initial decoding graph according to the trained acoustic model and language model, adjusting the weight of the initial decoding graph to maximize the target function value, obtaining an optimized decoding graph, and representing the occurrence probability of the annotation word sequence corresponding to the preset voice signal under the conditions of the acoustic model and the initial decoding graph by the target function. According to the invention, a decoded picture with excellent performance can be obtained.
本申请提供一种解码图优化方法、装置及存储介质,涉及语音识别技术。此方法包括:训练声学模型和构建语言模型;根据训练好的声学模型和语言模型,生成初始解码图;通过调整初始解码图的权重,使得目标函数值最大,得到优化后的解码图,目标函数表征将预设语音信号在声学模型和初始解码图的条件下,与预设语音信号对应的标注词序列的发生概率。通过本申请可以得到性能较优的解码图。</description><language>chi ; eng</language><subject>ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION</subject><creationdate>2021</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20210309&DB=EPODOC&CC=CN&NR=112466293A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20210309&DB=EPODOC&CC=CN&NR=112466293A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>LEI YANQIANG</creatorcontrib><creatorcontrib>BAN ZHIHUA</creatorcontrib><title>Decoded graph optimization method and device and storage medium</title><description>The invention provides a decoding graph optimization method and device and a storage medium, and relates to the speech recognition technology. 
The method comprises the steps of training an acoustic model and constructing a language model, generating an initial decoding graph according to the trained acoustic model and language model, adjusting the weight of the initial decoding graph to maximize the target function value, obtaining an optimized decoding graph, and representing the occurrence probability of the annotation word sequence corresponding to the preset voice signal under the conditions of the acoustic model and the initial decoding graph by the target function. According to the invention, a decoded picture with excellent performance can be obtained.
本申请提供一种解码图优化方法、装置及存储介质,涉及语音识别技术。此方法包括:训练声学模型和构建语言模型;根据训练好的声学模型和语言模型,生成初始解码图;通过调整初始解码图的权重,使得目标函数值最大,得到优化后的解码图,目标函数表征将预设语音信号在声学模型和初始解码图的条件下,与预设语音信号对应的标注词序列的发生概率。通过本申请可以得到性能较优的解码图。</description><subject>ACOUSTICS</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2021</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZLB3SU3OT0lNUUgvSizIUMgvKMnMzaxKLMnMz1PITS3JyE9RSMxLUUhJLctMTgUzi0vyixLTU4GyKZmluTwMrGmJOcWpvFCam0HRzTXE2UM3tSA_PrW4IDE5NS-1JN7Zz9DQyMTMzMjS2NGYGDUAiGww2A</recordid><startdate>20210309</startdate><enddate>20210309</enddate><creator>LEI YANQIANG</creator><creator>BAN ZHIHUA</creator><scope>EVB</scope></search><sort><creationdate>20210309</creationdate><title>Decoded graph optimization method and device and storage medium</title><author>LEI YANQIANG ; BAN ZHIHUA</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN112466293A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2021</creationdate><topic>ACOUSTICS</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><toplevel>online_resources</toplevel><creatorcontrib>LEI YANQIANG</creatorcontrib><creatorcontrib>BAN ZHIHUA</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>LEI YANQIANG</au><au>BAN ZHIHUA</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Decoded graph optimization method and 
device and storage medium</title><date>2021-03-09</date><risdate>2021</risdate><abstract>The invention provides a decoding graph optimization method and device and a storage medium, and relates to the speech recognition technology. The method comprises the steps of training an acoustic model and constructing a language model, generating an initial decoding graph according to the trained acoustic model and language model, adjusting the weight of the initial decoding graph to maximize the target function value, obtaining an optimized decoding graph, and representing the occurrence probability of the annotation word sequence corresponding to the preset voice signal under the conditions of the acoustic model and the initial decoding graph by the target function. According to the invention, a decoded picture with excellent performance can be obtained.
本申请提供一种解码图优化方法、装置及存储介质,涉及语音识别技术。此方法包括:训练声学模型和构建语言模型;根据训练好的声学模型和语言模型,生成初始解码图;通过调整初始解码图的权重,使得目标函数值最大,得到优化后的解码图,目标函数表征将预设语音信号在声学模型和初始解码图的条件下,与预设语音信号对应的标注词序列的发生概率。通过本申请可以得到性能较优的解码图。</abstract><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | |
ispartof | |
issn | |
language | chi ; eng |
recordid | cdi_epo_espacenet_CN112466293A |
source | esp@cenet |
subjects | ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION |
title | Decoded graph optimization method and device and storage medium |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-19T11%3A33%3A20IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=LEI%20YANQIANG&rft.date=2021-03-09&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN112466293A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |