Voice recognition method and system

The embodiment of the invention provides a voice recognition method. The voice recognition method comprises the steps that audio features of each frame of an extracted voice file are input into a deeplearning neutral network, the posterior probability of each frame is determined, and by smoothing th...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
1. Verfasser: WAN GUANGHUI
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator WAN GUANGHUI
description The embodiment of the invention provides a voice recognition method. The voice recognition method comprises the steps that audio features of each frame of an extracted voice file are input into a deeplearning neutral network, the posterior probability of each frame is determined, and by smoothing the posterior probability of each frame, key words constituting conversation voice are determined; astring word set where the key words are located is determined; a first label sequence composed of labels corresponding to the maximum posterior probabilities of all the frames in the voice file and asecond label sequence determined by pronunciation mapping of all to-be-selected words are obtained, similarities between the first label sequence and the second label sequence corresponding to all theto-be-selected words are traversed, and the to-be-selected words corresponding to the maximum similarity serve as recognized words of the conversation voice. The embodiment of the invention further provides a voice recognition
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN110223678A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN110223678A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN110223678A3</originalsourceid><addsrcrecordid>eNrjZFAOy89MTlUoSk3OT8_LLMnMz1PITS3JyE9RSMxLUSiuLC5JzeVhYE1LzClO5YXS3AyKbq4hzh66qQX58anFBYnJqXmpJfHOfoaGBkZGxmbmFo7GxKgBAOdfJnY</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Voice recognition method and system</title><source>esp@cenet</source><creator>WAN GUANGHUI</creator><creatorcontrib>WAN GUANGHUI</creatorcontrib><description>The embodiment of the invention provides a voice recognition method. The voice recognition method comprises the steps that audio features of each frame of an extracted voice file are input into a deeplearning neutral network, the posterior probability of each frame is determined, and by smoothing the posterior probability of each frame, key words constituting conversation voice are determined; astring word set where the key words are located is determined; a first label sequence composed of labels corresponding to the maximum posterior probabilities of all the frames in the voice file and asecond label sequence determined by pronunciation mapping of all to-be-selected words are obtained, similarities between the first label sequence and the second label sequence corresponding to all theto-be-selected words are traversed, and the to-be-selected words corresponding to the maximum similarity serve as recognized words of the conversation voice. The embodiment of the invention further provides a voice recognition</description><language>chi ; eng</language><subject>ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION</subject><creationdate>2019</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20190910&amp;DB=EPODOC&amp;CC=CN&amp;NR=110223678A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76290</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20190910&amp;DB=EPODOC&amp;CC=CN&amp;NR=110223678A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>WAN GUANGHUI</creatorcontrib><title>Voice recognition method and system</title><description>The embodiment of the invention provides a voice recognition method. The voice recognition method comprises the steps that audio features of each frame of an extracted voice file are input into a deeplearning neutral network, the posterior probability of each frame is determined, and by smoothing the posterior probability of each frame, key words constituting conversation voice are determined; astring word set where the key words are located is determined; a first label sequence composed of labels corresponding to the maximum posterior probabilities of all the frames in the voice file and asecond label sequence determined by pronunciation mapping of all to-be-selected words are obtained, similarities between the first label sequence and the second label sequence corresponding to all theto-be-selected words are traversed, and the to-be-selected words corresponding to the maximum similarity serve as recognized words of the conversation voice. The embodiment of the invention further provides a voice recognition</description><subject>ACOUSTICS</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2019</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZFAOy89MTlUoSk3OT8_LLMnMz1PITS3JyE9RSMxLUSiuLC5JzeVhYE1LzClO5YXS3AyKbq4hzh66qQX58anFBYnJqXmpJfHOfoaGBkZGxmbmFo7GxKgBAOdfJnY</recordid><startdate>20190910</startdate><enddate>20190910</enddate><creator>WAN GUANGHUI</creator><scope>EVB</scope></search><sort><creationdate>20190910</creationdate><title>Voice recognition method and system</title><author>WAN GUANGHUI</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN110223678A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2019</creationdate><topic>ACOUSTICS</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><toplevel>online_resources</toplevel><creatorcontrib>WAN GUANGHUI</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>WAN GUANGHUI</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Voice recognition method and system</title><date>2019-09-10</date><risdate>2019</risdate><abstract>The embodiment of the invention provides a voice recognition method. The voice recognition method comprises the steps that audio features of each frame of an extracted voice file are input into a deeplearning neutral network, the posterior probability of each frame is determined, and by smoothing the posterior probability of each frame, key words constituting conversation voice are determined; astring word set where the key words are located is determined; a first label sequence composed of labels corresponding to the maximum posterior probabilities of all the frames in the voice file and asecond label sequence determined by pronunciation mapping of all to-be-selected words are obtained, similarities between the first label sequence and the second label sequence corresponding to all theto-be-selected words are traversed, and the to-be-selected words corresponding to the maximum similarity serve as recognized words of the conversation voice. The embodiment of the invention further provides a voice recognition</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language chi ; eng
recordid cdi_epo_espacenet_CN110223678A
source esp@cenet
subjects ACOUSTICS
MUSICAL INSTRUMENTS
PHYSICS
SPEECH ANALYSIS OR SYNTHESIS
SPEECH OR AUDIO CODING OR DECODING
SPEECH OR VOICE PROCESSING
SPEECH RECOGNITION
title Voice recognition method and system
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-29T08%3A16%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=WAN%20GUANGHUI&rft.date=2019-09-10&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN110223678A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true