Voice recognition method and system
The embodiment of the invention provides a voice recognition method. The voice recognition method comprises the steps that audio features of each frame of an extracted voice file are input into a deep learning neural network, the posterior probability of each frame is determined, and by smoothing th...
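The abstract says the per-frame posterior probabilities are smoothed before key words are determined, but this record does not say how. The following is a minimal Python sketch under stated assumptions: a trailing moving average is used as the smoothing step, and a fixed threshold decides which labels count as detected. The function names, the `window` size, and the `threshold` value are hypothetical and do not come from the patent.

```python
from typing import List, Sequence


def smooth_posteriors(
    posteriors: Sequence[Sequence[float]], window: int = 3
) -> List[List[float]]:
    """Smooth per-frame posteriors with a trailing moving average.

    posteriors[t][k] is the network's posterior for label k at frame t.
    The moving average is an assumption; the record names no method.
    """
    smoothed = []
    for t in range(len(posteriors)):
        start = max(0, t - window + 1)
        span = posteriors[start : t + 1]
        n_labels = len(posteriors[t])
        smoothed.append(
            [sum(frame[k] for frame in span) / len(span) for k in range(n_labels)]
        )
    return smoothed


def detect_keywords(
    smoothed: Sequence[Sequence[float]],
    labels: Sequence[str],
    threshold: float = 0.6,
) -> List[str]:
    """Return labels whose smoothed posterior reaches the threshold
    in at least one frame, in order of first detection.
    The threshold rule is an assumption made for illustration."""
    found: List[str] = []
    for frame in smoothed:
        for k, p in enumerate(frame):
            if p >= threshold and labels[k] not in found:
                found.append(labels[k])
    return found
```

With `window=1` the smoothing is a no-op, which makes it easy to compare detection with and without smoothing on the same input.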
Main author: | |
---|---|
Format: | Patent |
Language: | chi ; eng |
Subjects: | |
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | WAN GUANGHUI |
description | The embodiment of the invention provides a voice recognition method. The voice recognition method comprises the following steps: audio features of each frame of an extracted voice file are input into a deep learning neural network, the posterior probability of each frame is determined, and, by smoothing the posterior probability of each frame, the key words constituting the conversation voice are determined; a string word set in which the key words are located is determined; a first label sequence, composed of the labels corresponding to the maximum posterior probabilities of all the frames in the voice file, and a second label sequence, determined by pronunciation mapping of all to-be-selected words, are obtained; the similarities between the first label sequence and the second label sequence corresponding to all the to-be-selected words are traversed, and the to-be-selected words corresponding to the maximum similarity serve as the recognized words of the conversation voice. The embodiment of the invention further provides a voice recognition |
format | Patent |
fullrecord | <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN110223678A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN110223678A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN110223678A3</originalsourceid><addsrcrecordid>eNrjZFAOy89MTlUoSk3OT8_LLMnMz1PITS3JyE9RSMxLUSiuLC5JzeVhYE1LzClO5YXS3AyKbq4hzh66qQX58anFBYnJqXmpJfHOfoaGBkZGxmbmFo7GxKgBAOdfJnY</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Voice recognition method and system</title><source>esp@cenet</source><creator>WAN GUANGHUI</creator><creatorcontrib>WAN GUANGHUI</creatorcontrib><description>The embodiment of the invention provides a voice recognition method. The voice recognition method comprises the steps that audio features of each frame of an extracted voice file are input into a deeplearning neutral network, the posterior probability of each frame is determined, and by smoothing the posterior probability of each frame, key words constituting conversation voice are determined; astring word set where the key words are located is determined; a first label sequence composed of labels corresponding to the maximum posterior probabilities of all the frames in the voice file and asecond label sequence determined by pronunciation mapping of all to-be-selected words are obtained, similarities between the first label sequence and the second label sequence corresponding to all theto-be-selected words are traversed, and the to-be-selected words corresponding to the maximum similarity serve as recognized words of the conversation voice. 
The embodiment of the invention further provides a voice recognition</description><language>chi ; eng</language><subject>ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION</subject><creationdate>2019</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20190910&DB=EPODOC&CC=CN&NR=110223678A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76290</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20190910&DB=EPODOC&CC=CN&NR=110223678A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>WAN GUANGHUI</creatorcontrib><title>Voice recognition method and system</title><description>The embodiment of the invention provides a voice recognition method. 
The voice recognition method comprises the steps that audio features of each frame of an extracted voice file are input into a deeplearning neutral network, the posterior probability of each frame is determined, and by smoothing the posterior probability of each frame, key words constituting conversation voice are determined; astring word set where the key words are located is determined; a first label sequence composed of labels corresponding to the maximum posterior probabilities of all the frames in the voice file and asecond label sequence determined by pronunciation mapping of all to-be-selected words are obtained, similarities between the first label sequence and the second label sequence corresponding to all theto-be-selected words are traversed, and the to-be-selected words corresponding to the maximum similarity serve as recognized words of the conversation voice. The embodiment of the invention further provides a voice recognition</description><subject>ACOUSTICS</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2019</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZFAOy89MTlUoSk3OT8_LLMnMz1PITS3JyE9RSMxLUSiuLC5JzeVhYE1LzClO5YXS3AyKbq4hzh66qQX58anFBYnJqXmpJfHOfoaGBkZGxmbmFo7GxKgBAOdfJnY</recordid><startdate>20190910</startdate><enddate>20190910</enddate><creator>WAN GUANGHUI</creator><scope>EVB</scope></search><sort><creationdate>20190910</creationdate><title>Voice recognition method and system</title><author>WAN GUANGHUI</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN110223678A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; 
eng</language><creationdate>2019</creationdate><topic>ACOUSTICS</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><toplevel>online_resources</toplevel><creatorcontrib>WAN GUANGHUI</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>WAN GUANGHUI</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Voice recognition method and system</title><date>2019-09-10</date><risdate>2019</risdate><abstract>The embodiment of the invention provides a voice recognition method. The voice recognition method comprises the steps that audio features of each frame of an extracted voice file are input into a deeplearning neutral network, the posterior probability of each frame is determined, and by smoothing the posterior probability of each frame, key words constituting conversation voice are determined; astring word set where the key words are located is determined; a first label sequence composed of labels corresponding to the maximum posterior probabilities of all the frames in the voice file and asecond label sequence determined by pronunciation mapping of all to-be-selected words are obtained, similarities between the first label sequence and the second label sequence corresponding to all theto-be-selected words are traversed, and the to-be-selected words corresponding to the maximum similarity serve as recognized words of the conversation voice. The embodiment of the invention further provides a voice recognition</abstract><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | |
ispartof | |
issn | |
language | chi ; eng |
recordid | cdi_epo_espacenet_CN110223678A |
source | esp@cenet |
subjects | ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION |
title | Voice recognition method and system |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-29T08%3A16%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=WAN%20GUANGHUI&rft.date=2019-09-10&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN110223678A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |
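The matching step in the description field (a first label sequence from the per-frame maximum posteriors, compared against a second label sequence for each to-be-selected word) can be sketched in Python as follows. This is an illustration under stated assumptions, not the patented implementation: the record does not name the similarity measure, so `difflib.SequenceMatcher` is swapped in; collapsing consecutive repeated labels is an added assumption; and the `pronunciations` dictionary is a hypothetical stand-in for the pronunciation mapping.

```python
from difflib import SequenceMatcher
from itertools import groupby
from typing import Dict, List, Sequence


def first_label_sequence(
    posteriors: Sequence[Sequence[float]], labels: Sequence[str]
) -> List[str]:
    """Take the label with the maximum posterior in every frame.

    Consecutive repeats are collapsed so that a sound spanning several
    frames counts once (an assumption, not stated in the record).
    """
    per_frame = [
        labels[max(range(len(frame)), key=frame.__getitem__)]
        for frame in posteriors
    ]
    return [label for label, _ in groupby(per_frame)]


def recognize(
    posteriors: Sequence[Sequence[float]],
    labels: Sequence[str],
    pronunciations: Dict[str, List[str]],
) -> str:
    """Return the candidate word whose label sequence is most similar
    to the first label sequence.

    pronunciations maps each to-be-selected word to its second label
    sequence (hypothetical data structure).
    """
    first = first_label_sequence(posteriors, labels)

    def similarity(word: str) -> float:
        # SequenceMatcher.ratio() is an assumed similarity measure.
        matcher = SequenceMatcher(None, first, pronunciations[word], autojunk=False)
        return matcher.ratio()

    # Traverse all candidates and keep the one with maximum similarity.
    return max(pronunciations, key=similarity)
```

An edit-distance or alignment-based score could replace `SequenceMatcher` without changing the surrounding structure; only the `similarity` helper would need to change.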