Voice recognition method and system

The embodiment of the invention provides a voice recognition method. The voice recognition method comprises the steps that audio features of each frame of an extracted voice file are input into a deep learning neural network, the posterior probability of each frame is determined, and by smoothing th...
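
The matching step described in the record's abstract (a first label sequence taken from the maximum posterior probability of each frame, compared with the label sequence of each to-be-selected word) might be sketched as follows. This is an illustrative sketch only, not the patented implementation: the moving-average smoothing window, the collapsing of repeated labels, the use of `difflib.SequenceMatcher` as the similarity measure, and the names `LABELS`, `LEXICON` and `POSTERIORS` are all assumptions, and the keyword-spotting step that selects the candidate word set is not modeled.

```python
from difflib import SequenceMatcher


def smooth(posteriors, window=3):
    """Average each frame with up to `window - 1` preceding frames (assumed scheme)."""
    smoothed = []
    for t in range(len(posteriors)):
        lo = max(0, t - window + 1)
        span = posteriors[lo:t + 1]
        smoothed.append([sum(col) / len(span) for col in zip(*span)])
    return smoothed


def frame_labels(posteriors, labels):
    """First label sequence: the max-posterior label of each frame.

    Consecutive repeats are collapsed; that is an assumption of this sketch.
    """
    seq = []
    for frame in posteriors:
        best = labels[max(range(len(frame)), key=frame.__getitem__)]
        if not seq or seq[-1] != best:
            seq.append(best)
    return seq


def recognize(posteriors, labels, lexicon):
    """Return the candidate word whose label sequence is most similar to the frames."""
    first = frame_labels(posteriors, labels)

    def score(word):
        # Second label sequence: the (hypothetical) pronunciation mapping of the word.
        return SequenceMatcher(None, first, lexicon[word]).ratio()

    best = max(lexicon, key=score)
    return best, score(best)


# Toy data, invented for illustration only.
LABELS = ["sil", "n", "i", "h", "ao"]
LEXICON = {"ni": ["n", "i"], "hao": ["h", "ao"], "nihao": ["n", "i", "h", "ao"]}
POSTERIORS = [
    [0.90, 0.025, 0.025, 0.025, 0.025],  # sil
    [0.05, 0.80, 0.05, 0.05, 0.05],      # n
    [0.05, 0.80, 0.05, 0.05, 0.05],      # n
    [0.05, 0.05, 0.80, 0.05, 0.05],      # i
    [0.05, 0.05, 0.05, 0.80, 0.05],      # h
    [0.05, 0.05, 0.05, 0.05, 0.80],      # ao
]
```

Calling `recognize(POSTERIORS, LABELS, LEXICON)` returns the best-scoring candidate together with its similarity. A real system would presumably derive the posteriors from the network and restrict `LEXICON` to the word set found via the keywords.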

Bibliographic details
Main author:	WAN GUANGHUI
Format:	Patent
Language:	chi ; eng
Subjects:	ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	WAN GUANGHUI
description	The embodiment of the invention provides a voice recognition method. The voice recognition method comprises the steps that audio features of each frame of an extracted voice file are input into a deep learning neural network, the posterior probability of each frame is determined, and by smoothing the posterior probability of each frame, key words constituting conversation voice are determined; a string word set where the key words are located is determined; a first label sequence composed of labels corresponding to the maximum posterior probabilities of all the frames in the voice file and a second label sequence determined by pronunciation mapping of all to-be-selected words are obtained, similarities between the first label sequence and the second label sequence corresponding to all the to-be-selected words are traversed, and the to-be-selected words corresponding to the maximum similarity serve as recognized words of the conversation voice. The embodiment of the invention further provides a voice recognition
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN110223678A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN110223678A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN110223678A3</originalsourceid><addsrcrecordid>eNrjZFAOy89MTlUoSk3OT8_LLMnMz1PITS3JyE9RSMxLUSiuLC5JzeVhYE1LzClO5YXS3AyKbq4hzh66qQX58anFBYnJqXmpJfHOfoaGBkZGxmbmFo7GxKgBAOdfJnY</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Voice recognition method and system</title><source>esp@cenet</source><creator>WAN GUANGHUI</creator><creatorcontrib>WAN GUANGHUI</creatorcontrib><description>The embodiment of the invention provides a voice recognition method. The voice recognition method comprises the steps that audio features of each frame of an extracted voice file are input into a deeplearning neutral network, the posterior probability of each frame is determined, and by smoothing the posterior probability of each frame, key words constituting conversation voice are determined; astring word set where the key words are located is determined; a first label sequence composed of labels corresponding to the maximum posterior probabilities of all the frames in the voice file and asecond label sequence determined by pronunciation mapping of all to-be-selected words are obtained, similarities between the first label sequence and the second label sequence corresponding to all theto-be-selected words are traversed, and the to-be-selected words corresponding to the maximum similarity serve as recognized words of the conversation voice. 
The embodiment of the invention further provides a voice recognition</description><language>chi ; eng</language><subject>ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION</subject><creationdate>2019</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20190910&DB=EPODOC&CC=CN&NR=110223678A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76290</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20190910&DB=EPODOC&CC=CN&NR=110223678A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>WAN GUANGHUI</creatorcontrib><title>Voice recognition method and system</title><description>The embodiment of the invention provides a voice recognition method. 
The voice recognition method comprises the steps that audio features of each frame of an extracted voice file are input into a deeplearning neutral network, the posterior probability of each frame is determined, and by smoothing the posterior probability of each frame, key words constituting conversation voice are determined; astring word set where the key words are located is determined; a first label sequence composed of labels corresponding to the maximum posterior probabilities of all the frames in the voice file and asecond label sequence determined by pronunciation mapping of all to-be-selected words are obtained, similarities between the first label sequence and the second label sequence corresponding to all theto-be-selected words are traversed, and the to-be-selected words corresponding to the maximum similarity serve as recognized words of the conversation voice. The embodiment of the invention further provides a voice recognition</description><subject>ACOUSTICS</subject><subject>MUSICAL INSTRUMENTS</subject><subject>PHYSICS</subject><subject>SPEECH ANALYSIS OR SYNTHESIS</subject><subject>SPEECH OR AUDIO CODING OR DECODING</subject><subject>SPEECH OR VOICE PROCESSING</subject><subject>SPEECH RECOGNITION</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2019</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZFAOy89MTlUoSk3OT8_LLMnMz1PITS3JyE9RSMxLUSiuLC5JzeVhYE1LzClO5YXS3AyKbq4hzh66qQX58anFBYnJqXmpJfHOfoaGBkZGxmbmFo7GxKgBAOdfJnY</recordid><startdate>20190910</startdate><enddate>20190910</enddate><creator>WAN GUANGHUI</creator><scope>EVB</scope></search><sort><creationdate>20190910</creationdate><title>Voice recognition method and system</title><author>WAN GUANGHUI</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN110223678A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; 
eng</language><creationdate>2019</creationdate><topic>ACOUSTICS</topic><topic>MUSICAL INSTRUMENTS</topic><topic>PHYSICS</topic><topic>SPEECH ANALYSIS OR SYNTHESIS</topic><topic>SPEECH OR AUDIO CODING OR DECODING</topic><topic>SPEECH OR VOICE PROCESSING</topic><topic>SPEECH RECOGNITION</topic><toplevel>online_resources</toplevel><creatorcontrib>WAN GUANGHUI</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>WAN GUANGHUI</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Voice recognition method and system</title><date>2019-09-10</date><risdate>2019</risdate><abstract>The embodiment of the invention provides a voice recognition method. The voice recognition method comprises the steps that audio features of each frame of an extracted voice file are input into a deeplearning neutral network, the posterior probability of each frame is determined, and by smoothing the posterior probability of each frame, key words constituting conversation voice are determined; astring word set where the key words are located is determined; a first label sequence composed of labels corresponding to the maximum posterior probabilities of all the frames in the voice file and asecond label sequence determined by pronunciation mapping of all to-be-selected words are obtained, similarities between the first label sequence and the second label sequence corresponding to all theto-be-selected words are traversed, and the to-be-selected words corresponding to the maximum similarity serve as recognized words of the conversation voice. The embodiment of the invention further provides a voice recognition</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier
ispartof
issn
language	chi ; eng
recordid	cdi_epo_espacenet_CN110223678A
source	esp@cenet
subjects	ACOUSTICS ; MUSICAL INSTRUMENTS ; PHYSICS ; SPEECH ANALYSIS OR SYNTHESIS ; SPEECH OR AUDIO CODING OR DECODING ; SPEECH OR VOICE PROCESSING ; SPEECH RECOGNITION
title	Voice recognition method and system
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-29T08%3A16%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=WAN%20GUANGHUI&rft.date=2019-09-10&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN110223678A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true