Voice activity detection method and apparatus thereof

The invention provides a voice activity detection method and a corresponding apparatus. The method comprises calculating an auditory characteristic of a sound signal, wherein the auditory characteristic includes a first dimension parameter related to the a priori signal-to-noise ratio,...
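The record's description field states that three parameters (one related to the a priori signal-to-noise ratio, one to the a posteriori signal-to-noise ratio, and one to the time-domain signal) are each compared with a corresponding threshold. The following is a minimal sketch of that idea, not the patented method. Everything specific in it is an assumption made for illustration: the threshold values, the recursive smoothing used for the a priori estimate (loosely modeled on the decision-directed approach), the use of mean frame energy as the time-domain parameter, the rule that all three comparisons must pass, and all function and class names.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple


@dataclass
class Thresholds:
    # Hypothetical values; the record does not state any thresholds.
    prior_snr: float = 2.0
    post_snr: float = 3.0
    energy: float = 1e-3


def posterior_snr(power: Sequence[float], noise: Sequence[float]) -> float:
    """Mean over frequency bins of observed power divided by noise power."""
    return sum(p / n for p, n in zip(power, noise)) / len(power)


def prior_snr(post: float, prev_prior: float, alpha: float = 0.98) -> float:
    """Simplified recursive smoothing, assumed here for illustration only."""
    return alpha * prev_prior + (1.0 - alpha) * max(post - 1.0, 0.0)


def frame_energy(samples: Sequence[float]) -> float:
    """Mean squared amplitude, standing in for the time-domain parameter."""
    return sum(s * s for s in samples) / len(samples)


def is_speech(
    samples: Sequence[float],
    power: Sequence[float],
    noise: Sequence[float],
    prev_prior: float,
    th: Thresholds = Thresholds(),
) -> Tuple[bool, float]:
    """Return (decision, updated a priori estimate) for one frame."""
    post = posterior_snr(power, noise)
    prior = prior_snr(post, prev_prior)
    energy = frame_energy(samples)
    # Assumption: the frame is speech only if all three parameters
    # exceed their thresholds; the record does not specify the rule.
    active = prior > th.prior_snr and post > th.post_snr and energy > th.energy
    return active, prior
```

A caller would process frames in order, feeding the returned a priori estimate back in as `prev_prior` for the next frame. The noise power per bin has to come from a separate noise estimator, which this sketch does not cover.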

Bibliographic details
Main author: CAI GANGLIN
Format: Patent
Language: chi ; eng
container_end_page
container_issue
container_start_page
container_title
container_volume
creator CAI GANGLIN
description The invention provides a voice activity detection method and a corresponding apparatus. The method comprises calculating an auditory characteristic of a sound signal, wherein the auditory characteristic includes a first dimension parameter related to the a priori signal-to-noise ratio, a second dimension parameter related to the a posteriori signal-to-noise ratio, and a third dimension parameter related to the time-domain signal; and comparing the first, second, and third dimension parameters with their corresponding auditory thresholds to obtain a detection result. The a priori and a posteriori signal-to-noise ratios are combined with the time-domain signal to represent the auditory characteristic, and the extracted auditory characteristic can be compared with the auditory thresholds to detect voice activity in real time. Under a single microphone system, the auditory characteristic under a remot
format Patent
creationdate 2017
date 2017-11-24
oa free_for_read
sourcerecordid CN107393558A
linktohtml https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20171124&DB=EPODOC&CC=CN&NR=107393558A
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language chi ; eng
recordid cdi_epo_espacenet_CN107393558A
source esp@cenet
subjects ACOUSTICS
MUSICAL INSTRUMENTS
PHYSICS
SPEECH ANALYSIS OR SYNTHESIS
SPEECH OR AUDIO CODING OR DECODING
SPEECH OR VOICE PROCESSING
SPEECH RECOGNITION
title Voice activity detection method and apparatus thereof
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-29T04%3A42%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=CAI%20GANGLIN&rft.date=2017-11-24&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN107393558A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true