Estimation of the voicing cut-off frequency contour of natural speech based on harmonic and aperiodic energies
We present a new algorithm for the automatic estimation of the voicing cut-off frequency (VCO), i.e., the frequency that separates the periodic low-frequency part from the aperiodic high-frequency part in voiced segments of natural speech. Starting from the power spectrum of a two pitch period speec...
Main authors: | Hermus, K. ; Girin, L. ; Van hamme, H. ; Irhimeh, S. |
---|---|
Format: | Conference proceedings |
Language: | eng |
Subjects: | |
Online access: | Order full text |
container_end_page | 4476 |
---|---|
container_issue | |
container_start_page | 4473 |
container_title | |
container_volume | |
creator | Hermus, K. ; Girin, L. ; Van hamme, H. ; Irhimeh, S. |
description | We present a new algorithm for the automatic estimation of the voicing cut-off frequency (VCO), i.e., the frequency that separates the periodic low-frequency part from the aperiodic high-frequency part in voiced segments of natural speech. Starting from the power spectrum of a two-pitch-period speech frame, we define the VCO to be located at the frequency for which the sum of the periodic energy in the spectral band below that frequency and the aperiodic energy in the band above it is maximised. By formulating the problem in terms of a score function, we are able to apply a dynamic-programming-based smoothing technique. Remarkably smooth and accurate VCO contours were obtained, despite the simplicity of the proposed algorithm. In a formal evaluation, the algorithm compares favourably to two existing VCO estimation techniques. |
doi_str_mv | 10.1109/ICASSP.2008.4518649 |
format | Conference Proceeding |
fullrecord | <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_4518649</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>4518649</ieee_id><sourcerecordid>4518649</sourcerecordid><originalsourceid>FETCH-LOGICAL-i220t-577a25dec6349950c1647f87e9c246283adcf35cd8c30d57ebd793abff0966133</originalsourceid><addsrcrecordid>eNo1UNtKAzEUjDew1n5BX_IDW3PdbB6l1AsUFKrgW8kmJ22kTWqyK_TvXbHOyzDMMDCD0JSSGaVE3z3P71er1xkjpJkJSZta6DN0QwUTgopG6HM0YlzpimrycYEmWjX_HueXaEQlI1VNhb5Gk1I-yQAhudRyhOKidGFvupAiTh53W8DfKdgQN9j2XZW8xz7DVw_RHrFNsUt9_g1G0_XZ7HA5ANgtbk0Bh4eOrcn7FIPFJjpsDpBDcoOCCHkToNyiK292BSYnHqP3h8Xb_KlavjwOI5dVYIx0lVTKMOnA1lxoLYmltVC-UaAtEzVruHHWc2ldYzlxUkHrlOam9Z7ouqacj9H0rzcAwPqQh4n5uD5dx38AGypgEQ</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Estimation of the voicing cut-off frequency contour of natural speech based on harmonic and aperiodic energies</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Hermus, K. ; Girin, L. ; Van hamme, H. ; Irhimeh, S.</creator><creatorcontrib>Hermus, K. ; Girin, L. ; Van hamme, H. ; Irhimeh, S.</creatorcontrib><description>We present a new algorithm for the automatic estimation of the voicing cut-off frequency (VCO), i.e., the frequency that separates the periodic low-frequency part from the aperiodic high-frequency part in voiced segments of natural speech. Starting from the power spectrum of a two pitch period speech frame, we define the VCO to be located at the frequency for which the sum of the periodic and aperiodic energy in the spectral band below and above that frequency respectively, is maximised. By formulating the problem in terms of a score function we are able to apply a dynamic programming based smoothing technique. 
Remarkably smooth and accurate VCO contours were obtained, despite the simplicity of the proposed algorithm. In a formal evaluation the algorithm compares favourably to two existing VCO estimation techniques.</description><identifier>ISSN: 1520-6149</identifier><identifier>ISBN: 9781424414833</identifier><identifier>ISBN: 1424414830</identifier><identifier>EISSN: 2379-190X</identifier><identifier>EISBN: 1424414849</identifier><identifier>EISBN: 9781424414840</identifier><identifier>DOI: 10.1109/ICASSP.2008.4518649</identifier><language>eng</language><publisher>IEEE</publisher><subject>Acoustic noise ; Cutoff frequency ; Frequency estimation ; Natural languages ; Power harmonic filters ; Signal processing algorithms ; spectral analysis ; Speech analysis ; speech coding ; Speech processing ; Speech synthesis ; Voltage-controlled oscillators</subject><ispartof>2008 IEEE International Conference on Acoustics, Speech and Signal Processing, 2008, p.4473-4476</ispartof><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/4518649$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,2052,27902,54895</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/4518649$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Hermus, K.</creatorcontrib><creatorcontrib>Girin, L.</creatorcontrib><creatorcontrib>Van hamme, H.</creatorcontrib><creatorcontrib>Irhimeh, S.</creatorcontrib><title>Estimation of the voicing cut-off frequency contour of natural speech based on harmonic and aperiodic energies</title><title>2008 IEEE International Conference on Acoustics, Speech and Signal Processing</title><addtitle>ICASSP</addtitle><description>We present a new algorithm for the automatic 
estimation of the voicing cut-off frequency (VCO), i.e., the frequency that separates the periodic low-frequency part from the aperiodic high-frequency part in voiced segments of natural speech. Starting from the power spectrum of a two pitch period speech frame, we define the VCO to be located at the frequency for which the sum of the periodic and aperiodic energy in the spectral band below and above that frequency respectively, is maximised. By formulating the problem in terms of a score function we are able to apply a dynamic programming based smoothing technique. Remarkably smooth and accurate VCO contours were obtained, despite the simplicity of the proposed algorithm. In a formal evaluation the algorithm compares favourably to two existing VCO estimation techniques.</description><subject>Acoustic noise</subject><subject>Cutoff frequency</subject><subject>Frequency estimation</subject><subject>Natural languages</subject><subject>Power harmonic filters</subject><subject>Signal processing algorithms</subject><subject>spectral analysis</subject><subject>Speech analysis</subject><subject>speech coding</subject><subject>Speech processing</subject><subject>Speech synthesis</subject><subject>Voltage-controlled 
oscillators</subject><issn>1520-6149</issn><issn>2379-190X</issn><isbn>9781424414833</isbn><isbn>1424414830</isbn><isbn>1424414849</isbn><isbn>9781424414840</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2008</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNo1UNtKAzEUjDew1n5BX_IDW3PdbB6l1AsUFKrgW8kmJ22kTWqyK_TvXbHOyzDMMDCD0JSSGaVE3z3P71er1xkjpJkJSZta6DN0QwUTgopG6HM0YlzpimrycYEmWjX_HueXaEQlI1VNhb5Gk1I-yQAhudRyhOKidGFvupAiTh53W8DfKdgQN9j2XZW8xz7DVw_RHrFNsUt9_g1G0_XZ7HA5ANgtbk0Bh4eOrcn7FIPFJjpsDpBDcoOCCHkToNyiK292BSYnHqP3h8Xb_KlavjwOI5dVYIx0lVTKMOnA1lxoLYmltVC-UaAtEzVruHHWc2ldYzlxUkHrlOam9Z7ouqacj9H0rzcAwPqQh4n5uD5dx38AGypgEQ</recordid><startdate>20080101</startdate><enddate>20080101</enddate><creator>Hermus, K.</creator><creator>Girin, L.</creator><creator>Van hamme, H.</creator><creator>Irhimeh, S.</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>20080101</creationdate><title>Estimation of the voicing cut-off frequency contour of natural speech based on harmonic and aperiodic energies</title><author>Hermus, K. ; Girin, L. ; Van hamme, H. 
; Irhimeh, S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i220t-577a25dec6349950c1647f87e9c246283adcf35cd8c30d57ebd793abff0966133</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2008</creationdate><topic>Acoustic noise</topic><topic>Cutoff frequency</topic><topic>Frequency estimation</topic><topic>Natural languages</topic><topic>Power harmonic filters</topic><topic>Signal processing algorithms</topic><topic>spectral analysis</topic><topic>Speech analysis</topic><topic>speech coding</topic><topic>Speech processing</topic><topic>Speech synthesis</topic><topic>Voltage-controlled oscillators</topic><toplevel>online_resources</toplevel><creatorcontrib>Hermus, K.</creatorcontrib><creatorcontrib>Girin, L.</creatorcontrib><creatorcontrib>Van hamme, H.</creatorcontrib><creatorcontrib>Irhimeh, S.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Hermus, K.</au><au>Girin, L.</au><au>Van hamme, H.</au><au>Irhimeh, S.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Estimation of the voicing cut-off frequency contour of natural speech based on harmonic and aperiodic energies</atitle><btitle>2008 IEEE International Conference on Acoustics, Speech and Signal 
Processing</btitle><stitle>ICASSP</stitle><date>2008-01-01</date><risdate>2008</risdate><spage>4473</spage><epage>4476</epage><pages>4473-4476</pages><issn>1520-6149</issn><eissn>2379-190X</eissn><isbn>9781424414833</isbn><isbn>1424414830</isbn><eisbn>1424414849</eisbn><eisbn>9781424414840</eisbn><abstract>We present a new algorithm for the automatic estimation of the voicing cut-off frequency (VCO), i.e., the frequency that separates the periodic low-frequency part from the aperiodic high-frequency part in voiced segments of natural speech. Starting from the power spectrum of a two pitch period speech frame, we define the VCO to be located at the frequency for which the sum of the periodic and aperiodic energy in the spectral band below and above that frequency respectively, is maximised. By formulating the problem in terms of a score function we are able to apply a dynamic programming based smoothing technique. Remarkably smooth and accurate VCO contours were obtained, despite the simplicity of the proposed algorithm. In a formal evaluation the algorithm compares favourably to two existing VCO estimation techniques.</abstract><pub>IEEE</pub><doi>10.1109/ICASSP.2008.4518649</doi><tpages>4</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1520-6149 |
ispartof | 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, 2008, p.4473-4476 |
issn | 1520-6149 ; 2379-190X |
language | eng |
recordid | cdi_ieee_primary_4518649 |
source | IEEE Electronic Library (IEL) Conference Proceedings |
subjects | Acoustic noise ; Cutoff frequency ; Frequency estimation ; Natural languages ; Power harmonic filters ; Signal processing algorithms ; spectral analysis ; Speech analysis ; speech coding ; Speech processing ; Speech synthesis ; Voltage-controlled oscillators |
title | Estimation of the voicing cut-off frequency contour of natural speech based on harmonic and aperiodic energies |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-14T13%3A55%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Estimation%20of%20the%20voicing%20cut-off%20frequency%20contour%20of%20natural%20speech%20based%20on%20harmonic%20and%20aperiodic%20energies&rft.btitle=2008%20IEEE%20International%20Conference%20on%20Acoustics,%20Speech%20and%20Signal%20Processing&rft.au=Hermus,%20K.&rft.date=2008-01-01&rft.spage=4473&rft.epage=4476&rft.pages=4473-4476&rft.issn=1520-6149&rft.eissn=2379-190X&rft.isbn=9781424414833&rft.isbn_list=1424414830&rft_id=info:doi/10.1109/ICASSP.2008.4518649&rft_dat=%3Cieee_6IE%3E4518649%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=1424414849&rft.eisbn_list=9781424414840&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=4518649&rfr_iscdi=true |
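
The description field defines the VCO as the frequency that maximises the periodic energy below it plus the aperiodic energy above it, and mentions a dynamic-programming smoothing of the resulting score. The following is a minimal sketch of that idea, not the authors' implementation. It assumes that per-band periodic and aperiodic energies are already available for each frame (the record does not say how they are derived from the power spectrum). The function names vco_scores and smooth_contour, the parameter jump_penalty, and the linear penalty on jumps between neighbouring frames are all hypothetical choices made for illustration.

```python
from itertools import accumulate


def vco_scores(periodic, aperiodic):
    """Score every candidate cut-off index c (0..n) for one frame.

    score[c] = sum(periodic[:c]) + sum(aperiodic[c:]), i.e. periodic energy
    in the bands below the cut-off plus aperiodic energy in the bands above.
    Inputs are hypothetical per-band energies (assumption, not from the paper).
    """
    if len(periodic) != len(aperiodic):
        raise ValueError("periodic and aperiodic must have the same length")
    n = len(periodic)
    per_prefix = [0.0] + list(accumulate(periodic))    # per_prefix[c] = sum(periodic[:c])
    aper_prefix = [0.0] + list(accumulate(aperiodic))  # aper_prefix[c] = sum(aperiodic[:c])
    total_aper = aper_prefix[-1]
    return [per_prefix[c] + (total_aper - aper_prefix[c]) for c in range(n + 1)]


def smooth_contour(score_frames, jump_penalty=1.0):
    """Pick one cut-off index per frame with a Viterbi-style search.

    Maximises the sum of per-frame scores minus jump_penalty * |c_t - c_(t-1)|.
    The linear jump penalty is an assumption made for this sketch.
    """
    if not score_frames:
        return []
    best = list(score_frames[0])  # best[c] = best total score of a path ending at c
    back = []                     # back[t][c] = best predecessor of c at frame t + 1
    for scores in score_frames[1:]:
        prev, best, ptr = best, [], []
        for c, s in enumerate(scores):
            p = max(range(len(prev)),
                    key=lambda q: prev[q] - jump_penalty * abs(c - q))
            best.append(prev[p] - jump_penalty * abs(c - p) + s)
            ptr.append(p)
        back.append(ptr)
    # Backtrack from the best final state to recover the whole contour.
    c = max(range(len(best)), key=best.__getitem__)
    path = [c]
    for ptr in reversed(back):
        c = ptr[c]
        path.append(c)
    path.reverse()
    return path


if __name__ == "__main__":
    # Toy frame: periodic energy dominates the low bands, aperiodic the high bands.
    scores = vco_scores([9, 8, 7, 1, 0.5], [1, 1, 2, 6, 7])
    print(scores.index(max(scores)))
    # Toy contour: the middle frame has a spurious peak at index 0.
    frames = [[0, 0, 0, 10, 0], [6, 0, 0, 5, 0], [0, 0, 0, 10, 0]]
    print(smooth_contour(frames, jump_penalty=0.0))
    print(smooth_contour(frames, jump_penalty=1.0))
```

Setting jump_penalty to zero reduces the search to an independent per-frame maximum, so the parameter directly trades frame-level fit against contour smoothness. Cut-off indices would still need to be mapped to frequencies using the band layout of the chosen spectral analysis.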