Estimation of the voicing cut-off frequency contour of natural speech based on harmonic and aperiodic energies

We present a new algorithm for the automatic estimation of the voicing cut-off frequency (VCO), i.e., the frequency that separates the periodic low-frequency part from the aperiodic high-frequency part in voiced segments of natural speech. Starting from the power spectrum of a two pitch period speec...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Hermus, K., Girin, L., Van hamme, H., Irhimeh, S.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 4476
container_issue
container_start_page 4473
container_title
container_volume
creator Hermus, K.
Girin, L.
Van hamme, H.
Irhimeh, S.
description We present a new algorithm for the automatic estimation of the voicing cut-off frequency (VCO), i.e., the frequency that separates the periodic low-frequency part from the aperiodic high-frequency part in voiced segments of natural speech. Starting from the power spectrum of a two pitch period speech frame, we define the VCO to be located at the frequency for which the sum of the periodic and aperiodic energy in the spectral band below and above that frequency respectively, is maximised. By formulating the problem in terms of a score function we are able to apply a dynamic programming based smoothing technique. Remarkably smooth and accurate VCO contours were obtained, despite the simplicity of the proposed algorithm. In a formal evaluation the algorithm compares favourably to two existing VCO estimation techniques.
doi_str_mv 10.1109/ICASSP.2008.4518649
format Conference Proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_4518649</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>4518649</ieee_id><sourcerecordid>4518649</sourcerecordid><originalsourceid>FETCH-LOGICAL-i220t-577a25dec6349950c1647f87e9c246283adcf35cd8c30d57ebd793abff0966133</originalsourceid><addsrcrecordid>eNo1UNtKAzEUjDew1n5BX_IDW3PdbB6l1AsUFKrgW8kmJ22kTWqyK_TvXbHOyzDMMDCD0JSSGaVE3z3P71er1xkjpJkJSZta6DN0QwUTgopG6HM0YlzpimrycYEmWjX_HueXaEQlI1VNhb5Gk1I-yQAhudRyhOKidGFvupAiTh53W8DfKdgQN9j2XZW8xz7DVw_RHrFNsUt9_g1G0_XZ7HA5ANgtbk0Bh4eOrcn7FIPFJjpsDpBDcoOCCHkToNyiK292BSYnHqP3h8Xb_KlavjwOI5dVYIx0lVTKMOnA1lxoLYmltVC-UaAtEzVruHHWc2ldYzlxUkHrlOam9Z7ouqacj9H0rzcAwPqQh4n5uD5dx38AGypgEQ</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Estimation of the voicing cut-off frequency contour of natural speech based on harmonic and aperiodic energies</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Hermus, K. ; Girin, L. ; Van hamme, H. ; Irhimeh, S.</creator><creatorcontrib>Hermus, K. ; Girin, L. ; Van hamme, H. ; Irhimeh, S.</creatorcontrib><description>We present a new algorithm for the automatic estimation of the voicing cut-off frequency (VCO), i.e., the frequency that separates the periodic low-frequency part from the aperiodic high-frequency part in voiced segments of natural speech. Starting from the power spectrum of a two pitch period speech frame, we define the VCO to be located at the frequency for which the sum of the periodic and aperiodic energy in the spectral band below and above that frequency respectively, is maximised. By formulating the problem in terms of a score function we are able to apply a dynamic programming based smoothing technique. Remarkably smooth and accurate VCO contours were obtained, despite the simplicity of the proposed algorithm. In a formal evaluation the algorithm compares favourably to two existing VCO estimation techniques.</description><identifier>ISSN: 1520-6149</identifier><identifier>ISBN: 9781424414833</identifier><identifier>ISBN: 1424414830</identifier><identifier>EISSN: 2379-190X</identifier><identifier>EISBN: 1424414849</identifier><identifier>EISBN: 9781424414840</identifier><identifier>DOI: 10.1109/ICASSP.2008.4518649</identifier><language>eng</language><publisher>IEEE</publisher><subject>Acoustic noise ; Cutoff frequency ; Frequency estimation ; Natural languages ; Power harmonic filters ; Signal processing algorithms ; spectral analysis ; Speech analysis ; speech coding ; Speech processing ; Speech synthesis ; Voltage-controlled oscillators</subject><ispartof>2008 IEEE International Conference on Acoustics, Speech and Signal Processing, 2008, p.4473-4476</ispartof><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/4518649$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,2052,27902,54895</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/4518649$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Hermus, K.</creatorcontrib><creatorcontrib>Girin, L.</creatorcontrib><creatorcontrib>Van hamme, H.</creatorcontrib><creatorcontrib>Irhimeh, S.</creatorcontrib><title>Estimation of the voicing cut-off frequency contour of natural speech based on harmonic and aperiodic energies</title><title>2008 IEEE International Conference on Acoustics, Speech and Signal Processing</title><addtitle>ICASSP</addtitle><description>We present a new algorithm for the automatic estimation of the voicing cut-off frequency (VCO), i.e., the frequency that separates the periodic low-frequency part from the aperiodic high-frequency part in voiced segments of natural speech. Starting from the power spectrum of a two pitch period speech frame, we define the VCO to be located at the frequency for which the sum of the periodic and aperiodic energy in the spectral band below and above that frequency respectively, is maximised. By formulating the problem in terms of a score function we are able to apply a dynamic programming based smoothing technique. Remarkably smooth and accurate VCO contours were obtained, despite the simplicity of the proposed algorithm. In a formal evaluation the algorithm compares favourably to two existing VCO estimation techniques.</description><subject>Acoustic noise</subject><subject>Cutoff frequency</subject><subject>Frequency estimation</subject><subject>Natural languages</subject><subject>Power harmonic filters</subject><subject>Signal processing algorithms</subject><subject>spectral analysis</subject><subject>Speech analysis</subject><subject>speech coding</subject><subject>Speech processing</subject><subject>Speech synthesis</subject><subject>Voltage-controlled oscillators</subject><issn>1520-6149</issn><issn>2379-190X</issn><isbn>9781424414833</isbn><isbn>1424414830</isbn><isbn>1424414849</isbn><isbn>9781424414840</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2008</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNo1UNtKAzEUjDew1n5BX_IDW3PdbB6l1AsUFKrgW8kmJ22kTWqyK_TvXbHOyzDMMDCD0JSSGaVE3z3P71er1xkjpJkJSZta6DN0QwUTgopG6HM0YlzpimrycYEmWjX_HueXaEQlI1VNhb5Gk1I-yQAhudRyhOKidGFvupAiTh53W8DfKdgQN9j2XZW8xz7DVw_RHrFNsUt9_g1G0_XZ7HA5ANgtbk0Bh4eOrcn7FIPFJjpsDpBDcoOCCHkToNyiK292BSYnHqP3h8Xb_KlavjwOI5dVYIx0lVTKMOnA1lxoLYmltVC-UaAtEzVruHHWc2ldYzlxUkHrlOam9Z7ouqacj9H0rzcAwPqQh4n5uD5dx38AGypgEQ</recordid><startdate>20080101</startdate><enddate>20080101</enddate><creator>Hermus, K.</creator><creator>Girin, L.</creator><creator>Van hamme, H.</creator><creator>Irhimeh, S.</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>20080101</creationdate><title>Estimation of the voicing cut-off frequency contour of natural speech based on harmonic and aperiodic energies</title><author>Hermus, K. ; Girin, L. ; Van hamme, H. ; Irhimeh, S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i220t-577a25dec6349950c1647f87e9c246283adcf35cd8c30d57ebd793abff0966133</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2008</creationdate><topic>Acoustic noise</topic><topic>Cutoff frequency</topic><topic>Frequency estimation</topic><topic>Natural languages</topic><topic>Power harmonic filters</topic><topic>Signal processing algorithms</topic><topic>spectral analysis</topic><topic>Speech analysis</topic><topic>speech coding</topic><topic>Speech processing</topic><topic>Speech synthesis</topic><topic>Voltage-controlled oscillators</topic><toplevel>online_resources</toplevel><creatorcontrib>Hermus, K.</creatorcontrib><creatorcontrib>Girin, L.</creatorcontrib><creatorcontrib>Van hamme, H.</creatorcontrib><creatorcontrib>Irhimeh, S.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Hermus, K.</au><au>Girin, L.</au><au>Van hamme, H.</au><au>Irhimeh, S.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Estimation of the voicing cut-off frequency contour of natural speech based on harmonic and aperiodic energies</atitle><btitle>2008 IEEE International Conference on Acoustics, Speech and Signal Processing</btitle><stitle>ICASSP</stitle><date>2008-01-01</date><risdate>2008</risdate><spage>4473</spage><epage>4476</epage><pages>4473-4476</pages><issn>1520-6149</issn><eissn>2379-190X</eissn><isbn>9781424414833</isbn><isbn>1424414830</isbn><eisbn>1424414849</eisbn><eisbn>9781424414840</eisbn><abstract>We present a new algorithm for the automatic estimation of the voicing cut-off frequency (VCO), i.e., the frequency that separates the periodic low-frequency part from the aperiodic high-frequency part in voiced segments of natural speech. Starting from the power spectrum of a two pitch period speech frame, we define the VCO to be located at the frequency for which the sum of the periodic and aperiodic energy in the spectral band below and above that frequency respectively, is maximised. By formulating the problem in terms of a score function we are able to apply a dynamic programming based smoothing technique. Remarkably smooth and accurate VCO contours were obtained, despite the simplicity of the proposed algorithm. In a formal evaluation the algorithm compares favourably to two existing VCO estimation techniques.</abstract><pub>IEEE</pub><doi>10.1109/ICASSP.2008.4518649</doi><tpages>4</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1520-6149
ispartof 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, 2008, p.4473-4476
issn 1520-6149
2379-190X
language eng
recordid cdi_ieee_primary_4518649
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Acoustic noise
Cutoff frequency
Frequency estimation
Natural languages
Power harmonic filters
Signal processing algorithms
spectral analysis
Speech analysis
speech coding
Speech processing
Speech synthesis
Voltage-controlled oscillators
title Estimation of the voicing cut-off frequency contour of natural speech based on harmonic and aperiodic energies
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-14T13%3A55%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Estimation%20of%20the%20voicing%20cut-off%20frequency%20contour%20of%20natural%20speech%20based%20on%20harmonic%20and%20aperiodic%20energies&rft.btitle=2008%20IEEE%20International%20Conference%20on%20Acoustics,%20Speech%20and%20Signal%20Processing&rft.au=Hermus,%20K.&rft.date=2008-01-01&rft.spage=4473&rft.epage=4476&rft.pages=4473-4476&rft.issn=1520-6149&rft.eissn=2379-190X&rft.isbn=9781424414833&rft.isbn_list=1424414830&rft_id=info:doi/10.1109/ICASSP.2008.4518649&rft_dat=%3Cieee_6IE%3E4518649%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=1424414849&rft.eisbn_list=9781424414840&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=4518649&rfr_iscdi=true