A spectro-temporal algorithm for pitch frequency estimation from noisy observations

A novel algorithm for pitch frequency (PF) estimation from non-stationary noise-corrupted speech observations is presented in this paper based on both spectral pre-processing and temporal representation. A modified power spectral subtraction based de-noising scheme that allows tracking the time-vari...

Bibliographic details
Main authors: Shahnaz, C., Zhu, W.-P., Ahmad, M.O.
Format: Conference proceeding
Language: English
container_end_page 1707
container_issue
container_start_page 1704
container_title
container_volume
creator Shahnaz, C.
Zhu, W.-P.
Ahmad, M.O.
description This paper presents a novel algorithm for pitch frequency (PF) estimation from speech corrupted by non-stationary noise, based on both spectral pre-processing and a temporal representation. A modified power-spectral-subtraction de-noising scheme that tracks the time variation of the underlying non-stationary noise is put forward to enhance the speech prior to PF estimation. From the de-noised speech, a squared difference function of the Linear Prediction (LP) residual is proposed, which is expected to reveal more prominent dips at integer multiples of the pitch period than the LP residual itself. The dips at the different pitch-harmonic locations are summed and weighted by a periodicity-dependent weighting factor for every candidate pitch period, yielding a weighted, harmonically summed temporal function that is globally minimized to extract the desired PF. Simulation results on the Keele database show that the proposed method outperforms some existing methods in the presence of multi-talker babble noise.
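The temporal stage described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the record gives no formulas, so the function names (`squared_difference`, `estimate_pitch`), the search range of 50 to 400 Hz, the averaging over harmonics, and the linear weight that mildly favours shorter periods are all assumptions made for illustration. The spectral-subtraction and LP-analysis stages are omitted, and the input is simply assumed to be an LP residual frame.

```python
def squared_difference(residual, max_lag):
    """Return d where d[tau] is the mean of (r[n] - r[n + tau])**2.

    A periodic residual produces dips (values near zero) at lags that
    are integer multiples of the pitch period.
    """
    n = len(residual)
    d = [0.0] * (max_lag + 1)
    for tau in range(1, max_lag + 1):
        terms = n - tau
        total = sum((residual[i] - residual[i + tau]) ** 2 for i in range(terms))
        d[tau] = total / terms
    return d


def estimate_pitch(residual, fs, f_min=50.0, f_max=400.0):
    """Estimate pitch frequency in Hz by minimizing a weighted harmonic sum.

    f_min and f_max are assumed search limits, not values from the paper.
    """
    tau_min = int(fs / f_max)
    tau_max = int(fs / f_min)
    if len(residual) <= tau_max:
        raise ValueError("frame is too short for the requested pitch range")
    d = squared_difference(residual, tau_max)

    def score(tau):
        # Collect the dips at tau, 2*tau, 3*tau, ... up to the largest lag.
        dips = [d[k * tau] for k in range(1, tau_max // tau + 1)]
        # Hypothetical weight: a mild linear preference for shorter periods.
        weight = 1.0 + tau / tau_max
        # Averaging (an assumption) keeps candidates with different
        # numbers of harmonics comparable.
        return weight * sum(dips) / len(dips)

    # Global minimum over all candidate periods; on exact ties, min()
    # returns the first (shortest) candidate.
    best_tau = min(range(tau_min, tau_max + 1), key=score)
    return fs / best_tau


# Synthetic residual: one impulse every 80 samples at fs = 8000 Hz.
frame = [1.0 if n % 80 == 0 else 0.0 for n in range(480)]
print(estimate_pitch(frame, 8000))
```

On this noise-free impulse train the squared difference function is exactly zero at lags 80 and 160, so the search returns the shorter period, that is 8000 / 80 = 100 Hz. Real speech would need the de-noising and LP-residual steps named in the abstract before this stage.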
doi_str_mv 10.1109/ISCAS.2008.4541765
format Conference Proceeding
fullrecord <record><control><sourceid>proquest_6IE</sourceid><recordid>TN_cdi_ieee_primary_4541765</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>4541765</ieee_id><sourcerecordid>34537213</sourcerecordid><originalsourceid>FETCH-LOGICAL-i206t-ab75105b2fc39ec9dafddcc41c5497fdb83478c087ac157a3895cfcf4a6e1da43</originalsourceid><addsrcrecordid>eNo1kEtPwzAQhM2jEqH0D8DFJ24ufsbOsap4VKrEoXCOHMemRkkcbBep_56Iwl5Gmvm0ml0AbgleEoKrh81uvdotKcZqyQUnshRn4JpwyjkpFSfnoKBEKEQEFRdgUUn1nzF5CQpMJUGcYToDhcKo5KVg-AosUvrE03DBqKAF2K1gGq3JMaBs-zFE3UHdfYTo876HLkQ4-mz20EX7dbCDOUKbsu919mGYzNDDIfh0hKFJNn7_2ukGzJzukl386Ry8Pz2-rV_Q9vV5s15tkae4zEg3UhAsGuoMq6ypWu3a1hhOjOCVdG2jGJfKYCW1IUJqpiphnHFcl5a0mrM5uD_tHWOYyqVc9z4Z23V6sOGQajYdKSlhE3h3Ar21th7j1D8e67-nsh9J2Ga5</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype><pqid>34537213</pqid></control><display><type>conference_proceeding</type><title>A spectro-temporal algorithm for pitch frequency estimation from noisy observations</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Shahnaz, C. ; Zhu, W.-P. ; Ahmad, M.O.</creator><creatorcontrib>Shahnaz, C. ; Zhu, W.-P. ; Ahmad, M.O.</creatorcontrib><description>A novel algorithm for pitch frequency (PF) estimation from non-stationary noise-corrupted speech observations is presented in this paper based on both spectral pre-processing and temporal representation. A modified power spectral subtraction based de-noising scheme that allows tracking the time-variation of the underlying non-stationary noise is put forward to enhance speech prior to PF estimation. The de-noised speech is then utilized to propose a squared difference function of the Linear Prediction (LP) residual which is expected to reveal more prominent dips at integral multiples of the pitch period compared to that revealed by the LP residual. 
The dips at different pitch-harmonic locations are added and weighted by a periodicity dependent weighting factor for every possible pitch period thus yielding a weighted and harmonically summed temporal function which is globally minimized to extract the desired PF. Simulation results using the Keele database show the superior efficacy of the proposed method in the presence of a multi-talker babble noise relative to some of the existing methods.</description><identifier>ISSN: 0271-4302</identifier><identifier>ISBN: 9781424416837</identifier><identifier>ISBN: 1424416833</identifier><identifier>EISSN: 2158-1525</identifier><identifier>EISBN: 1424416841</identifier><identifier>EISBN: 9781424416844</identifier><identifier>DOI: 10.1109/ISCAS.2008.4541765</identifier><identifier>LCCN: 80-646530</identifier><language>eng</language><publisher>IEEE</publisher><subject>Additive noise ; Degradation ; Filters ; Frequency estimation ; Noise reduction ; Personal digital assistants ; Signal processing algorithms ; Signal to noise ratio ; Speech analysis ; Speech enhancement</subject><ispartof>2008 IEEE International Symposium on Circuits and Systems, 2008, p.1704-1707</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/4541765$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,314,777,781,786,787,2052,27905,27906,54901</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/4541765$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Shahnaz, C.</creatorcontrib><creatorcontrib>Zhu, W.-P.</creatorcontrib><creatorcontrib>Ahmad, M.O.</creatorcontrib><title>A spectro-temporal algorithm for pitch frequency estimation from noisy observations</title><title>2008 IEEE International Symposium on Circuits 
and Systems</title><addtitle>ISCAS</addtitle><description>A novel algorithm for pitch frequency (PF) estimation from non-stationary noise-corrupted speech observations is presented in this paper based on both spectral pre-processing and temporal representation. A modified power spectral subtraction based de-noising scheme that allows tracking the time-variation of the underlying non-stationary noise is put forward to enhance speech prior to PF estimation. The de-noised speech is then utilized to propose a squared difference function of the Linear Prediction (LP) residual which is expected to reveal more prominent dips at integral multiples of the pitch period compared to that revealed by the LP residual. The dips at different pitch-harmonic locations are added and weighted by a periodicity dependent weighting factor for every possible pitch period thus yielding a weighted and harmonically summed temporal function which is globally minimized to extract the desired PF. Simulation results using the Keele database show the superior efficacy of the proposed method in the presence of a multi-talker babble noise relative to some of the existing methods.</description><subject>Additive noise</subject><subject>Degradation</subject><subject>Filters</subject><subject>Frequency estimation</subject><subject>Noise reduction</subject><subject>Personal digital assistants</subject><subject>Signal processing algorithms</subject><subject>Signal to noise ratio</subject><subject>Speech analysis</subject><subject>Speech 
enhancement</subject><issn>0271-4302</issn><issn>2158-1525</issn><isbn>9781424416837</isbn><isbn>1424416833</isbn><isbn>1424416841</isbn><isbn>9781424416844</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2008</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNo1kEtPwzAQhM2jEqH0D8DFJ24ufsbOsap4VKrEoXCOHMemRkkcbBep_56Iwl5Gmvm0ml0AbgleEoKrh81uvdotKcZqyQUnshRn4JpwyjkpFSfnoKBEKEQEFRdgUUn1nzF5CQpMJUGcYToDhcKo5KVg-AosUvrE03DBqKAF2K1gGq3JMaBs-zFE3UHdfYTo876HLkQ4-mz20EX7dbCDOUKbsu919mGYzNDDIfh0hKFJNn7_2ukGzJzukl386Ry8Pz2-rV_Q9vV5s15tkae4zEg3UhAsGuoMq6ypWu3a1hhOjOCVdG2jGJfKYCW1IUJqpiphnHFcl5a0mrM5uD_tHWOYyqVc9z4Z23V6sOGQajYdKSlhE3h3Ar21th7j1D8e67-nsh9J2Ga5</recordid><startdate>20080101</startdate><enddate>20080101</enddate><creator>Shahnaz, C.</creator><creator>Zhu, W.-P.</creator><creator>Ahmad, M.O.</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20080101</creationdate><title>A spectro-temporal algorithm for pitch frequency estimation from noisy observations</title><author>Shahnaz, C. ; Zhu, W.-P. 
; Ahmad, M.O.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i206t-ab75105b2fc39ec9dafddcc41c5497fdb83478c087ac157a3895cfcf4a6e1da43</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2008</creationdate><topic>Additive noise</topic><topic>Degradation</topic><topic>Filters</topic><topic>Frequency estimation</topic><topic>Noise reduction</topic><topic>Personal digital assistants</topic><topic>Signal processing algorithms</topic><topic>Signal to noise ratio</topic><topic>Speech analysis</topic><topic>Speech enhancement</topic><toplevel>online_resources</toplevel><creatorcontrib>Shahnaz, C.</creatorcontrib><creatorcontrib>Zhu, W.-P.</creatorcontrib><creatorcontrib>Ahmad, M.O.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Shahnaz, C.</au><au>Zhu, W.-P.</au><au>Ahmad, M.O.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>A spectro-temporal algorithm for pitch frequency estimation from noisy 
observations</atitle><btitle>2008 IEEE International Symposium on Circuits and Systems</btitle><stitle>ISCAS</stitle><date>2008-01-01</date><risdate>2008</risdate><spage>1704</spage><epage>1707</epage><pages>1704-1707</pages><issn>0271-4302</issn><eissn>2158-1525</eissn><isbn>9781424416837</isbn><isbn>1424416833</isbn><eisbn>1424416841</eisbn><eisbn>9781424416844</eisbn><abstract>A novel algorithm for pitch frequency (PF) estimation from non-stationary noise-corrupted speech observations is presented in this paper based on both spectral pre-processing and temporal representation. A modified power spectral subtraction based de-noising scheme that allows tracking the time-variation of the underlying non-stationary noise is put forward to enhance speech prior to PF estimation. The de-noised speech is then utilized to propose a squared difference function of the Linear Prediction (LP) residual which is expected to reveal more prominent dips at integral multiples of the pitch period compared to that revealed by the LP residual. The dips at different pitch-harmonic locations are added and weighted by a periodicity dependent weighting factor for every possible pitch period thus yielding a weighted and harmonically summed temporal function which is globally minimized to extract the desired PF. Simulation results using the Keele database show the superior efficacy of the proposed method in the presence of a multi-talker babble noise relative to some of the existing methods.</abstract><pub>IEEE</pub><doi>10.1109/ISCAS.2008.4541765</doi><tpages>4</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 0271-4302
ispartof 2008 IEEE International Symposium on Circuits and Systems, 2008, p.1704-1707
issn 0271-4302
2158-1525
language eng
recordid cdi_ieee_primary_4541765
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Additive noise
Degradation
Filters
Frequency estimation
Noise reduction
Personal digital assistants
Signal processing algorithms
Signal to noise ratio
Speech analysis
Speech enhancement
title A spectro-temporal algorithm for pitch frequency estimation from noisy observations
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-19T13%3A57%3A52IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=A%20spectro-temporal%20algorithm%20for%20pitch%20frequency%20estimation%20from%20noisy%20observations&rft.btitle=2008%20IEEE%20International%20Symposium%20on%20Circuits%20and%20Systems&rft.au=Shahnaz,%20C.&rft.date=2008-01-01&rft.spage=1704&rft.epage=1707&rft.pages=1704-1707&rft.issn=0271-4302&rft.eissn=2158-1525&rft.isbn=9781424416837&rft.isbn_list=1424416833&rft_id=info:doi/10.1109/ISCAS.2008.4541765&rft_dat=%3Cproquest_6IE%3E34537213%3C/proquest_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=1424416841&rft.eisbn_list=9781424416844&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=34537213&rft_id=info:pmid/&rft_ieee_id=4541765&rfr_iscdi=true