A spectro-temporal algorithm for pitch frequency estimation from noisy observations
A novel algorithm for pitch frequency (PF) estimation from non-stationary noise-corrupted speech observations is presented in this paper based on both spectral pre-processing and temporal representation. A modified power spectral subtraction based de-noising scheme that allows tracking the time-vari...
Main authors: | Shahnaz, C., Zhu, W.-P., Ahmad, M.O. |
---|---|
Format: | Conference proceeding |
Language: | eng |
container_end_page | 1707 |
---|---|
container_issue | |
container_start_page | 1704 |
container_title | |
container_volume | |
creator | Shahnaz, C. ; Zhu, W.-P. ; Ahmad, M.O. |
description | A novel algorithm for pitch frequency (PF) estimation from non-stationary noise-corrupted speech observations is presented in this paper based on both spectral pre-processing and temporal representation. A modified power spectral subtraction based de-noising scheme that allows tracking the time-variation of the underlying non-stationary noise is put forward to enhance speech prior to PF estimation. The de-noised speech is then utilized to propose a squared difference function of the Linear Prediction (LP) residual which is expected to reveal more prominent dips at integral multiples of the pitch period compared to that revealed by the LP residual. The dips at different pitch-harmonic locations are added and weighted by a periodicity dependent weighting factor for every possible pitch period thus yielding a weighted and harmonically summed temporal function which is globally minimized to extract the desired PF. Simulation results using the Keele database show the superior efficacy of the proposed method in the presence of a multi-talker babble noise relative to some of the existing methods. |
doi_str_mv | 10.1109/ISCAS.2008.4541765 |
format | Conference Proceeding |
isbn | 9781424416837 ; 1424416833 |
eisbn | 1424416841 ; 9781424416844 |
lccn | 80-646530 |
publisher | IEEE |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 0271-4302 |
ispartof | 2008 IEEE International Symposium on Circuits and Systems, 2008, p.1704-1707 |
issn | 0271-4302 ; 2158-1525 |
language | eng |
recordid | cdi_ieee_primary_4541765 |
source | IEEE Electronic Library (IEL) Conference Proceedings |
subjects | Additive noise ; Degradation ; Filters ; Frequency estimation ; Noise reduction ; Personal digital assistants ; Signal processing algorithms ; Signal to noise ratio ; Speech analysis ; Speech enhancement |
title | A spectro-temporal algorithm for pitch frequency estimation from noisy observations |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-19T13%3A57%3A52IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=A%20spectro-temporal%20algorithm%20for%20pitch%20frequency%20estimation%20from%20noisy%20observations&rft.btitle=2008%20IEEE%20International%20Symposium%20on%20Circuits%20and%20Systems&rft.au=Shahnaz,%20C.&rft.date=2008-01-01&rft.spage=1704&rft.epage=1707&rft.pages=1704-1707&rft.issn=0271-4302&rft.eissn=2158-1525&rft.isbn=9781424416837&rft.isbn_list=1424416833&rft_id=info:doi/10.1109/ISCAS.2008.4541765&rft_dat=%3Cproquest_6IE%3E34537213%3C/proquest_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=1424416841&rft.eisbn_list=9781424416844&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=34537213&rft_id=info:pmid/&rft_ieee_id=4541765&rfr_iscdi=true |
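
The temporal stage described in the abstract (LP residual, squared difference function, weighted harmonic summation, global minimization) can be illustrated with a minimal sketch. This is not the authors' implementation: the record gives no formulas, so the function names, the LP order, the search range, the number of harmonics, and in particular the weighting are assumptions. The abstract's "periodicity dependent weighting factor" is not defined here, and the linear penalty on longer periods below is only a placeholder. The spectral-subtraction front end is omitted entirely.

```python
import numpy as np


def lp_residual(x, order=10):
    """LP residual via the autocorrelation method (order is an assumption)."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    # Autocorrelation at lags 0..order.
    r = np.correlate(x, x, mode="full")[n - 1 : n + order]
    # Toeplitz normal equations, with a small ridge for numerical stability.
    idx = np.abs(np.subtract.outer(np.arange(order), np.arange(order)))
    R = r[idx] + 1e-9 * r[0] * np.eye(order)
    a = np.linalg.solve(R, r[1 : order + 1])
    # Prediction x_hat[n] = sum_k a[k] * x[n - 1 - k].
    pred = np.convolve(x, np.concatenate(([0.0], a)))[:n]
    return (x - pred)[order:]  # drop the start-up samples


def squared_difference(e, max_lag):
    """d[lag] = sum_n (e[n] - e[n + lag])^2 over a fixed-length window."""
    n = len(e) - max_lag
    return np.array(
        [np.sum((e[:n] - e[lag : lag + n]) ** 2) for lag in range(max_lag + 1)]
    )


def estimate_pitch(x, fs, f_min=50.0, f_max=400.0, harmonics=3, order=10):
    """Return a pitch estimate in Hz for one frame (hypothetical helper)."""
    t_min, t_max = int(fs / f_max), int(fs / f_min)
    e = lp_residual(x, order)
    max_lag = harmonics * t_max
    if len(e) <= max_lag:
        raise ValueError("frame too short for the requested search range")
    d = squared_difference(e, max_lag)
    periods = np.arange(t_min, t_max + 1)
    # Average the dips at T, 2T, ..., harmonics * T for every candidate T.
    multiples = np.outer(periods, np.arange(1, harmonics + 1))
    summed = d[multiples].mean(axis=1)
    # Placeholder weight (assumption, not the paper's factor): mildly
    # penalise longer periods to discourage picking a multiple of the period.
    weight = 1.0 + periods / t_max
    best = periods[np.argmin(weight * summed)]
    return fs / best
```

A frame must be longer than `harmonics * fs / f_min` samples plus the LP order, so with the defaults at an 8 kHz sampling rate a frame of roughly 1200 samples is a workable choice. Averaging over a fixed number of multiples keeps candidates comparable; a different normalisation or weighting would change the behaviour near octave errors.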