Transient Noise Reduction Using Nonlocal Diffusion Filters

Enhancement of speech signals for hands-free communication systems has attracted significant research efforts in the last few decades. Still, many aspects and applications remain open and require further research. One of the important open problems is the single-channel transient noise reduction. In...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on audio, speech, and language processing speech, and language processing, 2011-08, Vol.19 (6), p.1584-1599
Hauptverfasser:	Talmon, R, Cohen, I, Gannot, S
Format:	Artikel
Sprache:	eng
Schlagworte:	Acoustic noise Algorithms Applied sciences Detection, estimation, filtering, equalization, prediction Diffusion Estimates Exact sciences and technology impulse noise Information, signal and communications theory Kernel Mathematical models Noise Noise reduction Signal and communications theory Signal processing Signal, noise Spectra Speech Speech enhancement Speech processing Studies Telecommunications and information theory Transient analysis transient noise
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	1599
container_issue	6
container_start_page	1584
container_title	IEEE transactions on audio, speech, and language processing
container_volume	19
creator	Talmon, R Cohen, I Gannot, S
description	Enhancement of speech signals for hands-free communication systems has attracted significant research efforts in the last few decades. Still, many aspects and applications remain open and require further research. One of the important open problems is the single-channel transient noise reduction. In this paper, we present a novel approach for transient noise reduction that relies on non-local (NL) neighborhood filters. In particular, we propose an algorithm for the enhancement of a speech signal contaminated by repeating transient noise events. We assume that the time duration of each reoccurring transient event is relatively short compared to speech phonemes and model the speech source as an auto-regressive (AR) process. The proposed algorithm consists of two stages. In the first stage, we estimate the power spectral density (PSD) of the transient noise by employing a NL neighborhood filter. In the second stage, we utilize the optimally modified log spectral amplitude (OM-LSA) estimator for denoising the speech using the noise PSD estimate from the first stage. Based on a statistical model for the measurements and diffusion interpretation of NL filtering, we obtain further insight into the algorithm behavior. In particular, for given transient noise, we determine whether estimation of the noise PSD is feasible using our approach, how to properly set the algorithm parameters, and what is the expected performance of the algorithm. Experimental study shows good results in enhancing speech signals contaminated by transient noise, such as typical household noises, construction sounds, keyboard typing, and metronome clacks.
doi_str_mv	10.1109/TASL.2010.2093651
format	Article
fullrecord	<record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_ieee_primary_5640657</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>5640657</ieee_id><sourcerecordid>1010881197</sourcerecordid><originalsourceid>FETCH-LOGICAL-c355t-e3883c7c275083321a3088cff70160094c07cc3aa85842b9dc9d12ece771e9f63</originalsourceid><addsrcrecordid>eNpdkE1LAzEQhoMoWKs_QLwsguBlaybf8SbVqlAUtD2HmGYlZbtbk92D_94sLT14yiTzzEvmQegS8AQA67vFw-d8QnC-Eqyp4HCERsC5KqUm7PhQgzhFZymtMWZUMBih-0W0TQq-6Yq3NiRffPhV77rQNsUyheY7vzZ162xdPIaq6tPQmIW68zGdo5PK1slf7M8xWs6eFtOXcv7-_Dp9mJeOct6VnipFnXREcqwoJWApVspVlcQgMNbMYekctVZxxciXXjm9AuKdlxK8rgQdo9td7ja2P71PndmE5Hxd28a3fTKQt1YKQMuMXv9D120fm_w7o4FJIammGYId5GKbUvSV2cawsfE3J5lBphlkmkGm2cvMMzf7YJuyjCpLcyEdBgljQLHGmbvaccF7f2hzwbDgkv4BV_J7EQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>914767393</pqid></control><display><type>article</type><title>Transient Noise Reduction Using Nonlocal Diffusion Filters</title><source>IEEE Electronic Library (IEL)</source><creator>Talmon, R ; Cohen, I ; Gannot, S</creator><creatorcontrib>Talmon, R ; Cohen, I ; Gannot, S</creatorcontrib><description>Enhancement of speech signals for hands-free communication systems has attracted significant research efforts in the last few decades. Still, many aspects and applications remain open and require further research. One of the important open problems is the single-channel transient noise reduction. In this paper, we present a novel approach for transient noise reduction that relies on non-local (NL) neighborhood filters. In particular, we propose an algorithm for the enhancement of a speech signal contaminated by repeating transient noise events. We assume that the time duration of each reoccurring transient event is relatively short compared to speech phonemes and model the speech source as an auto-regressive (AR) process. The proposed algorithm consists of two stages. In the first stage, we estimate the power spectral density (PSD) of the transient noise by employing a NL neighborhood filter. In the second stage, we utilize the optimally modified log spectral amplitude (OM-LSA) estimator for denoising the speech using the noise PSD estimate from the first stage. Based on a statistical model for the measurements and diffusion interpretation of NL filtering, we obtain further insight into the algorithm behavior. In particular, for given transient noise, we determine whether estimation of the noise PSD is feasible using our approach, how to properly set the algorithm parameters, and what is the expected performance of the algorithm. Experimental study shows good results in enhancing speech signals contaminated by transient noise, such as typical household noises, construction sounds, keyboard typing, and metronome clacks.</description><identifier>ISSN: 1558-7916</identifier><identifier>ISSN: 2329-9290</identifier><identifier>EISSN: 1558-7924</identifier><identifier>EISSN: 2329-9304</identifier><identifier>DOI: 10.1109/TASL.2010.2093651</identifier><identifier>CODEN: ITASD8</identifier><language>eng</language><publisher>Piscataway, NJ: IEEE</publisher><subject>Acoustic noise ; Algorithms ; Applied sciences ; Detection, estimation, filtering, equalization, prediction ; Diffusion ; Estimates ; Exact sciences and technology ; impulse noise ; Information, signal and communications theory ; Kernel ; Mathematical models ; Noise ; Noise reduction ; Signal and communications theory ; Signal processing ; Signal, noise ; Spectra ; Speech ; Speech enhancement ; Speech processing ; Studies ; Telecommunications and information theory ; Transient analysis ; transient noise</subject><ispartof>IEEE transactions on audio, speech, and language processing, 2011-08, Vol.19 (6), p.1584-1599</ispartof><rights>2015 INIST-CNRS</rights><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) Aug 2011</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c355t-e3883c7c275083321a3088cff70160094c07cc3aa85842b9dc9d12ece771e9f63</citedby><cites>FETCH-LOGICAL-c355t-e3883c7c275083321a3088cff70160094c07cc3aa85842b9dc9d12ece771e9f63</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/5640657$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,792,27901,27902,54733</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/5640657$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=24413090$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Talmon, R</creatorcontrib><creatorcontrib>Cohen, I</creatorcontrib><creatorcontrib>Gannot, S</creatorcontrib><title>Transient Noise Reduction Using Nonlocal Diffusion Filters</title><title>IEEE transactions on audio, speech, and language processing</title><addtitle>TASL</addtitle><description>Enhancement of speech signals for hands-free communication systems has attracted significant research efforts in the last few decades. Still, many aspects and applications remain open and require further research. One of the important open problems is the single-channel transient noise reduction. In this paper, we present a novel approach for transient noise reduction that relies on non-local (NL) neighborhood filters. In particular, we propose an algorithm for the enhancement of a speech signal contaminated by repeating transient noise events. We assume that the time duration of each reoccurring transient event is relatively short compared to speech phonemes and model the speech source as an auto-regressive (AR) process. The proposed algorithm consists of two stages. In the first stage, we estimate the power spectral density (PSD) of the transient noise by employing a NL neighborhood filter. In the second stage, we utilize the optimally modified log spectral amplitude (OM-LSA) estimator for denoising the speech using the noise PSD estimate from the first stage. Based on a statistical model for the measurements and diffusion interpretation of NL filtering, we obtain further insight into the algorithm behavior. In particular, for given transient noise, we determine whether estimation of the noise PSD is feasible using our approach, how to properly set the algorithm parameters, and what is the expected performance of the algorithm. Experimental study shows good results in enhancing speech signals contaminated by transient noise, such as typical household noises, construction sounds, keyboard typing, and metronome clacks.</description><subject>Acoustic noise</subject><subject>Algorithms</subject><subject>Applied sciences</subject><subject>Detection, estimation, filtering, equalization, prediction</subject><subject>Diffusion</subject><subject>Estimates</subject><subject>Exact sciences and technology</subject><subject>impulse noise</subject><subject>Information, signal and communications theory</subject><subject>Kernel</subject><subject>Mathematical models</subject><subject>Noise</subject><subject>Noise reduction</subject><subject>Signal and communications theory</subject><subject>Signal processing</subject><subject>Signal, noise</subject><subject>Spectra</subject><subject>Speech</subject><subject>Speech enhancement</subject><subject>Speech processing</subject><subject>Studies</subject><subject>Telecommunications and information theory</subject><subject>Transient analysis</subject><subject>transient noise</subject><issn>1558-7916</issn><issn>2329-9290</issn><issn>1558-7924</issn><issn>2329-9304</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2011</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpdkE1LAzEQhoMoWKs_QLwsguBlaybf8SbVqlAUtD2HmGYlZbtbk92D_94sLT14yiTzzEvmQegS8AQA67vFw-d8QnC-Eqyp4HCERsC5KqUm7PhQgzhFZymtMWZUMBih-0W0TQq-6Yq3NiRffPhV77rQNsUyheY7vzZ162xdPIaq6tPQmIW68zGdo5PK1slf7M8xWs6eFtOXcv7-_Dp9mJeOct6VnipFnXREcqwoJWApVspVlcQgMNbMYekctVZxxciXXjm9AuKdlxK8rgQdo9td7ja2P71PndmE5Hxd28a3fTKQt1YKQMuMXv9D120fm_w7o4FJIammGYId5GKbUvSV2cawsfE3J5lBphlkmkGm2cvMMzf7YJuyjCpLcyEdBgljQLHGmbvaccF7f2hzwbDgkv4BV_J7EQ</recordid><startdate>20110801</startdate><enddate>20110801</enddate><creator>Talmon, R</creator><creator>Cohen, I</creator><creator>Gannot, S</creator><general>IEEE</general><general>Institute of Electrical and Electronics Engineers</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20110801</creationdate><title>Transient Noise Reduction Using Nonlocal Diffusion Filters</title><author>Talmon, R ; Cohen, I ; Gannot, S</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c355t-e3883c7c275083321a3088cff70160094c07cc3aa85842b9dc9d12ece771e9f63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2011</creationdate><topic>Acoustic noise</topic><topic>Algorithms</topic><topic>Applied sciences</topic><topic>Detection, estimation, filtering, equalization, prediction</topic><topic>Diffusion</topic><topic>Estimates</topic><topic>Exact sciences and technology</topic><topic>impulse noise</topic><topic>Information, signal and communications theory</topic><topic>Kernel</topic><topic>Mathematical models</topic><topic>Noise</topic><topic>Noise reduction</topic><topic>Signal and communications theory</topic><topic>Signal processing</topic><topic>Signal, noise</topic><topic>Spectra</topic><topic>Speech</topic><topic>Speech enhancement</topic><topic>Speech processing</topic><topic>Studies</topic><topic>Telecommunications and information theory</topic><topic>Transient analysis</topic><topic>transient noise</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Talmon, R</creatorcontrib><creatorcontrib>Cohen, I</creatorcontrib><creatorcontrib>Gannot, S</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on audio, speech, and language processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Talmon, R</au><au>Cohen, I</au><au>Gannot, S</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Transient Noise Reduction Using Nonlocal Diffusion Filters</atitle><jtitle>IEEE transactions on audio, speech, and language processing</jtitle><stitle>TASL</stitle><date>2011-08-01</date><risdate>2011</risdate><volume>19</volume><issue>6</issue><spage>1584</spage><epage>1599</epage><pages>1584-1599</pages><issn>1558-7916</issn><issn>2329-9290</issn><eissn>1558-7924</eissn><eissn>2329-9304</eissn><coden>ITASD8</coden><abstract>Enhancement of speech signals for hands-free communication systems has attracted significant research efforts in the last few decades. Still, many aspects and applications remain open and require further research. One of the important open problems is the single-channel transient noise reduction. In this paper, we present a novel approach for transient noise reduction that relies on non-local (NL) neighborhood filters. In particular, we propose an algorithm for the enhancement of a speech signal contaminated by repeating transient noise events. We assume that the time duration of each reoccurring transient event is relatively short compared to speech phonemes and model the speech source as an auto-regressive (AR) process. The proposed algorithm consists of two stages. In the first stage, we estimate the power spectral density (PSD) of the transient noise by employing a NL neighborhood filter. In the second stage, we utilize the optimally modified log spectral amplitude (OM-LSA) estimator for denoising the speech using the noise PSD estimate from the first stage. Based on a statistical model for the measurements and diffusion interpretation of NL filtering, we obtain further insight into the algorithm behavior. In particular, for given transient noise, we determine whether estimation of the noise PSD is feasible using our approach, how to properly set the algorithm parameters, and what is the expected performance of the algorithm. Experimental study shows good results in enhancing speech signals contaminated by transient noise, such as typical household noises, construction sounds, keyboard typing, and metronome clacks.</abstract><cop>Piscataway, NJ</cop><pub>IEEE</pub><doi>10.1109/TASL.2010.2093651</doi><tpages>16</tpages></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 1558-7916
ispartof	IEEE transactions on audio, speech, and language processing, 2011-08, Vol.19 (6), p.1584-1599
issn	1558-7916 2329-9290 1558-7924 2329-9304
language	eng
recordid	cdi_ieee_primary_5640657
source	IEEE Electronic Library (IEL)
subjects	Acoustic noise Algorithms Applied sciences Detection, estimation, filtering, equalization, prediction Diffusion Estimates Exact sciences and technology impulse noise Information, signal and communications theory Kernel Mathematical models Noise Noise reduction Signal and communications theory Signal processing Signal, noise Spectra Speech Speech enhancement Speech processing Studies Telecommunications and information theory Transient analysis transient noise
title	Transient Noise Reduction Using Nonlocal Diffusion Filters
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-01T04%3A35%3A54IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Transient%20Noise%20Reduction%20Using%20Nonlocal%20Diffusion%20Filters&rft.jtitle=IEEE%20transactions%20on%20audio,%20speech,%20and%20language%20processing&rft.au=Talmon,%20R&rft.date=2011-08-01&rft.volume=19&rft.issue=6&rft.spage=1584&rft.epage=1599&rft.pages=1584-1599&rft.issn=1558-7916&rft.eissn=1558-7924&rft.coden=ITASD8&rft_id=info:doi/10.1109/TASL.2010.2093651&rft_dat=%3Cproquest_RIE%3E1010881197%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=914767393&rft_id=info:pmid/&rft_ieee_id=5640657&rfr_iscdi=true