Integrated Sidelobe Cancellation and Linear Prediction Kalman Filter for Joint Multi-Microphone Speech Dereverberation, Interfering Speech Cancellation, and Noise Reduction

In multi-microphone speech enhancement, reverberation as well as additive noise and/or interfering speech are commonly suppressed by deconvolution and spatial filtering, e.g., using multi-channel linear prediction (MCLP) on the one hand and beamforming, e.g., a generalized sidelobe canceler (GSC), o...

Bibliographic details
Published in:	IEEE/ACM transactions on audio, speech, and language processing, 2020, Vol.28, p.740-754
Main authors:	Dietzen, Thomas ; Doclo, Simon ; Moonen, Marc ; van Waterschoot, Toon
Format:	Article
Language:	English
Subjects:	Algorithms ; Array signal processing ; Beamforming ; Cancellation ; Deconvolution ; Dereverberation ; interfering speech cancellation ; Kalman filter ; Kalman filters ; Linear prediction ; Microphones ; Microprocessors ; multi-channel linear prediction (MCLP) ; Noise control ; Noise reduction ; Sidelobes ; Spatial filtering ; Speech enhancement ; Speech processing
Online access:	Order full text

container_end_page	754
container_issue
container_start_page	740
container_title	IEEE/ACM transactions on audio, speech, and language processing
container_volume	28
creator	Dietzen, Thomas ; Doclo, Simon ; Moonen, Marc ; van Waterschoot, Toon
description	In multi-microphone speech enhancement, reverberation as well as additive noise and/or interfering speech are commonly suppressed by deconvolution and spatial filtering, e.g., using multi-channel linear prediction (MCLP) on the one hand and beamforming, e.g., a generalized sidelobe canceler (GSC), on the other hand. In this article, we consider several reverberant speech components, whereof some are to be dereverberated and others to be canceled, as well as a diffuse (e.g., babble) noise component to be suppressed. In order to perform both deconvolution and spatial filtering, we integrate MCLP and the GSC into a novel architecture referred to as integrated sidelobe cancellation and linear prediction (ISCLP), where the sidelobe-cancellation (SC) filter and the linear prediction (LP) filter operate in parallel, but on different microphone signal frames. Within ISCLP, we estimate both filters jointly by means of a single Kalman filter. We further propose a spectral Wiener gain post-processor, which is shown to relate to the Kalman filter's posterior state estimate. The presented ISCLP Kalman filter is benchmarked against two state-of-the-art approaches, namely first a pair of alternating Kalman filters respectively performing dereverberation and noise reduction, and second an MCLP+GSC Kalman filter cascade. While the ISCLP Kalman filter is roughly M^2 times less expensive than both reference algorithms, where M denotes the number of microphones, it is shown to perform at least similarly as compared to the former, and to outperform the latter. A MATLAB implementation is available.
doi_str_mv	10.1109/TASLP.2020.2966869
format	Article
fullrecord	<record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_crossref_primary_10_1109_TASLP_2020_2966869</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>8960444</ieee_id><sourcerecordid>2352194183</sourcerecordid><originalsourceid>FETCH-LOGICAL-c295t-5ce543cb8503ab52c83f9dc38af33f849593456a83a22ce1032fc1b43ebdec033</originalsourceid><addsrcrecordid>eNpNkd1OwkAQhRujiQR5Ab3ZxFuK-9fSvSQoihYlgtfNdjuFJWUXt62J7-RDSgsYr2YyOed8mRzPuyZ4QAgWd8vRIp4PKKZ4QEUYRqE48zqUUeELhvn5aacCX3q9stxgjAkeCjHkHe9naipYOVlBhhY6g8KmgMbSKCgKWWlrkDQZirUB6dDcQaZVe32RxVYaNNFFBQ7l1qFnq02FZnVRaX-mlbO7tTWAFjsAtUb34OALXAquTe2jhutycNqsTpr_2H7LfbW6BPQOWd1Sr7yLXBYl9I6z631MHpbjJz9-e5yOR7GvqAgqP1AQcKbSKMBMpgFVEctFplgkc8byiItAMB6EMmKSUgUEM5orknIGaQYKM9b1bg-5O2c_ayirZGNrZ_bIhLKAEsFJ1KjoQbX_tSwd5MnO6a103wnBSVNM0haTNMUkx2L2ppuDSQPAnyESIeacs1-6PIz1</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2352194183</pqid></control><display><type>article</type><title>Integrated Sidelobe Cancellation and Linear Prediction Kalman Filter for Joint Multi-Microphone Speech Dereverberation, Interfering Speech Cancellation, and Noise Reduction</title><source>IEEE Electronic Library (IEL)</source><creator>Dietzen, Thomas ; Doclo, Simon ; Moonen, Marc ; van Waterschoot, Toon</creator><creatorcontrib>Dietzen, Thomas ; Doclo, Simon ; Moonen, Marc ; van Waterschoot, Toon</creatorcontrib><description>In multi-microphone speech enhancement, reverberation as well as additive noise and/or interfering speech are commonly suppressed by deconvolution and spatial filtering, e.g., using multi-channel linear prediction (MCLP) on the one hand and beamforming, e.g., a generalized sidelobe canceler (GSC), on the other hand. In this article, we consider several reverberant speech components, whereof some are to be dereverberated and others to be canceled, as well as a diffuse (e.g., babble) noise component to be suppressed. 
In order to perform both deconvolution and spatial filtering, we integrate MCLP and the GSC into a novel architecture referred to as integrated sidelobe cancellation and linear prediction (ISCLP), where the sidelobe-cancellation (SC) filter and the linear prediction (LP) filter operate in parallel, but on different microphone signal frames. Within ISCLP, we estimate both filters jointly by means of a single Kalman filter. We further propose a spectral Wiener gain post-processor, which is shown to relate to the Kalman filter's posterior state estimate. The presented ISCLP Kalman filter is benchmarked against two state-of-the-art approaches, namely first a pair of alternating Kalman filters respectively performing dereverberation and noise reduction, and second an MCLP+GSC Kalman filter cascade. While the ISCLP Kalman filter is roughly M^2 times less expensive than both reference algorithms, where M denotes the number of microphones, it is shown to perform at least similarly as compared to the former, and to outperform the latter. A MATLAB implementation is available.</description><identifier>ISSN: 2329-9290</identifier><identifier>EISSN: 2329-9304</identifier><identifier>DOI: 10.1109/TASLP.2020.2966869</identifier><identifier>CODEN: ITASD8</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Algorithms ; Array signal processing ; Beamforming ; Cancellation ; Deconvolution ; Dereverberation ; interfering speech cancellation ; Kalman filter ; Kalman filters ; Linear prediction ; Microphones ; Microprocessors ; multi-channel linear prediction (MCLP) ; Noise control ; Noise reduction ; Sidelobes ; Spatial filtering ; Speech enhancement ; Speech processing</subject><ispartof>IEEE/ACM transactions on audio, speech, and language processing, 2020, Vol.28, p.740-754</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. 
(IEEE) 2020</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c295t-5ce543cb8503ab52c83f9dc38af33f849593456a83a22ce1032fc1b43ebdec033</citedby><cites>FETCH-LOGICAL-c295t-5ce543cb8503ab52c83f9dc38af33f849593456a83a22ce1032fc1b43ebdec033</cites><orcidid>0000-0002-8924-4962 ; 0000-0002-3392-2381 ; 0000-0003-4461-0073</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/8960444$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,777,781,793,4010,27904,27905,27906,54739</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/8960444$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Dietzen, Thomas</creatorcontrib><creatorcontrib>Doclo, Simon</creatorcontrib><creatorcontrib>Moonen, Marc</creatorcontrib><creatorcontrib>van Waterschoot, Toon</creatorcontrib><title>Integrated Sidelobe Cancellation and Linear Prediction Kalman Filter for Joint Multi-Microphone Speech Dereverberation, Interfering Speech Cancellation, and Noise Reduction</title><title>IEEE/ACM transactions on audio, speech, and language processing</title><addtitle>TASLP</addtitle><description>In multi-microphone speech enhancement, reverberation as well as additive noise and/or interfering speech are commonly suppressed by deconvolution and spatial filtering, e.g., using multi-channel linear prediction (MCLP) on the one hand and beamforming, e.g., a generalized sidelobe canceler (GSC), on the other hand. In this article, we consider several reverberant speech components, whereof some are to be dereverberated and others to be canceled, as well as a diffuse (e.g., babble) noise component to be suppressed. 
In order to perform both deconvolution and spatial filtering, we integrate MCLP and the GSC into a novel architecture referred to as integrated sidelobe cancellation and linear prediction (ISCLP), where the sidelobe-cancellation (SC) filter and the linear prediction (LP) filter operate in parallel, but on different microphone signal frames. Within ISCLP, we estimate both filters jointly by means of a single Kalman filter. We further propose a spectral Wiener gain post-processor, which is shown to relate to the Kalman filter's posterior state estimate. The presented ISCLP Kalman filter is benchmarked against two state-of-the-art approaches, namely first a pair of alternating Kalman filters respectively performing dereverberation and noise reduction, and second an MCLP+GSC Kalman filter cascade. While the ISCLP Kalman filter is roughly M^2 times less expensive than both reference algorithms, where M denotes the number of microphones, it is shown to perform at least similarly as compared to the former, and to outperform the latter. 
A MATLAB implementation is available.</description><subject>Algorithms</subject><subject>Array signal processing</subject><subject>Beamforming</subject><subject>Cancellation</subject><subject>Deconvolution</subject><subject>Dereverberation</subject><subject>interfering speech cancellation</subject><subject>Kalman filter</subject><subject>Kalman filters</subject><subject>Linear prediction</subject><subject>Microphones</subject><subject>Microprocessors</subject><subject>multi-channel linear prediction (MCLP)</subject><subject>Noise control</subject><subject>Noise reduction</subject><subject>Sidelobes</subject><subject>Spatial filtering</subject><subject>Speech enhancement</subject><subject>Speech processing</subject><issn>2329-9290</issn><issn>2329-9304</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpNkd1OwkAQhRujiQR5Ab3ZxFuK-9fSvSQoihYlgtfNdjuFJWUXt62J7-RDSgsYr2YyOed8mRzPuyZ4QAgWd8vRIp4PKKZ4QEUYRqE48zqUUeELhvn5aacCX3q9stxgjAkeCjHkHe9naipYOVlBhhY6g8KmgMbSKCgKWWlrkDQZirUB6dDcQaZVe32RxVYaNNFFBQ7l1qFnq02FZnVRaX-mlbO7tTWAFjsAtUb34OALXAquTe2jhutycNqsTpr_2H7LfbW6BPQOWd1Sr7yLXBYl9I6z631MHpbjJz9-e5yOR7GvqAgqP1AQcKbSKMBMpgFVEctFplgkc8byiItAMB6EMmKSUgUEM5orknIGaQYKM9b1bg-5O2c_ayirZGNrZ_bIhLKAEsFJ1KjoQbX_tSwd5MnO6a103wnBSVNM0haTNMUkx2L2ppuDSQPAnyESIeacs1-6PIz1</recordid><startdate>2020</startdate><enddate>2020</enddate><creator>Dietzen, Thomas</creator><creator>Doclo, Simon</creator><creator>Moonen, Marc</creator><creator>van Waterschoot, Toon</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. 
(IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-8924-4962</orcidid><orcidid>https://orcid.org/0000-0002-3392-2381</orcidid><orcidid>https://orcid.org/0000-0003-4461-0073</orcidid></search><sort><creationdate>2020</creationdate><title>Integrated Sidelobe Cancellation and Linear Prediction Kalman Filter for Joint Multi-Microphone Speech Dereverberation, Interfering Speech Cancellation, and Noise Reduction</title><author>Dietzen, Thomas ; Doclo, Simon ; Moonen, Marc ; van Waterschoot, Toon</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c295t-5ce543cb8503ab52c83f9dc38af33f849593456a83a22ce1032fc1b43ebdec033</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Array signal processing</topic><topic>Beamforming</topic><topic>Cancellation</topic><topic>Deconvolution</topic><topic>Dereverberation</topic><topic>interfering speech cancellation</topic><topic>Kalman filter</topic><topic>Kalman filters</topic><topic>Linear prediction</topic><topic>Microphones</topic><topic>Microprocessors</topic><topic>multi-channel linear prediction (MCLP)</topic><topic>Noise control</topic><topic>Noise reduction</topic><topic>Sidelobes</topic><topic>Spatial filtering</topic><topic>Speech enhancement</topic><topic>Speech processing</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Dietzen, Thomas</creatorcontrib><creatorcontrib>Doclo, Simon</creatorcontrib><creatorcontrib>Moonen, Marc</creatorcontrib><creatorcontrib>van Waterschoot, Toon</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 
1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE/ACM transactions on audio, speech, and language processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Dietzen, Thomas</au><au>Doclo, Simon</au><au>Moonen, Marc</au><au>van Waterschoot, Toon</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Integrated Sidelobe Cancellation and Linear Prediction Kalman Filter for Joint Multi-Microphone Speech Dereverberation, Interfering Speech Cancellation, and Noise Reduction</atitle><jtitle>IEEE/ACM transactions on audio, speech, and language processing</jtitle><stitle>TASLP</stitle><date>2020</date><risdate>2020</risdate><volume>28</volume><spage>740</spage><epage>754</epage><pages>740-754</pages><issn>2329-9290</issn><eissn>2329-9304</eissn><coden>ITASD8</coden><abstract>In multi-microphone speech enhancement, reverberation as well as additive noise and/or interfering speech are commonly suppressed by deconvolution and spatial filtering, e.g., using multi-channel linear prediction (MCLP) on the one hand and beamforming, e.g., a generalized sidelobe canceler (GSC), on the other hand. In this article, we consider several reverberant speech components, whereof some are to be dereverberated and others to be canceled, as well as a diffuse (e.g., babble) noise component to be suppressed. 
In order to perform both deconvolution and spatial filtering, we integrate MCLP and the GSC into a novel architecture referred to as integrated sidelobe cancellation and linear prediction (ISCLP), where the sidelobe-cancellation (SC) filter and the linear prediction (LP) filter operate in parallel, but on different microphone signal frames. Within ISCLP, we estimate both filters jointly by means of a single Kalman filter. We further propose a spectral Wiener gain post-processor, which is shown to relate to the Kalman filter's posterior state estimate. The presented ISCLP Kalman filter is benchmarked against two state-of-the-art approaches, namely first a pair of alternating Kalman filters respectively performing dereverberation and noise reduction, and second an MCLP+GSC Kalman filter cascade. While the ISCLP Kalman filter is roughly M^2 times less expensive than both reference algorithms, where M denotes the number of microphones, it is shown to perform at least similarly as compared to the former, and to outperform the latter. A MATLAB implementation is available.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/TASLP.2020.2966869</doi><tpages>15</tpages><orcidid>https://orcid.org/0000-0002-8924-4962</orcidid><orcidid>https://orcid.org/0000-0002-3392-2381</orcidid><orcidid>https://orcid.org/0000-0003-4461-0073</orcidid></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 2329-9290
ispartof	IEEE/ACM transactions on audio, speech, and language processing, 2020, Vol.28, p.740-754
issn	2329-9290 ; 2329-9304
language	eng
recordid	cdi_crossref_primary_10_1109_TASLP_2020_2966869
source	IEEE Electronic Library (IEL)
subjects	Algorithms ; Array signal processing ; Beamforming ; Cancellation ; Deconvolution ; Dereverberation ; interfering speech cancellation ; Kalman filter ; Kalman filters ; Linear prediction ; Microphones ; Microprocessors ; multi-channel linear prediction (MCLP) ; Noise control ; Noise reduction ; Sidelobes ; Spatial filtering ; Speech enhancement ; Speech processing
title	Integrated Sidelobe Cancellation and Linear Prediction Kalman Filter for Joint Multi-Microphone Speech Dereverberation, Interfering Speech Cancellation, and Noise Reduction
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-19T13%3A51%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Integrated%20Sidelobe%20Cancellation%20and%20Linear%20Prediction%20Kalman%20Filter%20for%20Joint%20Multi-Microphone%20Speech%20Dereverberation,%20Interfering%20Speech%20Cancellation,%20and%20Noise%20Reduction&rft.jtitle=IEEE/ACM%20transactions%20on%20audio,%20speech,%20and%20language%20processing&rft.au=Dietzen,%20Thomas&rft.date=2020&rft.volume=28&rft.spage=740&rft.epage=754&rft.pages=740-754&rft.issn=2329-9290&rft.eissn=2329-9304&rft.coden=ITASD8&rft_id=info:doi/10.1109/TASLP.2020.2966869&rft_dat=%3Cproquest_RIE%3E2352194183%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2352194183&rft_id=info:pmid/&rft_ieee_id=8960444&rfr_iscdi=true