Comparative Analysis of Generalized Sidelobe Cancellation and Multi-Channel Linear Prediction for Speech Dereverberation and Noise Reduction

© 2014 IEEE. For blind speech dereverberation, two frameworks are commonly used: On the one hand, the multi-channel linear prediction (MCLP) framework, and on the other hand, data-dependent beamforming, e.g., the generalized sidelobe canceler (GSC) framework. The MCLP framework is designed to perfor...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE-ACM Transactions on Audio Speech and Language Processing 2019-03, Vol.27 (3), p.544-558
Hauptverfasser: Dietzen, Thomas, Spriet, A, Tirry, W, Doclo, S, Moonen, M, van Waterschoot, T
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 558
container_issue 3
container_start_page 544
container_title IEEE-ACM Transactions on Audio Speech and Language Processing
container_volume 27
creator Dietzen, Thomas
Spriet, A
Tirry, W
Doclo, S
Moonen, M
van Waterschoot, T
description © 2014 IEEE. For blind speech dereverberation, two frameworks are commonly used: On the one hand, the multi-channel linear prediction (MCLP) framework, and on the other hand, data-dependent beamforming, e.g., the generalized sidelobe canceler (GSC) framework. The MCLP framework is designed to perform deconvolution and hence has gained increased prominence in blind speech dereverberation. The GSC framework is commonly used for noise reduction, but may be applied for dereverberation as well. In previous work, we have shown that for the noiseless case, MCLP and the GSC yield in theory mathematically equivalent results in terms of dereverberation. In this paper, we assume additional coherent as well as incoherent-noise components and formally analyze and compare both frameworks in terms of dereverberation and noise reduction performance. Both the theoretical analysis and time domain simulation results demonstrate that unlike the GSC, MCLP expectably shows limited performance in terms of noise reduction, while both perform equally well in terms of dereverberation, provided that the GSC blocking matrix achieves complete blocking of the early reverberant-speech component and sufficiently many microphones are available. In case of incomplete blocking, however, the GSC performs inferior to MCLP in terms of dereverberation, as shown in short-time Fourier transform domain simulations.
format Article
fullrecord <record><control><sourceid>kuleuven</sourceid><recordid>TN_cdi_kuleuven_dspace_123456789_631772</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>123456789_631772</sourcerecordid><originalsourceid>FETCH-kuleuven_dspace_123456789_6317723</originalsourceid><addsrcrecordid>eNqVy71OwzAUhuEMILWC3sPZGFAkx25rPKLwNwBClD06jb-oBmNHdhIB18BFgyokVpje5XkPirlU0pRGGjErFjk_CyEqoY3Ry3nxWcfXnhMPbgKdB_bv2WWKHV0jILF3H7C0cRY-bkE1hxbef-sYiIOlu9EPrqx3HAI83boATvSQYF27N11MtOmBdkcXSJiQtki_-310GfQIO-75cXHYsc9Y_PSoOLm6fKpvypfRY5wQGpt7btFUUi1Xa31mmrWqtJbqP_L0b7IZ3gb1Bc5HZAc</addsrcrecordid><sourcetype>Institutional Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Comparative Analysis of Generalized Sidelobe Cancellation and Multi-Channel Linear Prediction for Speech Dereverberation and Noise Reduction</title><source>Lirias (KU Leuven Association)</source><source>ACM Digital Library Complete</source><source>IEEE Electronic Library (IEL)</source><creator>Dietzen, Thomas ; Spriet, A ; Tirry, W ; Doclo, S ; Moonen, M ; van Waterschoot, T</creator><creatorcontrib>Dietzen, Thomas ; Spriet, A ; Tirry, W ; Doclo, S ; Moonen, M ; van Waterschoot, T</creatorcontrib><description>© 2014 IEEE. For blind speech dereverberation, two frameworks are commonly used: On the one hand, the multi-channel linear prediction (MCLP) framework, and on the other hand, data-dependent beamforming, e.g., the generalized sidelobe canceler (GSC) framework. The MCLP framework is designed to perform deconvolution and hence has gained increased prominence in blind speech dereverberation. The GSC framework is commonly used for noise reduction, but may be applied for dereverberation as well. In previous work, we have shown that for the noiseless case, MCLP and the GSC yield in theory mathematically equivalent results in terms of dereverberation. In this paper, we assume additional coherent as well as incoherent-noise components and formally analyze and compare both frameworks in terms of dereverberation and noise reduction performance. Both the theoretical analysis and time domain simulation results demonstrate that unlike the GSC, MCLP expectably shows limited performance in terms of noise reduction, while both perform equally well in terms of dereverberation, provided that the GSC blocking matrix achieves complete blocking of the early reverberant-speech component and sufficiently many microphones are available. In case of incomplete blocking, however, the GSC performs inferior to MCLP in terms of dereverberation, as shown in short-time Fourier transform domain simulations.</description><identifier>ISSN: 2329-9290</identifier><language>eng</language><publisher>Institute of Electrical and Electronics Engineers</publisher><ispartof>IEEE-ACM Transactions on Audio Speech and Language Processing, 2019-03, Vol.27 (3), p.544-558</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,315,780,784,27860</link.rule.ids></links><search><creatorcontrib>Dietzen, Thomas</creatorcontrib><creatorcontrib>Spriet, A</creatorcontrib><creatorcontrib>Tirry, W</creatorcontrib><creatorcontrib>Doclo, S</creatorcontrib><creatorcontrib>Moonen, M</creatorcontrib><creatorcontrib>van Waterschoot, T</creatorcontrib><title>Comparative Analysis of Generalized Sidelobe Cancellation and Multi-Channel Linear Prediction for Speech Dereverberation and Noise Reduction</title><title>IEEE-ACM Transactions on Audio Speech and Language Processing</title><description>© 2014 IEEE. For blind speech dereverberation, two frameworks are commonly used: On the one hand, the multi-channel linear prediction (MCLP) framework, and on the other hand, data-dependent beamforming, e.g., the generalized sidelobe canceler (GSC) framework. The MCLP framework is designed to perform deconvolution and hence has gained increased prominence in blind speech dereverberation. The GSC framework is commonly used for noise reduction, but may be applied for dereverberation as well. In previous work, we have shown that for the noiseless case, MCLP and the GSC yield in theory mathematically equivalent results in terms of dereverberation. In this paper, we assume additional coherent as well as incoherent-noise components and formally analyze and compare both frameworks in terms of dereverberation and noise reduction performance. Both the theoretical analysis and time domain simulation results demonstrate that unlike the GSC, MCLP expectably shows limited performance in terms of noise reduction, while both perform equally well in terms of dereverberation, provided that the GSC blocking matrix achieves complete blocking of the early reverberant-speech component and sufficiently many microphones are available. In case of incomplete blocking, however, the GSC performs inferior to MCLP in terms of dereverberation, as shown in short-time Fourier transform domain simulations.</description><issn>2329-9290</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>FZOIL</sourceid><recordid>eNqVy71OwzAUhuEMILWC3sPZGFAkx25rPKLwNwBClD06jb-oBmNHdhIB18BFgyokVpje5XkPirlU0pRGGjErFjk_CyEqoY3Ry3nxWcfXnhMPbgKdB_bv2WWKHV0jILF3H7C0cRY-bkE1hxbef-sYiIOlu9EPrqx3HAI83boATvSQYF27N11MtOmBdkcXSJiQtki_-310GfQIO-75cXHYsc9Y_PSoOLm6fKpvypfRY5wQGpt7btFUUi1Xa31mmrWqtJbqP_L0b7IZ3gb1Bc5HZAc</recordid><startdate>201903</startdate><enddate>201903</enddate><creator>Dietzen, Thomas</creator><creator>Spriet, A</creator><creator>Tirry, W</creator><creator>Doclo, S</creator><creator>Moonen, M</creator><creator>van Waterschoot, T</creator><general>Institute of Electrical and Electronics Engineers</general><scope>FZOIL</scope></search><sort><creationdate>201903</creationdate><title>Comparative Analysis of Generalized Sidelobe Cancellation and Multi-Channel Linear Prediction for Speech Dereverberation and Noise Reduction</title><author>Dietzen, Thomas ; Spriet, A ; Tirry, W ; Doclo, S ; Moonen, M ; van Waterschoot, T</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-kuleuven_dspace_123456789_6317723</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Dietzen, Thomas</creatorcontrib><creatorcontrib>Spriet, A</creatorcontrib><creatorcontrib>Tirry, W</creatorcontrib><creatorcontrib>Doclo, S</creatorcontrib><creatorcontrib>Moonen, M</creatorcontrib><creatorcontrib>van Waterschoot, T</creatorcontrib><collection>Lirias (KU Leuven Association)</collection><jtitle>IEEE-ACM Transactions on Audio Speech and Language Processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Dietzen, Thomas</au><au>Spriet, A</au><au>Tirry, W</au><au>Doclo, S</au><au>Moonen, M</au><au>van Waterschoot, T</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Comparative Analysis of Generalized Sidelobe Cancellation and Multi-Channel Linear Prediction for Speech Dereverberation and Noise Reduction</atitle><jtitle>IEEE-ACM Transactions on Audio Speech and Language Processing</jtitle><date>2019-03</date><risdate>2019</risdate><volume>27</volume><issue>3</issue><spage>544</spage><epage>558</epage><pages>544-558</pages><issn>2329-9290</issn><abstract>© 2014 IEEE. For blind speech dereverberation, two frameworks are commonly used: On the one hand, the multi-channel linear prediction (MCLP) framework, and on the other hand, data-dependent beamforming, e.g., the generalized sidelobe canceler (GSC) framework. The MCLP framework is designed to perform deconvolution and hence has gained increased prominence in blind speech dereverberation. The GSC framework is commonly used for noise reduction, but may be applied for dereverberation as well. In previous work, we have shown that for the noiseless case, MCLP and the GSC yield in theory mathematically equivalent results in terms of dereverberation. In this paper, we assume additional coherent as well as incoherent-noise components and formally analyze and compare both frameworks in terms of dereverberation and noise reduction performance. Both the theoretical analysis and time domain simulation results demonstrate that unlike the GSC, MCLP expectably shows limited performance in terms of noise reduction, while both perform equally well in terms of dereverberation, provided that the GSC blocking matrix achieves complete blocking of the early reverberant-speech component and sufficiently many microphones are available. In case of incomplete blocking, however, the GSC performs inferior to MCLP in terms of dereverberation, as shown in short-time Fourier transform domain simulations.</abstract><pub>Institute of Electrical and Electronics Engineers</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2329-9290
ispartof IEEE-ACM Transactions on Audio Speech and Language Processing, 2019-03, Vol.27 (3), p.544-558
issn 2329-9290
language eng
recordid cdi_kuleuven_dspace_123456789_631772
source Lirias (KU Leuven Association); ACM Digital Library Complete; IEEE Electronic Library (IEL)
title Comparative Analysis of Generalized Sidelobe Cancellation and Multi-Channel Linear Prediction for Speech Dereverberation and Noise Reduction
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-05T07%3A26%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-kuleuven&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Comparative%20Analysis%20of%20Generalized%20Sidelobe%20Cancellation%20and%20Multi-Channel%20Linear%20Prediction%20for%20Speech%20Dereverberation%20and%20Noise%20Reduction&rft.jtitle=IEEE-ACM%20Transactions%20on%20Audio%20Speech%20and%20Language%20Processing&rft.au=Dietzen,%20Thomas&rft.date=2019-03&rft.volume=27&rft.issue=3&rft.spage=544&rft.epage=558&rft.pages=544-558&rft.issn=2329-9290&rft_id=info:doi/&rft_dat=%3Ckuleuven%3E123456789_631772%3C/kuleuven%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true