Comparative Analysis of Generalized Sidelobe Cancellation and Multi-Channel Linear Prediction for Speech Dereverberation and Noise Reduction
© 2014 IEEE. For blind speech dereverberation, two frameworks are commonly used: On the one hand, the multi-channel linear prediction (MCLP) framework, and on the other hand, data-dependent beamforming, e.g., the generalized sidelobe canceler (GSC) framework. The MCLP framework is designed to perfor...
Gespeichert in:
Veröffentlicht in: | IEEE-ACM Transactions on Audio Speech and Language Processing 2019-03, Vol.27 (3), p.544-558 |
---|---|
Hauptverfasser: | , , , , , |
Format: | Artikel |
Sprache: | eng |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 558 |
---|---|
container_issue | 3 |
container_start_page | 544 |
container_title | IEEE-ACM Transactions on Audio Speech and Language Processing |
container_volume | 27 |
creator | Dietzen, Thomas Spriet, A Tirry, W Doclo, S Moonen, M van Waterschoot, T |
description | © 2014 IEEE. For blind speech dereverberation, two frameworks are commonly used: On the one hand, the multi-channel linear prediction (MCLP) framework, and on the other hand, data-dependent beamforming, e.g., the generalized sidelobe canceler (GSC) framework. The MCLP framework is designed to perform deconvolution and hence has gained increased prominence in blind speech dereverberation. The GSC framework is commonly used for noise reduction, but may be applied for dereverberation as well. In previous work, we have shown that for the noiseless case, MCLP and the GSC yield in theory mathematically equivalent results in terms of dereverberation. In this paper, we assume additional coherent as well as incoherent-noise components and formally analyze and compare both frameworks in terms of dereverberation and noise reduction performance. Both the theoretical analysis and time domain simulation results demonstrate that unlike the GSC, MCLP expectably shows limited performance in terms of noise reduction, while both perform equally well in terms of dereverberation, provided that the GSC blocking matrix achieves complete blocking of the early reverberant-speech component and sufficiently many microphones are available. In case of incomplete blocking, however, the GSC performs inferior to MCLP in terms of dereverberation, as shown in short-time Fourier transform domain simulations. |
format | Article |
fullrecord | <record><control><sourceid>kuleuven</sourceid><recordid>TN_cdi_kuleuven_dspace_123456789_631772</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>123456789_631772</sourcerecordid><originalsourceid>FETCH-kuleuven_dspace_123456789_6317723</originalsourceid><addsrcrecordid>eNqVy71OwzAUhuEMILWC3sPZGFAkx25rPKLwNwBClD06jb-oBmNHdhIB18BFgyokVpje5XkPirlU0pRGGjErFjk_CyEqoY3Ry3nxWcfXnhMPbgKdB_bv2WWKHV0jILF3H7C0cRY-bkE1hxbef-sYiIOlu9EPrqx3HAI83boATvSQYF27N11MtOmBdkcXSJiQtki_-310GfQIO-75cXHYsc9Y_PSoOLm6fKpvypfRY5wQGpt7btFUUi1Xa31mmrWqtJbqP_L0b7IZ3gb1Bc5HZAc</addsrcrecordid><sourcetype>Institutional Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Comparative Analysis of Generalized Sidelobe Cancellation and Multi-Channel Linear Prediction for Speech Dereverberation and Noise Reduction</title><source>Lirias (KU Leuven Association)</source><source>ACM Digital Library Complete</source><source>IEEE Electronic Library (IEL)</source><creator>Dietzen, Thomas ; Spriet, A ; Tirry, W ; Doclo, S ; Moonen, M ; van Waterschoot, T</creator><creatorcontrib>Dietzen, Thomas ; Spriet, A ; Tirry, W ; Doclo, S ; Moonen, M ; van Waterschoot, T</creatorcontrib><description>© 2014 IEEE. For blind speech dereverberation, two frameworks are commonly used: On the one hand, the multi-channel linear prediction (MCLP) framework, and on the other hand, data-dependent beamforming, e.g., the generalized sidelobe canceler (GSC) framework. The MCLP framework is designed to perform deconvolution and hence has gained increased prominence in blind speech dereverberation. The GSC framework is commonly used for noise reduction, but may be applied for dereverberation as well. In previous work, we have shown that for the noiseless case, MCLP and the GSC yield in theory mathematically equivalent results in terms of dereverberation. In this paper, we assume additional coherent as well as incoherent-noise components and formally analyze and compare both frameworks in terms of dereverberation and noise reduction performance. Both the theoretical analysis and time domain simulation results demonstrate that unlike the GSC, MCLP expectably shows limited performance in terms of noise reduction, while both perform equally well in terms of dereverberation, provided that the GSC blocking matrix achieves complete blocking of the early reverberant-speech component and sufficiently many microphones are available. In case of incomplete blocking, however, the GSC performs inferior to MCLP in terms of dereverberation, as shown in short-time Fourier transform domain simulations.</description><identifier>ISSN: 2329-9290</identifier><language>eng</language><publisher>Institute of Electrical and Electronics Engineers</publisher><ispartof>IEEE-ACM Transactions on Audio Speech and Language Processing, 2019-03, Vol.27 (3), p.544-558</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,315,780,784,27860</link.rule.ids></links><search><creatorcontrib>Dietzen, Thomas</creatorcontrib><creatorcontrib>Spriet, A</creatorcontrib><creatorcontrib>Tirry, W</creatorcontrib><creatorcontrib>Doclo, S</creatorcontrib><creatorcontrib>Moonen, M</creatorcontrib><creatorcontrib>van Waterschoot, T</creatorcontrib><title>Comparative Analysis of Generalized Sidelobe Cancellation and Multi-Channel Linear Prediction for Speech Dereverberation and Noise Reduction</title><title>IEEE-ACM Transactions on Audio Speech and Language Processing</title><description>© 2014 IEEE. For blind speech dereverberation, two frameworks are commonly used: On the one hand, the multi-channel linear prediction (MCLP) framework, and on the other hand, data-dependent beamforming, e.g., the generalized sidelobe canceler (GSC) framework. The MCLP framework is designed to perform deconvolution and hence has gained increased prominence in blind speech dereverberation. The GSC framework is commonly used for noise reduction, but may be applied for dereverberation as well. In previous work, we have shown that for the noiseless case, MCLP and the GSC yield in theory mathematically equivalent results in terms of dereverberation. In this paper, we assume additional coherent as well as incoherent-noise components and formally analyze and compare both frameworks in terms of dereverberation and noise reduction performance. Both the theoretical analysis and time domain simulation results demonstrate that unlike the GSC, MCLP expectably shows limited performance in terms of noise reduction, while both perform equally well in terms of dereverberation, provided that the GSC blocking matrix achieves complete blocking of the early reverberant-speech component and sufficiently many microphones are available. In case of incomplete blocking, however, the GSC performs inferior to MCLP in terms of dereverberation, as shown in short-time Fourier transform domain simulations.</description><issn>2329-9290</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>FZOIL</sourceid><recordid>eNqVy71OwzAUhuEMILWC3sPZGFAkx25rPKLwNwBClD06jb-oBmNHdhIB18BFgyokVpje5XkPirlU0pRGGjErFjk_CyEqoY3Ry3nxWcfXnhMPbgKdB_bv2WWKHV0jILF3H7C0cRY-bkE1hxbef-sYiIOlu9EPrqx3HAI83boATvSQYF27N11MtOmBdkcXSJiQtki_-310GfQIO-75cXHYsc9Y_PSoOLm6fKpvypfRY5wQGpt7btFUUi1Xa31mmrWqtJbqP_L0b7IZ3gb1Bc5HZAc</recordid><startdate>201903</startdate><enddate>201903</enddate><creator>Dietzen, Thomas</creator><creator>Spriet, A</creator><creator>Tirry, W</creator><creator>Doclo, S</creator><creator>Moonen, M</creator><creator>van Waterschoot, T</creator><general>Institute of Electrical and Electronics Engineers</general><scope>FZOIL</scope></search><sort><creationdate>201903</creationdate><title>Comparative Analysis of Generalized Sidelobe Cancellation and Multi-Channel Linear Prediction for Speech Dereverberation and Noise Reduction</title><author>Dietzen, Thomas ; Spriet, A ; Tirry, W ; Doclo, S ; Moonen, M ; van Waterschoot, T</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-kuleuven_dspace_123456789_6317723</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Dietzen, Thomas</creatorcontrib><creatorcontrib>Spriet, A</creatorcontrib><creatorcontrib>Tirry, W</creatorcontrib><creatorcontrib>Doclo, S</creatorcontrib><creatorcontrib>Moonen, M</creatorcontrib><creatorcontrib>van Waterschoot, T</creatorcontrib><collection>Lirias (KU Leuven Association)</collection><jtitle>IEEE-ACM Transactions on Audio Speech and Language Processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Dietzen, Thomas</au><au>Spriet, A</au><au>Tirry, W</au><au>Doclo, S</au><au>Moonen, M</au><au>van Waterschoot, T</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Comparative Analysis of Generalized Sidelobe Cancellation and Multi-Channel Linear Prediction for Speech Dereverberation and Noise Reduction</atitle><jtitle>IEEE-ACM Transactions on Audio Speech and Language Processing</jtitle><date>2019-03</date><risdate>2019</risdate><volume>27</volume><issue>3</issue><spage>544</spage><epage>558</epage><pages>544-558</pages><issn>2329-9290</issn><abstract>© 2014 IEEE. For blind speech dereverberation, two frameworks are commonly used: On the one hand, the multi-channel linear prediction (MCLP) framework, and on the other hand, data-dependent beamforming, e.g., the generalized sidelobe canceler (GSC) framework. The MCLP framework is designed to perform deconvolution and hence has gained increased prominence in blind speech dereverberation. The GSC framework is commonly used for noise reduction, but may be applied for dereverberation as well. In previous work, we have shown that for the noiseless case, MCLP and the GSC yield in theory mathematically equivalent results in terms of dereverberation. In this paper, we assume additional coherent as well as incoherent-noise components and formally analyze and compare both frameworks in terms of dereverberation and noise reduction performance. Both the theoretical analysis and time domain simulation results demonstrate that unlike the GSC, MCLP expectably shows limited performance in terms of noise reduction, while both perform equally well in terms of dereverberation, provided that the GSC blocking matrix achieves complete blocking of the early reverberant-speech component and sufficiently many microphones are available. In case of incomplete blocking, however, the GSC performs inferior to MCLP in terms of dereverberation, as shown in short-time Fourier transform domain simulations.</abstract><pub>Institute of Electrical and Electronics Engineers</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2329-9290 |
ispartof | IEEE-ACM Transactions on Audio Speech and Language Processing, 2019-03, Vol.27 (3), p.544-558 |
issn | 2329-9290 |
language | eng |
recordid | cdi_kuleuven_dspace_123456789_631772 |
source | Lirias (KU Leuven Association); ACM Digital Library Complete; IEEE Electronic Library (IEL) |
title | Comparative Analysis of Generalized Sidelobe Cancellation and Multi-Channel Linear Prediction for Speech Dereverberation and Noise Reduction |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-05T07%3A26%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-kuleuven&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Comparative%20Analysis%20of%20Generalized%20Sidelobe%20Cancellation%20and%20Multi-Channel%20Linear%20Prediction%20for%20Speech%20Dereverberation%20and%20Noise%20Reduction&rft.jtitle=IEEE-ACM%20Transactions%20on%20Audio%20Speech%20and%20Language%20Processing&rft.au=Dietzen,%20Thomas&rft.date=2019-03&rft.volume=27&rft.issue=3&rft.spage=544&rft.epage=558&rft.pages=544-558&rft.issn=2329-9290&rft_id=info:doi/&rft_dat=%3Ckuleuven%3E123456789_631772%3C/kuleuven%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |