Radiated Noise Suppression for Electrolarynx Speech Based on Multiband Time-Domain Amplitude Modulation

Radiated noise severely degrades the electrolarynx (EL) speech. It cannot be thoroughly suppressed by conventional frequency-domain enhancement methods. In this paper, a new method, called multiband time-domain amplitude modulation (MTAM), is proposed to reduce the radiated noise of EL speech. In th...

Detailed description

Bibliographic details
Published in: IEEE/ACM transactions on audio, speech, and language processing, 2018-09, Vol.26 (9), p.1585-1593
Main authors: Xiao, Ke; Wang, Supin; Wan, Mingxi; Wu, Liang
Format: Article
Language: eng
Subjects:
Online access: Order full text
container_end_page 1593
container_issue 9
container_start_page 1585
container_title IEEE/ACM transactions on audio, speech, and language processing
container_volume 26
creator Xiao, Ke
Wang, Supin
Wan, Mingxi
Wu, Liang
description Radiated noise severely degrades the electrolarynx (EL) speech. It cannot be thoroughly suppressed by conventional frequency-domain enhancement methods. In this paper, a new method, called multiband time-domain amplitude modulation (MTAM), is proposed to reduce the radiated noise of EL speech. In the proposed method, the speech components changing slowly that represent the radiated noise are removed by directly modulating the time-domain amplitudes in multiple frequency bands. The EL speech enhanced by the proposed MTAM and the conventional frequency-domain enhancement methods (spectral subtraction and Wiener filtering) are evaluated on both acoustic and perceptual characteristics. The acoustic analysis reveals that the MTAM not only can reduce the radiated noise more thoroughly but can also easily control the residual noise intensity by adjusting a modulation parameter λ. Moreover, the MTAM can avoid causing new artificial noise that cannot be avoided by the conventional frequency-domain enhancement methods. The perceptual analysis indicates that the MTAM also have better performance on increasing the acceptability and the consonant intelligibility of EL speech than spectral subtraction and Wiener filtering. These findings validate that the MTAM indeed works well in suppressing the radiated noise of EL speech and avoiding the artificial noise.
doi_str_mv 10.1109/TASLP.2018.2834729
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_journals_2048170335</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>8356591</ieee_id><sourcerecordid>2048170335</sourcerecordid><originalsourceid>FETCH-LOGICAL-c361t-9fa09b766210ccc88e0a96359ca3418e4300f02c95004a97dfc9d829dbfe77af3</originalsourceid><addsrcrecordid>eNo9kMtOwzAQRSMEElXpD8DGEuuUsZ2HvSzlKbWAaFlHrj0BV0kc7ESCvyelhdXM4p47mhNF5xSmlIK8Ws9Wi5cpAyqmTPAkZ_IoGjHOZCw5JMd_O5NwGk1C2AIAhVzKPBlF76_KWNWhIU_OBiSrvm09hmBdQ0rnyW2FuvOuUv67-SKrFlF_kGsVBmBILPuqsxvVGLK2NcY3rla2IbO6rWzXGyRLZ_pKdUPZWXRSqirg5DDH0dvd7Xr-EC-e7x_ns0WseUa7WJYK5CbPMkZBay0EgpIZT6VWPKECEw5QAtMyBUiUzE2ppRFMmk2Jea5KPo4u972td589hq7Yut43w8mCQSJoDpynQ4rtU9q7EDyWRettPfxYUCh2Totfp8XOaXFwOkAXe8gi4j8geJqlkvIf1i1zcA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2048170335</pqid></control><display><type>article</type><title>Radiated Noise Suppression for Electrolarynx Speech Based on Multiband Time-Domain Amplitude Modulation</title><source>IEEE Electronic Library (IEL)</source><creator>Xiao, Ke ; Wang, Supin ; Wan, Mingxi ; Wu, Liang</creator><creatorcontrib>Xiao, Ke ; Wang, Supin ; Wan, Mingxi ; Wu, Liang</creatorcontrib><description>Radiated noise severely degrades the electrolarynx (EL) speech. It cannot be thoroughly suppressed by conventional frequency-domain enhancement methods. In this paper, a new method, called multiband time-domain amplitude modulation (MTAM), is proposed to reduce the radiated noise of EL speech. In the proposed method, the speech components changing slowly that represent the radiated noise are removed by directly modulating the time-domain amplitudes in multiple frequency bands. The EL speech enhanced by the proposed MTAM and the conventional frequency-domain enhancement methods (spectral subtraction and Wiener filtering) are evaluated on both acoustic and perceptual characteristics. 
The acoustic analysis reveals that the MTAM not only can reduce the radiated noise more thoroughly but can also easily control the residual noise intensity by adjusting a modulation parameter λ. Moreover, the MTAM can avoid causing new artificial noise that cannot be avoided by the conventional frequency-domain enhancement methods. The perceptual analysis indicates that the MTAM also have better performance on increasing the acceptability and the consonant intelligibility of EL speech than spectral subtraction and Wiener filtering. These findings validate that the MTAM indeed works well in suppressing the radiated noise of EL speech and avoiding the artificial noise.</description><identifier>ISSN: 2329-9290</identifier><identifier>EISSN: 2329-9304</identifier><identifier>DOI: 10.1109/TASLP.2018.2834729</identifier><identifier>CODEN: ITASD8</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Acoustic noise ; Amplitude modulation ; Consonants (speech) ; Electrolarynx speech ; Electronics ; enhancement ; Filtration ; Frequencies ; Intelligibility ; Noise ; Noise control ; Noise intensity ; Noise reduction ; radiated noise ; Speech ; speech quality ; Subtraction ; Time domain analysis ; time-domain amplitude modulation ; Wiener filtering</subject><ispartof>IEEE/ACM transactions on audio, speech, and language processing, 2018-09, Vol.26 (9), p.1585-1593</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. 
(IEEE) 2018</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c361t-9fa09b766210ccc88e0a96359ca3418e4300f02c95004a97dfc9d829dbfe77af3</citedby><cites>FETCH-LOGICAL-c361t-9fa09b766210ccc88e0a96359ca3418e4300f02c95004a97dfc9d829dbfe77af3</cites><orcidid>0000-0002-6704-1216 ; 0000-0002-0280-4884 ; 0000-0002-2628-246X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/8356591$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,792,27901,27902,54733</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/8356591$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Xiao, Ke</creatorcontrib><creatorcontrib>Wang, Supin</creatorcontrib><creatorcontrib>Wan, Mingxi</creatorcontrib><creatorcontrib>Wu, Liang</creatorcontrib><title>Radiated Noise Suppression for Electrolarynx Speech Based on Multiband Time-Domain Amplitude Modulation</title><title>IEEE/ACM transactions on audio, speech, and language processing</title><addtitle>TASLP</addtitle><description>Radiated noise severely degrades the electrolarynx (EL) speech. It cannot be thoroughly suppressed by conventional frequency-domain enhancement methods. In this paper, a new method, called multiband time-domain amplitude modulation (MTAM), is proposed to reduce the radiated noise of EL speech. In the proposed method, the speech components changing slowly that represent the radiated noise are removed by directly modulating the time-domain amplitudes in multiple frequency bands. The EL speech enhanced by the proposed MTAM and the conventional frequency-domain enhancement methods (spectral subtraction and Wiener filtering) are evaluated on both acoustic and perceptual characteristics. 
The acoustic analysis reveals that the MTAM not only can reduce the radiated noise more thoroughly but can also easily control the residual noise intensity by adjusting a modulation parameter λ. Moreover, the MTAM can avoid causing new artificial noise that cannot be avoided by the conventional frequency-domain enhancement methods. The perceptual analysis indicates that the MTAM also have better performance on increasing the acceptability and the consonant intelligibility of EL speech than spectral subtraction and Wiener filtering. These findings validate that the MTAM indeed works well in suppressing the radiated noise of EL speech and avoiding the artificial noise.</description><subject>Acoustic noise</subject><subject>Amplitude modulation</subject><subject>Consonants (speech)</subject><subject>Electrolarynx speech</subject><subject>Electronics</subject><subject>enhancement</subject><subject>Filtration</subject><subject>Frequencies</subject><subject>Intelligibility</subject><subject>Noise</subject><subject>Noise control</subject><subject>Noise intensity</subject><subject>Noise reduction</subject><subject>radiated noise</subject><subject>Speech</subject><subject>speech quality</subject><subject>Subtraction</subject><subject>Time domain analysis</subject><subject>time-domain amplitude modulation</subject><subject>Wiener 
filtering</subject><issn>2329-9290</issn><issn>2329-9304</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kMtOwzAQRSMEElXpD8DGEuuUsZ2HvSzlKbWAaFlHrj0BV0kc7ESCvyelhdXM4p47mhNF5xSmlIK8Ws9Wi5cpAyqmTPAkZ_IoGjHOZCw5JMd_O5NwGk1C2AIAhVzKPBlF76_KWNWhIU_OBiSrvm09hmBdQ0rnyW2FuvOuUv67-SKrFlF_kGsVBmBILPuqsxvVGLK2NcY3rla2IbO6rWzXGyRLZ_pKdUPZWXRSqirg5DDH0dvd7Xr-EC-e7x_ns0WseUa7WJYK5CbPMkZBay0EgpIZT6VWPKECEw5QAtMyBUiUzE2ppRFMmk2Jea5KPo4u972td589hq7Yut43w8mCQSJoDpynQ4rtU9q7EDyWRettPfxYUCh2Totfp8XOaXFwOkAXe8gi4j8geJqlkvIf1i1zcA</recordid><startdate>20180901</startdate><enddate>20180901</enddate><creator>Xiao, Ke</creator><creator>Wang, Supin</creator><creator>Wan, Mingxi</creator><creator>Wu, Liang</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-6704-1216</orcidid><orcidid>https://orcid.org/0000-0002-0280-4884</orcidid><orcidid>https://orcid.org/0000-0002-2628-246X</orcidid></search><sort><creationdate>20180901</creationdate><title>Radiated Noise Suppression for Electrolarynx Speech Based on Multiband Time-Domain Amplitude Modulation</title><author>Xiao, Ke ; Wang, Supin ; Wan, Mingxi ; Wu, Liang</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c361t-9fa09b766210ccc88e0a96359ca3418e4300f02c95004a97dfc9d829dbfe77af3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Acoustic noise</topic><topic>Amplitude modulation</topic><topic>Consonants (speech)</topic><topic>Electrolarynx 
speech</topic><topic>Electronics</topic><topic>enhancement</topic><topic>Filtration</topic><topic>Frequencies</topic><topic>Intelligibility</topic><topic>Noise</topic><topic>Noise control</topic><topic>Noise intensity</topic><topic>Noise reduction</topic><topic>radiated noise</topic><topic>Speech</topic><topic>speech quality</topic><topic>Subtraction</topic><topic>Time domain analysis</topic><topic>time-domain amplitude modulation</topic><topic>Wiener filtering</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Xiao, Ke</creatorcontrib><creatorcontrib>Wang, Supin</creatorcontrib><creatorcontrib>Wan, Mingxi</creatorcontrib><creatorcontrib>Wu, Liang</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE/ACM transactions on audio, speech, and language processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Xiao, Ke</au><au>Wang, Supin</au><au>Wan, Mingxi</au><au>Wu, Liang</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Radiated Noise Suppression for Electrolarynx Speech Based on Multiband Time-Domain Amplitude Modulation</atitle><jtitle>IEEE/ACM transactions on audio, speech, and language 
processing</jtitle><stitle>TASLP</stitle><date>2018-09-01</date><risdate>2018</risdate><volume>26</volume><issue>9</issue><spage>1585</spage><epage>1593</epage><pages>1585-1593</pages><issn>2329-9290</issn><eissn>2329-9304</eissn><coden>ITASD8</coden><abstract>Radiated noise severely degrades the electrolarynx (EL) speech. It cannot be thoroughly suppressed by conventional frequency-domain enhancement methods. In this paper, a new method, called multiband time-domain amplitude modulation (MTAM), is proposed to reduce the radiated noise of EL speech. In the proposed method, the speech components changing slowly that represent the radiated noise are removed by directly modulating the time-domain amplitudes in multiple frequency bands. The EL speech enhanced by the proposed MTAM and the conventional frequency-domain enhancement methods (spectral subtraction and Wiener filtering) are evaluated on both acoustic and perceptual characteristics. The acoustic analysis reveals that the MTAM not only can reduce the radiated noise more thoroughly but can also easily control the residual noise intensity by adjusting a modulation parameter λ. Moreover, the MTAM can avoid causing new artificial noise that cannot be avoided by the conventional frequency-domain enhancement methods. The perceptual analysis indicates that the MTAM also have better performance on increasing the acceptability and the consonant intelligibility of EL speech than spectral subtraction and Wiener filtering. These findings validate that the MTAM indeed works well in suppressing the radiated noise of EL speech and avoiding the artificial noise.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/TASLP.2018.2834729</doi><tpages>9</tpages><orcidid>https://orcid.org/0000-0002-6704-1216</orcidid><orcidid>https://orcid.org/0000-0002-0280-4884</orcidid><orcidid>https://orcid.org/0000-0002-2628-246X</orcidid></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 2329-9290
ispartof IEEE/ACM transactions on audio, speech, and language processing, 2018-09, Vol.26 (9), p.1585-1593
issn 2329-9290
2329-9304
language eng
recordid cdi_proquest_journals_2048170335
source IEEE Electronic Library (IEL)
subjects Acoustic noise
Amplitude modulation
Consonants (speech)
Electrolarynx speech
Electronics
enhancement
Filtration
Frequencies
Intelligibility
Noise
Noise control
Noise intensity
Noise reduction
radiated noise
Speech
speech quality
Subtraction
Time domain analysis
time-domain amplitude modulation
Wiener filtering
title Radiated Noise Suppression for Electrolarynx Speech Based on Multiband Time-Domain Amplitude Modulation
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-09T00%3A03%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Radiated%20Noise%20Suppression%20for%20Electrolarynx%20Speech%20Based%20on%20Multiband%20Time-Domain%20Amplitude%20Modulation&rft.jtitle=IEEE/ACM%20transactions%20on%20audio,%20speech,%20and%20language%20processing&rft.au=Xiao,%20Ke&rft.date=2018-09-01&rft.volume=26&rft.issue=9&rft.spage=1585&rft.epage=1593&rft.pages=1585-1593&rft.issn=2329-9290&rft.eissn=2329-9304&rft.coden=ITASD8&rft_id=info:doi/10.1109/TASLP.2018.2834729&rft_dat=%3Cproquest_RIE%3E2048170335%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2048170335&rft_id=info:pmid/&rft_ieee_id=8356591&rfr_iscdi=true