Radiated Noise Suppression for Electrolarynx Speech Based on Multiband Time-Domain Amplitude Modulation

Radiated noise severely degrades the electrolarynx (EL) speech. It cannot be thoroughly suppressed by conventional frequency-domain enhancement methods. In this paper, a new method, called multiband time-domain amplitude modulation (MTAM), is proposed to reduce the radiated noise of EL speech. In th...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE/ACM transactions on audio, speech, and language processing speech, and language processing, 2018-09, Vol.26 (9), p.1585-1593
Hauptverfasser:	Xiao, Ke, Wang, Supin, Wan, Mingxi, Wu, Liang
Format:	Artikel
Sprache:	eng
Schlagworte:	Acoustic noise Amplitude modulation Consonants (speech) Electrolarynx speech Electronics enhancement Filtration Frequencies Intelligibility Noise Noise control Noise intensity Noise reduction radiated noise Speech speech quality Subtraction Time domain analysis time-domain amplitude modulation Wiener filtering
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	1593
container_issue	9
container_start_page	1585
container_title	IEEE/ACM transactions on audio, speech, and language processing
container_volume	26
creator	Xiao, Ke Wang, Supin Wan, Mingxi Wu, Liang
description	Radiated noise severely degrades the electrolarynx (EL) speech. It cannot be thoroughly suppressed by conventional frequency-domain enhancement methods. In this paper, a new method, called multiband time-domain amplitude modulation (MTAM), is proposed to reduce the radiated noise of EL speech. In the proposed method, the speech components changing slowly that represent the radiated noise are removed by directly modulating the time-domain amplitudes in multiple frequency bands. The EL speech enhanced by the proposed MTAM and the conventional frequency-domain enhancement methods (spectral subtraction and Wiener filtering) are evaluated on both acoustic and perceptual characteristics. The acoustic analysis reveals that the MTAM not only can reduce the radiated noise more thoroughly but can also easily control the residual noise intensity by adjusting a modulation parameter λ. Moreover, the MTAM can avoid causing new artificial noise that cannot be avoided by the conventional frequency-domain enhancement methods. The perceptual analysis indicates that the MTAM also have better performance on increasing the acceptability and the consonant intelligibility of EL speech than spectral subtraction and Wiener filtering. These findings validate that the MTAM indeed works well in suppressing the radiated noise of EL speech and avoiding the artificial noise.
doi_str_mv	10.1109/TASLP.2018.2834729
format	Article
fullrecord	<record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_journals_2048170335</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>8356591</ieee_id><sourcerecordid>2048170335</sourcerecordid><originalsourceid>FETCH-LOGICAL-c361t-9fa09b766210ccc88e0a96359ca3418e4300f02c95004a97dfc9d829dbfe77af3</originalsourceid><addsrcrecordid>eNo9kMtOwzAQRSMEElXpD8DGEuuUsZ2HvSzlKbWAaFlHrj0BV0kc7ESCvyelhdXM4p47mhNF5xSmlIK8Ws9Wi5cpAyqmTPAkZ_IoGjHOZCw5JMd_O5NwGk1C2AIAhVzKPBlF76_KWNWhIU_OBiSrvm09hmBdQ0rnyW2FuvOuUv67-SKrFlF_kGsVBmBILPuqsxvVGLK2NcY3rla2IbO6rWzXGyRLZ_pKdUPZWXRSqirg5DDH0dvd7Xr-EC-e7x_ns0WseUa7WJYK5CbPMkZBay0EgpIZT6VWPKECEw5QAtMyBUiUzE2ppRFMmk2Jea5KPo4u972td589hq7Yut43w8mCQSJoDpynQ4rtU9q7EDyWRettPfxYUCh2Totfp8XOaXFwOkAXe8gi4j8geJqlkvIf1i1zcA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2048170335</pqid></control><display><type>article</type><title>Radiated Noise Suppression for Electrolarynx Speech Based on Multiband Time-Domain Amplitude Modulation</title><source>IEEE Electronic Library (IEL)</source><creator>Xiao, Ke ; Wang, Supin ; Wan, Mingxi ; Wu, Liang</creator><creatorcontrib>Xiao, Ke ; Wang, Supin ; Wan, Mingxi ; Wu, Liang</creatorcontrib><description>Radiated noise severely degrades the electrolarynx (EL) speech. It cannot be thoroughly suppressed by conventional frequency-domain enhancement methods. In this paper, a new method, called multiband time-domain amplitude modulation (MTAM), is proposed to reduce the radiated noise of EL speech. In the proposed method, the speech components changing slowly that represent the radiated noise are removed by directly modulating the time-domain amplitudes in multiple frequency bands. The EL speech enhanced by the proposed MTAM and the conventional frequency-domain enhancement methods (spectral subtraction and Wiener filtering) are evaluated on both acoustic and perceptual characteristics. The acoustic analysis reveals that the MTAM not only can reduce the radiated noise more thoroughly but can also easily control the residual noise intensity by adjusting a modulation parameter λ. Moreover, the MTAM can avoid causing new artificial noise that cannot be avoided by the conventional frequency-domain enhancement methods. The perceptual analysis indicates that the MTAM also have better performance on increasing the acceptability and the consonant intelligibility of EL speech than spectral subtraction and Wiener filtering. These findings validate that the MTAM indeed works well in suppressing the radiated noise of EL speech and avoiding the artificial noise.</description><identifier>ISSN: 2329-9290</identifier><identifier>EISSN: 2329-9304</identifier><identifier>DOI: 10.1109/TASLP.2018.2834729</identifier><identifier>CODEN: ITASD8</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Acoustic noise ; Amplitude modulation ; Consonants (speech) ; Electrolarynx speech ; Electronics ; enhancement ; Filtration ; Frequencies ; Intelligibility ; Noise ; Noise control ; Noise intensity ; Noise reduction ; radiated noise ; Speech ; speech quality ; Subtraction ; Time domain analysis ; time-domain amplitude modulation ; Wiener filtering</subject><ispartof>IEEE/ACM transactions on audio, speech, and language processing, 2018-09, Vol.26 (9), p.1585-1593</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2018</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c361t-9fa09b766210ccc88e0a96359ca3418e4300f02c95004a97dfc9d829dbfe77af3</citedby><cites>FETCH-LOGICAL-c361t-9fa09b766210ccc88e0a96359ca3418e4300f02c95004a97dfc9d829dbfe77af3</cites><orcidid>0000-0002-6704-1216 ; 0000-0002-0280-4884 ; 0000-0002-2628-246X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/8356591$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,792,27901,27902,54733</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/8356591$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Xiao, Ke</creatorcontrib><creatorcontrib>Wang, Supin</creatorcontrib><creatorcontrib>Wan, Mingxi</creatorcontrib><creatorcontrib>Wu, Liang</creatorcontrib><title>Radiated Noise Suppression for Electrolarynx Speech Based on Multiband Time-Domain Amplitude Modulation</title><title>IEEE/ACM transactions on audio, speech, and language processing</title><addtitle>TASLP</addtitle><description>Radiated noise severely degrades the electrolarynx (EL) speech. It cannot be thoroughly suppressed by conventional frequency-domain enhancement methods. In this paper, a new method, called multiband time-domain amplitude modulation (MTAM), is proposed to reduce the radiated noise of EL speech. In the proposed method, the speech components changing slowly that represent the radiated noise are removed by directly modulating the time-domain amplitudes in multiple frequency bands. The EL speech enhanced by the proposed MTAM and the conventional frequency-domain enhancement methods (spectral subtraction and Wiener filtering) are evaluated on both acoustic and perceptual characteristics. The acoustic analysis reveals that the MTAM not only can reduce the radiated noise more thoroughly but can also easily control the residual noise intensity by adjusting a modulation parameter λ. Moreover, the MTAM can avoid causing new artificial noise that cannot be avoided by the conventional frequency-domain enhancement methods. The perceptual analysis indicates that the MTAM also have better performance on increasing the acceptability and the consonant intelligibility of EL speech than spectral subtraction and Wiener filtering. These findings validate that the MTAM indeed works well in suppressing the radiated noise of EL speech and avoiding the artificial noise.</description><subject>Acoustic noise</subject><subject>Amplitude modulation</subject><subject>Consonants (speech)</subject><subject>Electrolarynx speech</subject><subject>Electronics</subject><subject>enhancement</subject><subject>Filtration</subject><subject>Frequencies</subject><subject>Intelligibility</subject><subject>Noise</subject><subject>Noise control</subject><subject>Noise intensity</subject><subject>Noise reduction</subject><subject>radiated noise</subject><subject>Speech</subject><subject>speech quality</subject><subject>Subtraction</subject><subject>Time domain analysis</subject><subject>time-domain amplitude modulation</subject><subject>Wiener filtering</subject><issn>2329-9290</issn><issn>2329-9304</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kMtOwzAQRSMEElXpD8DGEuuUsZ2HvSzlKbWAaFlHrj0BV0kc7ESCvyelhdXM4p47mhNF5xSmlIK8Ws9Wi5cpAyqmTPAkZ_IoGjHOZCw5JMd_O5NwGk1C2AIAhVzKPBlF76_KWNWhIU_OBiSrvm09hmBdQ0rnyW2FuvOuUv67-SKrFlF_kGsVBmBILPuqsxvVGLK2NcY3rla2IbO6rWzXGyRLZ_pKdUPZWXRSqirg5DDH0dvd7Xr-EC-e7x_ns0WseUa7WJYK5CbPMkZBay0EgpIZT6VWPKECEw5QAtMyBUiUzE2ppRFMmk2Jea5KPo4u972td589hq7Yut43w8mCQSJoDpynQ4rtU9q7EDyWRettPfxYUCh2Totfp8XOaXFwOkAXe8gi4j8geJqlkvIf1i1zcA</recordid><startdate>20180901</startdate><enddate>20180901</enddate><creator>Xiao, Ke</creator><creator>Wang, Supin</creator><creator>Wan, Mingxi</creator><creator>Wu, Liang</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-6704-1216</orcidid><orcidid>https://orcid.org/0000-0002-0280-4884</orcidid><orcidid>https://orcid.org/0000-0002-2628-246X</orcidid></search><sort><creationdate>20180901</creationdate><title>Radiated Noise Suppression for Electrolarynx Speech Based on Multiband Time-Domain Amplitude Modulation</title><author>Xiao, Ke ; Wang, Supin ; Wan, Mingxi ; Wu, Liang</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c361t-9fa09b766210ccc88e0a96359ca3418e4300f02c95004a97dfc9d829dbfe77af3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Acoustic noise</topic><topic>Amplitude modulation</topic><topic>Consonants (speech)</topic><topic>Electrolarynx speech</topic><topic>Electronics</topic><topic>enhancement</topic><topic>Filtration</topic><topic>Frequencies</topic><topic>Intelligibility</topic><topic>Noise</topic><topic>Noise control</topic><topic>Noise intensity</topic><topic>Noise reduction</topic><topic>radiated noise</topic><topic>Speech</topic><topic>speech quality</topic><topic>Subtraction</topic><topic>Time domain analysis</topic><topic>time-domain amplitude modulation</topic><topic>Wiener filtering</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Xiao, Ke</creatorcontrib><creatorcontrib>Wang, Supin</creatorcontrib><creatorcontrib>Wan, Mingxi</creatorcontrib><creatorcontrib>Wu, Liang</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE/ACM transactions on audio, speech, and language processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Xiao, Ke</au><au>Wang, Supin</au><au>Wan, Mingxi</au><au>Wu, Liang</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Radiated Noise Suppression for Electrolarynx Speech Based on Multiband Time-Domain Amplitude Modulation</atitle><jtitle>IEEE/ACM transactions on audio, speech, and language processing</jtitle><stitle>TASLP</stitle><date>2018-09-01</date><risdate>2018</risdate><volume>26</volume><issue>9</issue><spage>1585</spage><epage>1593</epage><pages>1585-1593</pages><issn>2329-9290</issn><eissn>2329-9304</eissn><coden>ITASD8</coden><abstract>Radiated noise severely degrades the electrolarynx (EL) speech. It cannot be thoroughly suppressed by conventional frequency-domain enhancement methods. In this paper, a new method, called multiband time-domain amplitude modulation (MTAM), is proposed to reduce the radiated noise of EL speech. In the proposed method, the speech components changing slowly that represent the radiated noise are removed by directly modulating the time-domain amplitudes in multiple frequency bands. The EL speech enhanced by the proposed MTAM and the conventional frequency-domain enhancement methods (spectral subtraction and Wiener filtering) are evaluated on both acoustic and perceptual characteristics. The acoustic analysis reveals that the MTAM not only can reduce the radiated noise more thoroughly but can also easily control the residual noise intensity by adjusting a modulation parameter λ. Moreover, the MTAM can avoid causing new artificial noise that cannot be avoided by the conventional frequency-domain enhancement methods. The perceptual analysis indicates that the MTAM also have better performance on increasing the acceptability and the consonant intelligibility of EL speech than spectral subtraction and Wiener filtering. These findings validate that the MTAM indeed works well in suppressing the radiated noise of EL speech and avoiding the artificial noise.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/TASLP.2018.2834729</doi><tpages>9</tpages><orcidid>https://orcid.org/0000-0002-6704-1216</orcidid><orcidid>https://orcid.org/0000-0002-0280-4884</orcidid><orcidid>https://orcid.org/0000-0002-2628-246X</orcidid></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 2329-9290
ispartof	IEEE/ACM transactions on audio, speech, and language processing, 2018-09, Vol.26 (9), p.1585-1593
issn	2329-9290 2329-9304
language	eng
recordid	cdi_proquest_journals_2048170335
source	IEEE Electronic Library (IEL)
subjects	Acoustic noise Amplitude modulation Consonants (speech) Electrolarynx speech Electronics enhancement Filtration Frequencies Intelligibility Noise Noise control Noise intensity Noise reduction radiated noise Speech speech quality Subtraction Time domain analysis time-domain amplitude modulation Wiener filtering
title	Radiated Noise Suppression for Electrolarynx Speech Based on Multiband Time-Domain Amplitude Modulation
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-09T00%3A03%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Radiated%20Noise%20Suppression%20for%20Electrolarynx%20Speech%20Based%20on%20Multiband%20Time-Domain%20Amplitude%20Modulation&rft.jtitle=IEEE/ACM%20transactions%20on%20audio,%20speech,%20and%20language%20processing&rft.au=Xiao,%20Ke&rft.date=2018-09-01&rft.volume=26&rft.issue=9&rft.spage=1585&rft.epage=1593&rft.pages=1585-1593&rft.issn=2329-9290&rft.eissn=2329-9304&rft.coden=ITASD8&rft_id=info:doi/10.1109/TASLP.2018.2834729&rft_dat=%3Cproquest_RIE%3E2048170335%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2048170335&rft_id=info:pmid/&rft_ieee_id=8356591&rfr_iscdi=true