Radiated Noise Suppression for Electrolarynx Speech Based on Multiband Time-Domain Amplitude Modulation
Radiated noise severely degrades the electrolarynx (EL) speech. It cannot be thoroughly suppressed by conventional frequency-domain enhancement methods. In this paper, a new method, called multiband time-domain amplitude modulation (MTAM), is proposed to reduce the radiated noise of EL speech. In th...
Gespeichert in:
Veröffentlicht in: | IEEE/ACM transactions on audio, speech, and language processing speech, and language processing, 2018-09, Vol.26 (9), p.1585-1593 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 1593 |
---|---|
container_issue | 9 |
container_start_page | 1585 |
container_title | IEEE/ACM transactions on audio, speech, and language processing |
container_volume | 26 |
creator | Xiao, Ke Wang, Supin Wan, Mingxi Wu, Liang |
description | Radiated noise severely degrades the electrolarynx (EL) speech. It cannot be thoroughly suppressed by conventional frequency-domain enhancement methods. In this paper, a new method, called multiband time-domain amplitude modulation (MTAM), is proposed to reduce the radiated noise of EL speech. In the proposed method, the speech components changing slowly that represent the radiated noise are removed by directly modulating the time-domain amplitudes in multiple frequency bands. The EL speech enhanced by the proposed MTAM and the conventional frequency-domain enhancement methods (spectral subtraction and Wiener filtering) are evaluated on both acoustic and perceptual characteristics. The acoustic analysis reveals that the MTAM not only can reduce the radiated noise more thoroughly but can also easily control the residual noise intensity by adjusting a modulation parameter λ. Moreover, the MTAM can avoid causing new artificial noise that cannot be avoided by the conventional frequency-domain enhancement methods. The perceptual analysis indicates that the MTAM also have better performance on increasing the acceptability and the consonant intelligibility of EL speech than spectral subtraction and Wiener filtering. These findings validate that the MTAM indeed works well in suppressing the radiated noise of EL speech and avoiding the artificial noise. |
doi_str_mv | 10.1109/TASLP.2018.2834729 |
format | Article |
fullrecord | <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_journals_2048170335</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>8356591</ieee_id><sourcerecordid>2048170335</sourcerecordid><originalsourceid>FETCH-LOGICAL-c361t-9fa09b766210ccc88e0a96359ca3418e4300f02c95004a97dfc9d829dbfe77af3</originalsourceid><addsrcrecordid>eNo9kMtOwzAQRSMEElXpD8DGEuuUsZ2HvSzlKbWAaFlHrj0BV0kc7ESCvyelhdXM4p47mhNF5xSmlIK8Ws9Wi5cpAyqmTPAkZ_IoGjHOZCw5JMd_O5NwGk1C2AIAhVzKPBlF76_KWNWhIU_OBiSrvm09hmBdQ0rnyW2FuvOuUv67-SKrFlF_kGsVBmBILPuqsxvVGLK2NcY3rla2IbO6rWzXGyRLZ_pKdUPZWXRSqirg5DDH0dvd7Xr-EC-e7x_ns0WseUa7WJYK5CbPMkZBay0EgpIZT6VWPKECEw5QAtMyBUiUzE2ppRFMmk2Jea5KPo4u972td589hq7Yut43w8mCQSJoDpynQ4rtU9q7EDyWRettPfxYUCh2Totfp8XOaXFwOkAXe8gi4j8geJqlkvIf1i1zcA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2048170335</pqid></control><display><type>article</type><title>Radiated Noise Suppression for Electrolarynx Speech Based on Multiband Time-Domain Amplitude Modulation</title><source>IEEE Electronic Library (IEL)</source><creator>Xiao, Ke ; Wang, Supin ; Wan, Mingxi ; Wu, Liang</creator><creatorcontrib>Xiao, Ke ; Wang, Supin ; Wan, Mingxi ; Wu, Liang</creatorcontrib><description>Radiated noise severely degrades the electrolarynx (EL) speech. It cannot be thoroughly suppressed by conventional frequency-domain enhancement methods. In this paper, a new method, called multiband time-domain amplitude modulation (MTAM), is proposed to reduce the radiated noise of EL speech. In the proposed method, the speech components changing slowly that represent the radiated noise are removed by directly modulating the time-domain amplitudes in multiple frequency bands. The EL speech enhanced by the proposed MTAM and the conventional frequency-domain enhancement methods (spectral subtraction and Wiener filtering) are evaluated on both acoustic and perceptual characteristics. The acoustic analysis reveals that the MTAM not only can reduce the radiated noise more thoroughly but can also easily control the residual noise intensity by adjusting a modulation parameter λ. Moreover, the MTAM can avoid causing new artificial noise that cannot be avoided by the conventional frequency-domain enhancement methods. The perceptual analysis indicates that the MTAM also have better performance on increasing the acceptability and the consonant intelligibility of EL speech than spectral subtraction and Wiener filtering. These findings validate that the MTAM indeed works well in suppressing the radiated noise of EL speech and avoiding the artificial noise.</description><identifier>ISSN: 2329-9290</identifier><identifier>EISSN: 2329-9304</identifier><identifier>DOI: 10.1109/TASLP.2018.2834729</identifier><identifier>CODEN: ITASD8</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Acoustic noise ; Amplitude modulation ; Consonants (speech) ; Electrolarynx speech ; Electronics ; enhancement ; Filtration ; Frequencies ; Intelligibility ; Noise ; Noise control ; Noise intensity ; Noise reduction ; radiated noise ; Speech ; speech quality ; Subtraction ; Time domain analysis ; time-domain amplitude modulation ; Wiener filtering</subject><ispartof>IEEE/ACM transactions on audio, speech, and language processing, 2018-09, Vol.26 (9), p.1585-1593</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2018</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c361t-9fa09b766210ccc88e0a96359ca3418e4300f02c95004a97dfc9d829dbfe77af3</citedby><cites>FETCH-LOGICAL-c361t-9fa09b766210ccc88e0a96359ca3418e4300f02c95004a97dfc9d829dbfe77af3</cites><orcidid>0000-0002-6704-1216 ; 0000-0002-0280-4884 ; 0000-0002-2628-246X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/8356591$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,792,27901,27902,54733</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/8356591$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Xiao, Ke</creatorcontrib><creatorcontrib>Wang, Supin</creatorcontrib><creatorcontrib>Wan, Mingxi</creatorcontrib><creatorcontrib>Wu, Liang</creatorcontrib><title>Radiated Noise Suppression for Electrolarynx Speech Based on Multiband Time-Domain Amplitude Modulation</title><title>IEEE/ACM transactions on audio, speech, and language processing</title><addtitle>TASLP</addtitle><description>Radiated noise severely degrades the electrolarynx (EL) speech. It cannot be thoroughly suppressed by conventional frequency-domain enhancement methods. In this paper, a new method, called multiband time-domain amplitude modulation (MTAM), is proposed to reduce the radiated noise of EL speech. In the proposed method, the speech components changing slowly that represent the radiated noise are removed by directly modulating the time-domain amplitudes in multiple frequency bands. The EL speech enhanced by the proposed MTAM and the conventional frequency-domain enhancement methods (spectral subtraction and Wiener filtering) are evaluated on both acoustic and perceptual characteristics. The acoustic analysis reveals that the MTAM not only can reduce the radiated noise more thoroughly but can also easily control the residual noise intensity by adjusting a modulation parameter λ. Moreover, the MTAM can avoid causing new artificial noise that cannot be avoided by the conventional frequency-domain enhancement methods. The perceptual analysis indicates that the MTAM also have better performance on increasing the acceptability and the consonant intelligibility of EL speech than spectral subtraction and Wiener filtering. These findings validate that the MTAM indeed works well in suppressing the radiated noise of EL speech and avoiding the artificial noise.</description><subject>Acoustic noise</subject><subject>Amplitude modulation</subject><subject>Consonants (speech)</subject><subject>Electrolarynx speech</subject><subject>Electronics</subject><subject>enhancement</subject><subject>Filtration</subject><subject>Frequencies</subject><subject>Intelligibility</subject><subject>Noise</subject><subject>Noise control</subject><subject>Noise intensity</subject><subject>Noise reduction</subject><subject>radiated noise</subject><subject>Speech</subject><subject>speech quality</subject><subject>Subtraction</subject><subject>Time domain analysis</subject><subject>time-domain amplitude modulation</subject><subject>Wiener filtering</subject><issn>2329-9290</issn><issn>2329-9304</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kMtOwzAQRSMEElXpD8DGEuuUsZ2HvSzlKbWAaFlHrj0BV0kc7ESCvyelhdXM4p47mhNF5xSmlIK8Ws9Wi5cpAyqmTPAkZ_IoGjHOZCw5JMd_O5NwGk1C2AIAhVzKPBlF76_KWNWhIU_OBiSrvm09hmBdQ0rnyW2FuvOuUv67-SKrFlF_kGsVBmBILPuqsxvVGLK2NcY3rla2IbO6rWzXGyRLZ_pKdUPZWXRSqirg5DDH0dvd7Xr-EC-e7x_ns0WseUa7WJYK5CbPMkZBay0EgpIZT6VWPKECEw5QAtMyBUiUzE2ppRFMmk2Jea5KPo4u972td589hq7Yut43w8mCQSJoDpynQ4rtU9q7EDyWRettPfxYUCh2Totfp8XOaXFwOkAXe8gi4j8geJqlkvIf1i1zcA</recordid><startdate>20180901</startdate><enddate>20180901</enddate><creator>Xiao, Ke</creator><creator>Wang, Supin</creator><creator>Wan, Mingxi</creator><creator>Wu, Liang</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-6704-1216</orcidid><orcidid>https://orcid.org/0000-0002-0280-4884</orcidid><orcidid>https://orcid.org/0000-0002-2628-246X</orcidid></search><sort><creationdate>20180901</creationdate><title>Radiated Noise Suppression for Electrolarynx Speech Based on Multiband Time-Domain Amplitude Modulation</title><author>Xiao, Ke ; Wang, Supin ; Wan, Mingxi ; Wu, Liang</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c361t-9fa09b766210ccc88e0a96359ca3418e4300f02c95004a97dfc9d829dbfe77af3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Acoustic noise</topic><topic>Amplitude modulation</topic><topic>Consonants (speech)</topic><topic>Electrolarynx speech</topic><topic>Electronics</topic><topic>enhancement</topic><topic>Filtration</topic><topic>Frequencies</topic><topic>Intelligibility</topic><topic>Noise</topic><topic>Noise control</topic><topic>Noise intensity</topic><topic>Noise reduction</topic><topic>radiated noise</topic><topic>Speech</topic><topic>speech quality</topic><topic>Subtraction</topic><topic>Time domain analysis</topic><topic>time-domain amplitude modulation</topic><topic>Wiener filtering</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Xiao, Ke</creatorcontrib><creatorcontrib>Wang, Supin</creatorcontrib><creatorcontrib>Wan, Mingxi</creatorcontrib><creatorcontrib>Wu, Liang</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE/ACM transactions on audio, speech, and language processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Xiao, Ke</au><au>Wang, Supin</au><au>Wan, Mingxi</au><au>Wu, Liang</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Radiated Noise Suppression for Electrolarynx Speech Based on Multiband Time-Domain Amplitude Modulation</atitle><jtitle>IEEE/ACM transactions on audio, speech, and language processing</jtitle><stitle>TASLP</stitle><date>2018-09-01</date><risdate>2018</risdate><volume>26</volume><issue>9</issue><spage>1585</spage><epage>1593</epage><pages>1585-1593</pages><issn>2329-9290</issn><eissn>2329-9304</eissn><coden>ITASD8</coden><abstract>Radiated noise severely degrades the electrolarynx (EL) speech. It cannot be thoroughly suppressed by conventional frequency-domain enhancement methods. In this paper, a new method, called multiband time-domain amplitude modulation (MTAM), is proposed to reduce the radiated noise of EL speech. In the proposed method, the speech components changing slowly that represent the radiated noise are removed by directly modulating the time-domain amplitudes in multiple frequency bands. The EL speech enhanced by the proposed MTAM and the conventional frequency-domain enhancement methods (spectral subtraction and Wiener filtering) are evaluated on both acoustic and perceptual characteristics. The acoustic analysis reveals that the MTAM not only can reduce the radiated noise more thoroughly but can also easily control the residual noise intensity by adjusting a modulation parameter λ. Moreover, the MTAM can avoid causing new artificial noise that cannot be avoided by the conventional frequency-domain enhancement methods. The perceptual analysis indicates that the MTAM also have better performance on increasing the acceptability and the consonant intelligibility of EL speech than spectral subtraction and Wiener filtering. These findings validate that the MTAM indeed works well in suppressing the radiated noise of EL speech and avoiding the artificial noise.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/TASLP.2018.2834729</doi><tpages>9</tpages><orcidid>https://orcid.org/0000-0002-6704-1216</orcidid><orcidid>https://orcid.org/0000-0002-0280-4884</orcidid><orcidid>https://orcid.org/0000-0002-2628-246X</orcidid></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 2329-9290 |
ispartof | IEEE/ACM transactions on audio, speech, and language processing, 2018-09, Vol.26 (9), p.1585-1593 |
issn | 2329-9290 2329-9304 |
language | eng |
recordid | cdi_proquest_journals_2048170335 |
source | IEEE Electronic Library (IEL) |
subjects | Acoustic noise Amplitude modulation Consonants (speech) Electrolarynx speech Electronics enhancement Filtration Frequencies Intelligibility Noise Noise control Noise intensity Noise reduction radiated noise Speech speech quality Subtraction Time domain analysis time-domain amplitude modulation Wiener filtering |
title | Radiated Noise Suppression for Electrolarynx Speech Based on Multiband Time-Domain Amplitude Modulation |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-09T00%3A03%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Radiated%20Noise%20Suppression%20for%20Electrolarynx%20Speech%20Based%20on%20Multiband%20Time-Domain%20Amplitude%20Modulation&rft.jtitle=IEEE/ACM%20transactions%20on%20audio,%20speech,%20and%20language%20processing&rft.au=Xiao,%20Ke&rft.date=2018-09-01&rft.volume=26&rft.issue=9&rft.spage=1585&rft.epage=1593&rft.pages=1585-1593&rft.issn=2329-9290&rft.eissn=2329-9304&rft.coden=ITASD8&rft_id=info:doi/10.1109/TASLP.2018.2834729&rft_dat=%3Cproquest_RIE%3E2048170335%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2048170335&rft_id=info:pmid/&rft_ieee_id=8356591&rfr_iscdi=true |