Improved comb filter for the separation of the voiced speech signals in the case of two speakers
In this paper, we present a method for separating voiced sounds from a composite signal. The method is based mainly on separation by a modified comb filter. This filter is keyed to the average values of the estimated pitch. This estimation is performed through an autocorrelation of multi-scale pr...
Published in: | SN applied sciences 2019-08, Vol.1 (8), p.818, Article 818 |
---|---|
Main authors: | Zeremdini, Jihen ; Ben Messaoud, Mohamed Anouar ; Bouzid, Aicha |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
container_end_page | |
---|---|
container_issue | 8 |
container_start_page | 818 |
container_title | SN applied sciences |
container_volume | 1 |
creator | Zeremdini, Jihen ; Ben Messaoud, Mohamed Anouar ; Bouzid, Aicha |
description | In this paper, we present a method for separating voiced sounds from a composite signal. The method is based mainly on separation by a modified comb filter that is keyed to the average values of the estimated pitch. The pitch is estimated through an autocorrelation of the multi-scale product analysis, which separates the effects of the source and the vocal tract. The “autocorrelation of the multi-scale product” method suppresses noise and brings out the periodic structure of the signal. The resulting peaks are used to calculate the mean fundamental frequency of the target speaker, which is then used in the corresponding comb filters to determine the target speaker's contribution. Subtracting this contribution from the mixture yields the interfering speaker. The separation is validated on the Cooke database and on part of the VCTK database, and is compared with recent methods such as the Wang–Brown, Hu–Wang, Zhang–Liu and Quan systems. The results confirm the performance of the proposed approach. |
doi_str_mv | 10.1007/s42452-019-0840-6 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2788429276</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2788429276</sourcerecordid><originalsourceid>FETCH-LOGICAL-c311t-72129aafcc07bbbf350b84b9c9a4b41d53655b9444f7fd39d4648fd797d035bd3</originalsourceid><addsrcrecordid>eNp1kMtOwzAQRS0EElXpB7CzxDrgZxwvUcWjUqVuYG38bFPaONhpEX9P0iBYsZrRzLlXMxeAa4xuMULiLjPCOCkQlgWqGCrKMzAhnNCCSoHPf_uSXoJZzluEEBGSsopOwNti36Z49A7auDcw1LvOJxhigt3Gw-xbnXRXxwbGcJocY217OLfe2w3M9brRuwzr5rS0OvsT-BkHQr_7lK_ARegRP_upU_D6-PAyfy6Wq6fF_H5ZWIpxVwiCidQ6WIuEMSZQjkzFjLRSM8Ow47Tk3EjGWBDBUelYyarghBQOUW4cnYKb0bd_5-Pgc6e28ZCG6xQRVcWIJKLsKTxSNsWckw-qTfVepy-FkRqyVGOWqs9SDVmqQUNGTe7ZZu3Tn_P_om8m0Xa5</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2788429276</pqid></control><display><type>article</type><title>Improved comb filter for the separation of the voiced speech signals in the case of two speakers</title><source>EZB-FREE-00999 freely available EZB journals</source><creator>Zeremdini, Jihen ; Ben Messaoud, Mohamed Anouar ; Bouzid, Aicha</creator><creatorcontrib>Zeremdini, Jihen ; Ben Messaoud, Mohamed Anouar ; Bouzid, Aicha</creatorcontrib><description>In this paper, we present a method for separating voiced sounds from a composite signal. This method is mainly based on the separation by modified comb filter. This filter is keyed to the average values of the estimated pitch. This estimation is performed through an autocorrelation of multi-scale product analysis to separate the effects of the source and the vocal tract. The “autocorrelation of the multi-scale product” method allows noise elimination and the apparition of a signal periodic structure. Peaks that appear are used to calculate the mean fundamental frequency of the target speaker which will be used in the corresponding comb filters to determine the target speaker contribution. 
After the subtraction of this contribution from the mixture, we obtain the intrusion speaker. This separation is validated by its application on Cooke database and a part of VCTK database and compared to recent methods as Wang–Brown, Hu–Wang, Zhang–Liu and Quan systems. Results confirm the performance of the proposed approach.</description><identifier>ISSN: 2523-3963</identifier><identifier>EISSN: 2523-3971</identifier><identifier>DOI: 10.1007/s42452-019-0840-6</identifier><language>eng</language><publisher>Cham: Springer International Publishing</publisher><subject>Applied and Technical Physics ; Autocorrelation ; Chemistry/Food Science ; Dynamic programming ; Earth Sciences ; Engineering ; Engineering: Signal Processing ; Environment ; Frequency ; Materials Science ; Methods ; Periodic structures ; Research Article ; Resonant frequencies ; Separation ; Speech ; Wavelet transforms</subject><ispartof>SN applied sciences, 2019-08, Vol.1 (8), p.818, Article 818</ispartof><rights>Springer Nature Switzerland AG 2019</rights><rights>Springer Nature Switzerland AG 2019.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c311t-72129aafcc07bbbf350b84b9c9a4b41d53655b9444f7fd39d4648fd797d035bd3</cites><orcidid>0000-0001-5030-4430</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Zeremdini, Jihen</creatorcontrib><creatorcontrib>Ben Messaoud, Mohamed Anouar</creatorcontrib><creatorcontrib>Bouzid, Aicha</creatorcontrib><title>Improved comb filter for the separation of the voiced speech signals in the case of two speakers</title><title>SN applied sciences</title><addtitle>SN Appl. 
Sci</addtitle><description>In this paper, we present a method for separating voiced sounds from a composite signal. This method is mainly based on the separation by modified comb filter. This filter is keyed to the average values of the estimated pitch. This estimation is performed through an autocorrelation of multi-scale product analysis to separate the effects of the source and the vocal tract. The “autocorrelation of the multi-scale product” method allows noise elimination and the apparition of a signal periodic structure. Peaks that appear are used to calculate the mean fundamental frequency of the target speaker which will be used in the corresponding comb filters to determine the target speaker contribution. After the subtraction of this contribution from the mixture, we obtain the intrusion speaker. This separation is validated by its application on Cooke database and a part of VCTK database and compared to recent methods as Wang–Brown, Hu–Wang, Zhang–Liu and Quan systems. Results confirm the performance of the proposed approach.</description><subject>Applied and Technical Physics</subject><subject>Autocorrelation</subject><subject>Chemistry/Food Science</subject><subject>Dynamic programming</subject><subject>Earth Sciences</subject><subject>Engineering</subject><subject>Engineering: Signal Processing</subject><subject>Environment</subject><subject>Frequency</subject><subject>Materials Science</subject><subject>Methods</subject><subject>Periodic structures</subject><subject>Research Article</subject><subject>Resonant frequencies</subject><subject>Separation</subject><subject>Speech</subject><subject>Wavelet 
transforms</subject><issn>2523-3963</issn><issn>2523-3971</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNp1kMtOwzAQRS0EElXpB7CzxDrgZxwvUcWjUqVuYG38bFPaONhpEX9P0iBYsZrRzLlXMxeAa4xuMULiLjPCOCkQlgWqGCrKMzAhnNCCSoHPf_uSXoJZzluEEBGSsopOwNti36Z49A7auDcw1LvOJxhigt3Gw-xbnXRXxwbGcJocY217OLfe2w3M9brRuwzr5rS0OvsT-BkHQr_7lK_ARegRP_upU_D6-PAyfy6Wq6fF_H5ZWIpxVwiCidQ6WIuEMSZQjkzFjLRSM8Ow47Tk3EjGWBDBUelYyarghBQOUW4cnYKb0bd_5-Pgc6e28ZCG6xQRVcWIJKLsKTxSNsWckw-qTfVepy-FkRqyVGOWqs9SDVmqQUNGTe7ZZu3Tn_P_om8m0Xa5</recordid><startdate>20190801</startdate><enddate>20190801</enddate><creator>Zeremdini, Jihen</creator><creator>Ben Messaoud, Mohamed Anouar</creator><creator>Bouzid, Aicha</creator><general>Springer International Publishing</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0001-5030-4430</orcidid></search><sort><creationdate>20190801</creationdate><title>Improved comb filter for the separation of the voiced speech signals in the case of two speakers</title><author>Zeremdini, Jihen ; Ben Messaoud, Mohamed Anouar ; Bouzid, Aicha</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c311t-72129aafcc07bbbf350b84b9c9a4b41d53655b9444f7fd39d4648fd797d035bd3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Applied and Technical Physics</topic><topic>Autocorrelation</topic><topic>Chemistry/Food Science</topic><topic>Dynamic programming</topic><topic>Earth Sciences</topic><topic>Engineering</topic><topic>Engineering: Signal Processing</topic><topic>Environment</topic><topic>Frequency</topic><topic>Materials Science</topic><topic>Methods</topic><topic>Periodic structures</topic><topic>Research Article</topic><topic>Resonant frequencies</topic><topic>Separation</topic><topic>Speech</topic><topic>Wavelet 
transforms</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zeremdini, Jihen</creatorcontrib><creatorcontrib>Ben Messaoud, Mohamed Anouar</creatorcontrib><creatorcontrib>Bouzid, Aicha</creatorcontrib><collection>CrossRef</collection><jtitle>SN applied sciences</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zeremdini, Jihen</au><au>Ben Messaoud, Mohamed Anouar</au><au>Bouzid, Aicha</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Improved comb filter for the separation of the voiced speech signals in the case of two speakers</atitle><jtitle>SN applied sciences</jtitle><stitle>SN Appl. Sci</stitle><date>2019-08-01</date><risdate>2019</risdate><volume>1</volume><issue>8</issue><spage>818</spage><pages>818-</pages><artnum>818</artnum><issn>2523-3963</issn><eissn>2523-3971</eissn><abstract>In this paper, we present a method for separating voiced sounds from a composite signal. This method is mainly based on the separation by modified comb filter. This filter is keyed to the average values of the estimated pitch. This estimation is performed through an autocorrelation of multi-scale product analysis to separate the effects of the source and the vocal tract. The “autocorrelation of the multi-scale product” method allows noise elimination and the apparition of a signal periodic structure. Peaks that appear are used to calculate the mean fundamental frequency of the target speaker which will be used in the corresponding comb filters to determine the target speaker contribution. After the subtraction of this contribution from the mixture, we obtain the intrusion speaker. This separation is validated by its application on Cooke database and a part of VCTK database and compared to recent methods as Wang–Brown, Hu–Wang, Zhang–Liu and Quan systems. 
Results confirm the performance of the proposed approach.</abstract><cop>Cham</cop><pub>Springer International Publishing</pub><doi>10.1007/s42452-019-0840-6</doi><orcidid>https://orcid.org/0000-0001-5030-4430</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2523-3963 |
ispartof | SN applied sciences, 2019-08, Vol.1 (8), p.818, Article 818 |
issn | 2523-3963 2523-3971 |
language | eng |
recordid | cdi_proquest_journals_2788429276 |
source | EZB-FREE-00999 freely available EZB journals |
subjects | Applied and Technical Physics ; Autocorrelation ; Chemistry/Food Science ; Dynamic programming ; Earth Sciences ; Engineering ; Engineering: Signal Processing ; Environment ; Frequency ; Materials Science ; Methods ; Periodic structures ; Research Article ; Resonant frequencies ; Separation ; Speech ; Wavelet transforms |
title | Improved comb filter for the separation of the voiced speech signals in the case of two speakers |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-03T07%3A15%3A33IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Improved%20comb%20filter%20for%20the%20separation%20of%20the%20voiced%20speech%20signals%20in%20the%20case%20of%20two%20speakers&rft.jtitle=SN%20applied%20sciences&rft.au=Zeremdini,%20Jihen&rft.date=2019-08-01&rft.volume=1&rft.issue=8&rft.spage=818&rft.pages=818-&rft.artnum=818&rft.issn=2523-3963&rft.eissn=2523-3971&rft_id=info:doi/10.1007/s42452-019-0840-6&rft_dat=%3Cproquest_cross%3E2788429276%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2788429276&rft_id=info:pmid/&rfr_iscdi=true |
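
The pipeline summarized in the description field (estimate the target speaker's mean pitch, comb-filter the mixture at that pitch, then subtract the result to obtain the other speaker) can be illustrated with a minimal Python sketch. This is not the authors' implementation: plain autocorrelation stands in for their autocorrelation of the multi-scale product, a basic feedforward comb stands in for their modified comb filter, and all function names (`estimate_mean_f0`, `comb_filter`, `separate`) are hypothetical. The test signal is synthetic and deliberately contrived so that the interferer falls exactly on the nulls of the comb.

```python
import numpy as np


def estimate_mean_f0(x, fs, fmin=60.0, fmax=400.0):
    """Return one pitch estimate in Hz from the highest autocorrelation peak.

    Assumption: plain autocorrelation replaces the paper's
    autocorrelation of the multi-scale product.
    """
    x = x - np.mean(x)
    # Full autocorrelation; keep the non-negative lags only.
    ac = np.correlate(x, x, mode="full")[len(x) - 1:]
    lo, hi = int(fs / fmax), int(fs / fmin)  # plausible pitch-period lags
    lag = lo + int(np.argmax(ac[lo:hi + 1]))
    return fs / lag


def comb_filter(x, fs, f0):
    """Feedforward comb y[n] = (x[n] + x[n - T]) / 2 with T = round(fs / f0).

    It passes the harmonics of f0 and has nulls halfway between them.
    Assumption: a stand-in for the paper's modified comb filter.
    """
    period = int(round(fs / f0))
    delayed = np.concatenate([np.zeros(period), x[:-period]])
    return 0.5 * (x + delayed)


def separate(mixture, fs):
    """Estimate the target with the comb; the residual is the interferer."""
    f0 = estimate_mean_f0(mixture, fs)
    target = comb_filter(mixture, fs, f0)
    interferer = mixture - target
    return f0, target, interferer


# Synthetic, stationary test case (not speech): a 100 Hz harmonic "target"
# plus a 250 Hz sinusoidal "interferer" that sits on a null of the comb.
fs = 8000
t = np.arange(fs) / fs
target_true = (1.00 * np.sin(2 * np.pi * 100 * t)
               + 0.50 * np.sin(2 * np.pi * 200 * t)
               + 0.25 * np.sin(2 * np.pi * 300 * t))
interferer_true = 0.4 * np.sin(2 * np.pi * 250 * t)

f0, target_est, interferer_est = separate(target_true + interferer_true, fs)
print("estimated f0 (Hz):", f0)
```

Real voiced speech is not stationary, so a practical system would presumably work frame by frame and track the pitch over time; the single global estimate here only mirrors the "mean fundamental frequency" idea mentioned in the description.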