Improved comb filter for the separation of the voiced speech signals in the case of two speakers

In this paper, we present a method for separating voiced sounds from a composite signal. The method relies mainly on a modified comb filter, which is keyed to the average values of the estimated pitch. The pitch is estimated through an autocorrelation of multi-scale pr...
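
The comb-filter separation step summarized above can be sketched as follows. This is a minimal Python sketch, not the authors' implementation. It assumes the target speaker's mean fundamental frequency is already known (the paper estimates it from the autocorrelation of the multi-scale product, which is not reproduced here), and it uses a plain feedforward comb filter rather than the paper's modified one. The names `comb_filter` and `separate` and the synthetic test tones are hypothetical.

```python
import numpy as np


def comb_filter(x, f0_hz, fs_hz):
    """Plain feedforward comb filter: y[n] = 0.5 * (x[n] + x[n - T]).

    T = round(fs / f0) is the pitch period in samples, so components at
    multiples of f0 add in phase and pass through with unit gain.
    """
    period = int(round(fs_hz / f0_hz))
    # Delay the input by one pitch period, padding the start with zeros.
    delayed = np.concatenate([np.zeros(period), x[:-period]])
    return 0.5 * (x + delayed)


def separate(mixture, target_mean_f0_hz, fs_hz):
    """Estimate the target with the comb filter, then subtract it
    from the mixture to obtain the intruding speaker."""
    target = comb_filter(mixture, target_mean_f0_hz, fs_hz)
    intrusion = mixture - target
    return target, intrusion


# Synthetic stand-ins for two voiced sources (assumed values, not paper data).
fs = 8000
n = np.arange(fs)
target_true = np.sin(2 * np.pi * 100 * n / fs)
intrusion_true = 0.8 * np.sin(2 * np.pi * 150 * n / fs)

target_est, intrusion_est = separate(target_true + intrusion_true, 100.0, fs)
```

With these tones the delay is 80 samples, which is exactly one period of the 100 Hz target and one and a half periods of the 150 Hz interferer, so the target adds in phase while the interferer cancels. Real speech has a time-varying pitch and overlapping harmonics, so separation would only be approximate there.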

Detailed description

Bibliographic details
Published in: SN applied sciences 2019-08, Vol.1 (8), p.818, Article 818
Main authors: Zeremdini, Jihen; Ben Messaoud, Mohamed Anouar; Bouzid, Aicha
Format: Article
Language: eng
Subjects:
Online access: Full text
container_end_page
container_issue 8
container_start_page 818
container_title SN applied sciences
container_volume 1
creator Zeremdini, Jihen
Ben Messaoud, Mohamed Anouar
Bouzid, Aicha
description In this paper, we present a method for separating voiced sounds from a composite signal. The method relies mainly on a modified comb filter, which is keyed to the average values of the estimated pitch. The pitch is estimated through an autocorrelation of the multi-scale product analysis, which separates the effects of the source from those of the vocal tract. The “autocorrelation of the multi-scale product” method suppresses noise and brings out the periodic structure of the signal. The resulting peaks are used to calculate the mean fundamental frequency of the target speaker, which is then used in the corresponding comb filters to determine the target speaker's contribution. Subtracting this contribution from the mixture yields the intruding speaker. The separation is validated on the Cooke database and part of the VCTK database, and compared with recent methods such as the Wang–Brown, Hu–Wang, Zhang–Liu and Quan systems. Results confirm the performance of the proposed approach.
doi_str_mv 10.1007/s42452-019-0840-6
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2788429276</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2788429276</sourcerecordid><originalsourceid>FETCH-LOGICAL-c311t-72129aafcc07bbbf350b84b9c9a4b41d53655b9444f7fd39d4648fd797d035bd3</originalsourceid><addsrcrecordid>eNp1kMtOwzAQRS0EElXpB7CzxDrgZxwvUcWjUqVuYG38bFPaONhpEX9P0iBYsZrRzLlXMxeAa4xuMULiLjPCOCkQlgWqGCrKMzAhnNCCSoHPf_uSXoJZzluEEBGSsopOwNti36Z49A7auDcw1LvOJxhigt3Gw-xbnXRXxwbGcJocY217OLfe2w3M9brRuwzr5rS0OvsT-BkHQr_7lK_ARegRP_upU_D6-PAyfy6Wq6fF_H5ZWIpxVwiCidQ6WIuEMSZQjkzFjLRSM8Ow47Tk3EjGWBDBUelYyarghBQOUW4cnYKb0bd_5-Pgc6e28ZCG6xQRVcWIJKLsKTxSNsWckw-qTfVepy-FkRqyVGOWqs9SDVmqQUNGTe7ZZu3Tn_P_om8m0Xa5</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2788429276</pqid></control><display><type>article</type><title>Improved comb filter for the separation of the voiced speech signals in the case of two speakers</title><source>EZB-FREE-00999 freely available EZB journals</source><creator>Zeremdini, Jihen ; Ben Messaoud, Mohamed Anouar ; Bouzid, Aicha</creator><creatorcontrib>Zeremdini, Jihen ; Ben Messaoud, Mohamed Anouar ; Bouzid, Aicha</creatorcontrib><description>In this paper, we present a method for separating voiced sounds from a composite signal. This method is mainly based on the separation by modified comb filter. This filter is keyed to the average values of the estimated pitch. This estimation is performed through an autocorrelation of multi-scale product analysis to separate the effects of the source and the vocal tract. The “autocorrelation of the multi-scale product” method allows noise elimination and the apparition of a signal periodic structure. Peaks that appear are used to calculate the mean fundamental frequency of the target speaker which will be used in the corresponding comb filters to determine the target speaker contribution. 
After the subtraction of this contribution from the mixture, we obtain the intrusion speaker. This separation is validated by its application on Cooke database and a part of VCTK database and compared to recent methods as Wang–Brown, Hu–Wang, Zhang–Liu and Quan systems. Results confirm the performance of the proposed approach.</description><identifier>ISSN: 2523-3963</identifier><identifier>EISSN: 2523-3971</identifier><identifier>DOI: 10.1007/s42452-019-0840-6</identifier><language>eng</language><publisher>Cham: Springer International Publishing</publisher><subject>Applied and Technical Physics ; Autocorrelation ; Chemistry/Food Science ; Dynamic programming ; Earth Sciences ; Engineering ; Engineering: Signal Processing ; Environment ; Frequency ; Materials Science ; Methods ; Periodic structures ; Research Article ; Resonant frequencies ; Separation ; Speech ; Wavelet transforms</subject><ispartof>SN applied sciences, 2019-08, Vol.1 (8), p.818, Article 818</ispartof><rights>Springer Nature Switzerland AG 2019</rights><rights>Springer Nature Switzerland AG 2019.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c311t-72129aafcc07bbbf350b84b9c9a4b41d53655b9444f7fd39d4648fd797d035bd3</cites><orcidid>0000-0001-5030-4430</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Zeremdini, Jihen</creatorcontrib><creatorcontrib>Ben Messaoud, Mohamed Anouar</creatorcontrib><creatorcontrib>Bouzid, Aicha</creatorcontrib><title>Improved comb filter for the separation of the voiced speech signals in the case of two speakers</title><title>SN applied sciences</title><addtitle>SN Appl. 
Sci</addtitle><description>In this paper, we present a method for separating voiced sounds from a composite signal. This method is mainly based on the separation by modified comb filter. This filter is keyed to the average values of the estimated pitch. This estimation is performed through an autocorrelation of multi-scale product analysis to separate the effects of the source and the vocal tract. The “autocorrelation of the multi-scale product” method allows noise elimination and the apparition of a signal periodic structure. Peaks that appear are used to calculate the mean fundamental frequency of the target speaker which will be used in the corresponding comb filters to determine the target speaker contribution. After the subtraction of this contribution from the mixture, we obtain the intrusion speaker. This separation is validated by its application on Cooke database and a part of VCTK database and compared to recent methods as Wang–Brown, Hu–Wang, Zhang–Liu and Quan systems. Results confirm the performance of the proposed approach.</description><subject>Applied and Technical Physics</subject><subject>Autocorrelation</subject><subject>Chemistry/Food Science</subject><subject>Dynamic programming</subject><subject>Earth Sciences</subject><subject>Engineering</subject><subject>Engineering: Signal Processing</subject><subject>Environment</subject><subject>Frequency</subject><subject>Materials Science</subject><subject>Methods</subject><subject>Periodic structures</subject><subject>Research Article</subject><subject>Resonant frequencies</subject><subject>Separation</subject><subject>Speech</subject><subject>Wavelet 
transforms</subject><issn>2523-3963</issn><issn>2523-3971</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNp1kMtOwzAQRS0EElXpB7CzxDrgZxwvUcWjUqVuYG38bFPaONhpEX9P0iBYsZrRzLlXMxeAa4xuMULiLjPCOCkQlgWqGCrKMzAhnNCCSoHPf_uSXoJZzluEEBGSsopOwNti36Z49A7auDcw1LvOJxhigt3Gw-xbnXRXxwbGcJocY217OLfe2w3M9brRuwzr5rS0OvsT-BkHQr_7lK_ARegRP_upU_D6-PAyfy6Wq6fF_H5ZWIpxVwiCidQ6WIuEMSZQjkzFjLRSM8Ow47Tk3EjGWBDBUelYyarghBQOUW4cnYKb0bd_5-Pgc6e28ZCG6xQRVcWIJKLsKTxSNsWckw-qTfVepy-FkRqyVGOWqs9SDVmqQUNGTe7ZZu3Tn_P_om8m0Xa5</recordid><startdate>20190801</startdate><enddate>20190801</enddate><creator>Zeremdini, Jihen</creator><creator>Ben Messaoud, Mohamed Anouar</creator><creator>Bouzid, Aicha</creator><general>Springer International Publishing</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0001-5030-4430</orcidid></search><sort><creationdate>20190801</creationdate><title>Improved comb filter for the separation of the voiced speech signals in the case of two speakers</title><author>Zeremdini, Jihen ; Ben Messaoud, Mohamed Anouar ; Bouzid, Aicha</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c311t-72129aafcc07bbbf350b84b9c9a4b41d53655b9444f7fd39d4648fd797d035bd3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Applied and Technical Physics</topic><topic>Autocorrelation</topic><topic>Chemistry/Food Science</topic><topic>Dynamic programming</topic><topic>Earth Sciences</topic><topic>Engineering</topic><topic>Engineering: Signal Processing</topic><topic>Environment</topic><topic>Frequency</topic><topic>Materials Science</topic><topic>Methods</topic><topic>Periodic structures</topic><topic>Research Article</topic><topic>Resonant frequencies</topic><topic>Separation</topic><topic>Speech</topic><topic>Wavelet 
transforms</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zeremdini, Jihen</creatorcontrib><creatorcontrib>Ben Messaoud, Mohamed Anouar</creatorcontrib><creatorcontrib>Bouzid, Aicha</creatorcontrib><collection>CrossRef</collection><jtitle>SN applied sciences</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zeremdini, Jihen</au><au>Ben Messaoud, Mohamed Anouar</au><au>Bouzid, Aicha</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Improved comb filter for the separation of the voiced speech signals in the case of two speakers</atitle><jtitle>SN applied sciences</jtitle><stitle>SN Appl. Sci</stitle><date>2019-08-01</date><risdate>2019</risdate><volume>1</volume><issue>8</issue><spage>818</spage><pages>818-</pages><artnum>818</artnum><issn>2523-3963</issn><eissn>2523-3971</eissn><abstract>In this paper, we present a method for separating voiced sounds from a composite signal. This method is mainly based on the separation by modified comb filter. This filter is keyed to the average values of the estimated pitch. This estimation is performed through an autocorrelation of multi-scale product analysis to separate the effects of the source and the vocal tract. The “autocorrelation of the multi-scale product” method allows noise elimination and the apparition of a signal periodic structure. Peaks that appear are used to calculate the mean fundamental frequency of the target speaker which will be used in the corresponding comb filters to determine the target speaker contribution. After the subtraction of this contribution from the mixture, we obtain the intrusion speaker. This separation is validated by its application on Cooke database and a part of VCTK database and compared to recent methods as Wang–Brown, Hu–Wang, Zhang–Liu and Quan systems. 
Results confirm the performance of the proposed approach.</abstract><cop>Cham</cop><pub>Springer International Publishing</pub><doi>10.1007/s42452-019-0840-6</doi><orcidid>https://orcid.org/0000-0001-5030-4430</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2523-3963
ispartof SN applied sciences, 2019-08, Vol.1 (8), p.818, Article 818
issn 2523-3963
2523-3971
language eng
recordid cdi_proquest_journals_2788429276
source EZB-FREE-00999 freely available EZB journals
subjects Applied and Technical Physics
Autocorrelation
Chemistry/Food Science
Dynamic programming
Earth Sciences
Engineering
Engineering: Signal Processing
Environment
Frequency
Materials Science
Methods
Periodic structures
Research Article
Resonant frequencies
Separation
Speech
Wavelet transforms
title Improved comb filter for the separation of the voiced speech signals in the case of two speakers
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-03T07%3A15%3A33IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Improved%20comb%20filter%20for%20the%20separation%20of%20the%20voiced%20speech%20signals%20in%20the%20case%20of%20two%20speakers&rft.jtitle=SN%20applied%20sciences&rft.au=Zeremdini,%20Jihen&rft.date=2019-08-01&rft.volume=1&rft.issue=8&rft.spage=818&rft.pages=818-&rft.artnum=818&rft.issn=2523-3963&rft.eissn=2523-3971&rft_id=info:doi/10.1007/s42452-019-0840-6&rft_dat=%3Cproquest_cross%3E2788429276%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2788429276&rft_id=info:pmid/&rfr_iscdi=true