Hybridization DE with K-means for speaker clustering in speaker diarization of broadcasts news

In this paper, we address the problem of optimal non-hierarchical clustering in the speaker clustering phase for the speaker diarization task of news broadcasts. A new hybridization combining differential evolution (DE) algorithm and K-means algorithm is proposed and tested on TV news database (TVND...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International journal of speech technology 2019-12, Vol.22 (4), p.893-909
Hauptverfasser: Karim, Dabbabi, Salah, Hajji, Adnen, Cherif
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 909
container_issue 4
container_start_page 893
container_title International journal of speech technology
container_volume 22
creator Karim, Dabbabi
Salah, Hajji
Adnen, Cherif
description In this paper, we address the problem of optimal non-hierarchical clustering in the speaker clustering phase for the speaker diarization task of news broadcasts. A new hybridization combining differential evolution (DE) algorithm and K-means algorithm is proposed and tested on TV news database (TVND). To optimize the classification of speakers, two criteria, namely trace within criterion (TRW) and variance ratio criterion (VRC), were used as clustering validity indices, correcting every possible grouping of speakers’ segments. Concerning the encoding of the classification of clusters to be optimized, it is performed by the cluster centers in DE algorithm. Therefore, a problem of rearrangement of centers in the populations can be generated, which cannot ensure an efficient search by applying evolutionary operators. For this purpose, an efficient heuristic was also proposed for this rearrangement. Non-hybrid DE variants were applied with and without the rearrangement of cluster centers, and compared with the corresponding hybrid K-means variants. The experimental results have showed the high-efficiency of hybrid K-means variants with the rearrangement of cluster centers compared with those without the rearrangement of cluster centers and non-hybrid DE variants. Also, the obtained results using hybrid and non-hybrid DE variants with the rearrangement of cluster centers were quite similar using both TWR and VRC criteria. Moreover, the best efficiency was acquired using hybrid DE variants thanks to these two criteria from which a value of 13.05% of DER has been reached by hybrid b6e6rl variant.
doi_str_mv 10.1007/s10772-019-09633-6
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2312828267</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2312828267</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-bb28cdad199c2b43be87d389597991107d021693206e6cf9d6f3bbbc138db4183</originalsourceid><addsrcrecordid>eNp9UMtOwzAQtBBIlMIPcLLE2eC1Wyc-olJaRCUucMXyK-DSJsFOVZWvxyU8bmgPuxrN7O4MQudAL4HS4ioBLQpGKEhCpeCciAM0gHGGSgB6mGdeAmEjEMfoJKUlpVQWkg3Q83xnYnDhQ3ehqfHNFG9D94rvydrrOuGqiTi1Xr_5iO1qkzofQ_2CQ_2LuqDjj7qpsImNdlanLuHab9MpOqr0Kvmz7z5ET7fTx8mcLB5md5PrBbEcZEeMYaV12oGUlpkRN74sHC_lOD8ps4HCUQZCckaFF7aSTlTcGGOzK2dGUPIhuuj3trF53_jUqWWziXU-qRgHVuYSRWaxnmVjk1L0lWpjWOu4U0DVPkfV56hyjuorRyWyiPei1O69-_i3-h_VJ5Yrdis</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2312828267</pqid></control><display><type>article</type><title>Hybridization DE with K-means for speaker clustering in speaker diarization of broadcasts news</title><source>SpringerLink Journals</source><creator>Karim, Dabbabi ; Salah, Hajji ; Adnen, Cherif</creator><creatorcontrib>Karim, Dabbabi ; Salah, Hajji ; Adnen, Cherif</creatorcontrib><description>In this paper, we address the problem of optimal non-hierarchical clustering in the speaker clustering phase for the speaker diarization task of news broadcasts. A new hybridization combining differential evolution (DE) algorithm and K-means algorithm is proposed and tested on TV news database (TVND). To optimize the classification of speakers, two criteria, namely trace within criterion (TRW) and variance ratio criterion (VRC), were used as clustering validity indices, correcting every possible grouping of speakers’ segments. Concerning the encoding of the classification of clusters to be optimized, it is performed by the cluster centers in DE algorithm. Therefore, a problem of rearrangement of centers in the populations can be generated, which cannot ensure an efficient search by applying evolutionary operators. For this purpose, an efficient heuristic was also proposed for this rearrangement. Non-hybrid DE variants were applied with and without the rearrangement of cluster centers, and compared with the corresponding hybrid K-means variants. The experimental results have showed the high-efficiency of hybrid K-means variants with the rearrangement of cluster centers compared with those without the rearrangement of cluster centers and non-hybrid DE variants. Also, the obtained results using hybrid and non-hybrid DE variants with the rearrangement of cluster centers were quite similar using both TWR and VRC criteria. Moreover, the best efficiency was acquired using hybrid DE variants thanks to these two criteria from which a value of 13.05% of DER has been reached by hybrid b6e6rl variant.</description><identifier>ISSN: 1381-2416</identifier><identifier>EISSN: 1572-8110</identifier><identifier>DOI: 10.1007/s10772-019-09633-6</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Algorithms ; Artificial Intelligence ; Classification ; Cluster analysis ; Clustering ; Criteria ; Engineering ; Evolutionary algorithms ; Evolutionary computation ; News ; Optimization ; Signal,Image and Speech Processing ; Social Sciences ; Speech</subject><ispartof>International journal of speech technology, 2019-12, Vol.22 (4), p.893-909</ispartof><rights>Springer Science+Business Media, LLC, part of Springer Nature 2019</rights><rights>Copyright Springer Nature B.V. 2019</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-bb28cdad199c2b43be87d389597991107d021693206e6cf9d6f3bbbc138db4183</citedby><cites>FETCH-LOGICAL-c319t-bb28cdad199c2b43be87d389597991107d021693206e6cf9d6f3bbbc138db4183</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s10772-019-09633-6$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s10772-019-09633-6$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Karim, Dabbabi</creatorcontrib><creatorcontrib>Salah, Hajji</creatorcontrib><creatorcontrib>Adnen, Cherif</creatorcontrib><title>Hybridization DE with K-means for speaker clustering in speaker diarization of broadcasts news</title><title>International journal of speech technology</title><addtitle>Int J Speech Technol</addtitle><description>In this paper, we address the problem of optimal non-hierarchical clustering in the speaker clustering phase for the speaker diarization task of news broadcasts. A new hybridization combining differential evolution (DE) algorithm and K-means algorithm is proposed and tested on TV news database (TVND). To optimize the classification of speakers, two criteria, namely trace within criterion (TRW) and variance ratio criterion (VRC), were used as clustering validity indices, correcting every possible grouping of speakers’ segments. Concerning the encoding of the classification of clusters to be optimized, it is performed by the cluster centers in DE algorithm. Therefore, a problem of rearrangement of centers in the populations can be generated, which cannot ensure an efficient search by applying evolutionary operators. For this purpose, an efficient heuristic was also proposed for this rearrangement. Non-hybrid DE variants were applied with and without the rearrangement of cluster centers, and compared with the corresponding hybrid K-means variants. The experimental results have showed the high-efficiency of hybrid K-means variants with the rearrangement of cluster centers compared with those without the rearrangement of cluster centers and non-hybrid DE variants. Also, the obtained results using hybrid and non-hybrid DE variants with the rearrangement of cluster centers were quite similar using both TWR and VRC criteria. Moreover, the best efficiency was acquired using hybrid DE variants thanks to these two criteria from which a value of 13.05% of DER has been reached by hybrid b6e6rl variant.</description><subject>Algorithms</subject><subject>Artificial Intelligence</subject><subject>Classification</subject><subject>Cluster analysis</subject><subject>Clustering</subject><subject>Criteria</subject><subject>Engineering</subject><subject>Evolutionary algorithms</subject><subject>Evolutionary computation</subject><subject>News</subject><subject>Optimization</subject><subject>Signal,Image and Speech Processing</subject><subject>Social Sciences</subject><subject>Speech</subject><issn>1381-2416</issn><issn>1572-8110</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNp9UMtOwzAQtBBIlMIPcLLE2eC1Wyc-olJaRCUucMXyK-DSJsFOVZWvxyU8bmgPuxrN7O4MQudAL4HS4ioBLQpGKEhCpeCciAM0gHGGSgB6mGdeAmEjEMfoJKUlpVQWkg3Q83xnYnDhQ3ehqfHNFG9D94rvydrrOuGqiTi1Xr_5iO1qkzofQ_2CQ_2LuqDjj7qpsImNdlanLuHab9MpOqr0Kvmz7z5ET7fTx8mcLB5md5PrBbEcZEeMYaV12oGUlpkRN74sHC_lOD8ps4HCUQZCckaFF7aSTlTcGGOzK2dGUPIhuuj3trF53_jUqWWziXU-qRgHVuYSRWaxnmVjk1L0lWpjWOu4U0DVPkfV56hyjuorRyWyiPei1O69-_i3-h_VJ5Yrdis</recordid><startdate>20191201</startdate><enddate>20191201</enddate><creator>Karim, Dabbabi</creator><creator>Salah, Hajji</creator><creator>Adnen, Cherif</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7T9</scope></search><sort><creationdate>20191201</creationdate><title>Hybridization DE with K-means for speaker clustering in speaker diarization of broadcasts news</title><author>Karim, Dabbabi ; Salah, Hajji ; Adnen, Cherif</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-bb28cdad199c2b43be87d389597991107d021693206e6cf9d6f3bbbc138db4183</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Algorithms</topic><topic>Artificial Intelligence</topic><topic>Classification</topic><topic>Cluster analysis</topic><topic>Clustering</topic><topic>Criteria</topic><topic>Engineering</topic><topic>Evolutionary algorithms</topic><topic>Evolutionary computation</topic><topic>News</topic><topic>Optimization</topic><topic>Signal,Image and Speech Processing</topic><topic>Social Sciences</topic><topic>Speech</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Karim, Dabbabi</creatorcontrib><creatorcontrib>Salah, Hajji</creatorcontrib><creatorcontrib>Adnen, Cherif</creatorcontrib><collection>CrossRef</collection><collection>Linguistics and Language Behavior Abstracts (LLBA)</collection><jtitle>International journal of speech technology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Karim, Dabbabi</au><au>Salah, Hajji</au><au>Adnen, Cherif</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Hybridization DE with K-means for speaker clustering in speaker diarization of broadcasts news</atitle><jtitle>International journal of speech technology</jtitle><stitle>Int J Speech Technol</stitle><date>2019-12-01</date><risdate>2019</risdate><volume>22</volume><issue>4</issue><spage>893</spage><epage>909</epage><pages>893-909</pages><issn>1381-2416</issn><eissn>1572-8110</eissn><abstract>In this paper, we address the problem of optimal non-hierarchical clustering in the speaker clustering phase for the speaker diarization task of news broadcasts. A new hybridization combining differential evolution (DE) algorithm and K-means algorithm is proposed and tested on TV news database (TVND). To optimize the classification of speakers, two criteria, namely trace within criterion (TRW) and variance ratio criterion (VRC), were used as clustering validity indices, correcting every possible grouping of speakers’ segments. Concerning the encoding of the classification of clusters to be optimized, it is performed by the cluster centers in DE algorithm. Therefore, a problem of rearrangement of centers in the populations can be generated, which cannot ensure an efficient search by applying evolutionary operators. For this purpose, an efficient heuristic was also proposed for this rearrangement. Non-hybrid DE variants were applied with and without the rearrangement of cluster centers, and compared with the corresponding hybrid K-means variants. The experimental results have showed the high-efficiency of hybrid K-means variants with the rearrangement of cluster centers compared with those without the rearrangement of cluster centers and non-hybrid DE variants. Also, the obtained results using hybrid and non-hybrid DE variants with the rearrangement of cluster centers were quite similar using both TWR and VRC criteria. Moreover, the best efficiency was acquired using hybrid DE variants thanks to these two criteria from which a value of 13.05% of DER has been reached by hybrid b6e6rl variant.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s10772-019-09633-6</doi><tpages>17</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1381-2416
ispartof International journal of speech technology, 2019-12, Vol.22 (4), p.893-909
issn 1381-2416
1572-8110
language eng
recordid cdi_proquest_journals_2312828267
source SpringerLink Journals
subjects Algorithms
Artificial Intelligence
Classification
Cluster analysis
Clustering
Criteria
Engineering
Evolutionary algorithms
Evolutionary computation
News
Optimization
Signal,Image and Speech Processing
Social Sciences
Speech
title Hybridization DE with K-means for speaker clustering in speaker diarization of broadcasts news
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T09%3A46%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Hybridization%20DE%20with%20K-means%20for%20speaker%20clustering%20in%20speaker%20diarization%20of%20broadcasts%20news&rft.jtitle=International%20journal%20of%20speech%20technology&rft.au=Karim,%20Dabbabi&rft.date=2019-12-01&rft.volume=22&rft.issue=4&rft.spage=893&rft.epage=909&rft.pages=893-909&rft.issn=1381-2416&rft.eissn=1572-8110&rft_id=info:doi/10.1007/s10772-019-09633-6&rft_dat=%3Cproquest_cross%3E2312828267%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2312828267&rft_id=info:pmid/&rfr_iscdi=true