Hybridization DE with K-means for speaker clustering in speaker diarization of broadcasts news
In this paper, we address the problem of optimal non-hierarchical clustering in the speaker clustering phase for the speaker diarization task of news broadcasts. A new hybridization combining differential evolution (DE) algorithm and K-means algorithm is proposed and tested on TV news database (TVND...
Gespeichert in:
Veröffentlicht in: | International journal of speech technology 2019-12, Vol.22 (4), p.893-909 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 909 |
---|---|
container_issue | 4 |
container_start_page | 893 |
container_title | International journal of speech technology |
container_volume | 22 |
creator | Karim, Dabbabi Salah, Hajji Adnen, Cherif |
description | In this paper, we address the problem of optimal non-hierarchical clustering in the speaker clustering phase for the speaker diarization task of news broadcasts. A new hybridization combining differential evolution (DE) algorithm and K-means algorithm is proposed and tested on TV news database (TVND). To optimize the classification of speakers, two criteria, namely trace within criterion (TRW) and variance ratio criterion (VRC), were used as clustering validity indices, correcting every possible grouping of speakers’ segments. Concerning the encoding of the classification of clusters to be optimized, it is performed by the cluster centers in DE algorithm. Therefore, a problem of rearrangement of centers in the populations can be generated, which cannot ensure an efficient search by applying evolutionary operators. For this purpose, an efficient heuristic was also proposed for this rearrangement. Non-hybrid DE variants were applied with and without the rearrangement of cluster centers, and compared with the corresponding hybrid K-means variants. The experimental results have showed the high-efficiency of hybrid K-means variants with the rearrangement of cluster centers compared with those without the rearrangement of cluster centers and non-hybrid DE variants. Also, the obtained results using hybrid and non-hybrid DE variants with the rearrangement of cluster centers were quite similar using both TWR and VRC criteria. Moreover, the best efficiency was acquired using hybrid DE variants thanks to these two criteria from which a value of 13.05% of DER has been reached by hybrid b6e6rl variant. |
doi_str_mv | 10.1007/s10772-019-09633-6 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2312828267</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2312828267</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-bb28cdad199c2b43be87d389597991107d021693206e6cf9d6f3bbbc138db4183</originalsourceid><addsrcrecordid>eNp9UMtOwzAQtBBIlMIPcLLE2eC1Wyc-olJaRCUucMXyK-DSJsFOVZWvxyU8bmgPuxrN7O4MQudAL4HS4ioBLQpGKEhCpeCciAM0gHGGSgB6mGdeAmEjEMfoJKUlpVQWkg3Q83xnYnDhQ3ehqfHNFG9D94rvydrrOuGqiTi1Xr_5iO1qkzofQ_2CQ_2LuqDjj7qpsImNdlanLuHab9MpOqr0Kvmz7z5ET7fTx8mcLB5md5PrBbEcZEeMYaV12oGUlpkRN74sHC_lOD8ps4HCUQZCckaFF7aSTlTcGGOzK2dGUPIhuuj3trF53_jUqWWziXU-qRgHVuYSRWaxnmVjk1L0lWpjWOu4U0DVPkfV56hyjuorRyWyiPei1O69-_i3-h_VJ5Yrdis</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2312828267</pqid></control><display><type>article</type><title>Hybridization DE with K-means for speaker clustering in speaker diarization of broadcasts news</title><source>SpringerLink Journals</source><creator>Karim, Dabbabi ; Salah, Hajji ; Adnen, Cherif</creator><creatorcontrib>Karim, Dabbabi ; Salah, Hajji ; Adnen, Cherif</creatorcontrib><description>In this paper, we address the problem of optimal non-hierarchical clustering in the speaker clustering phase for the speaker diarization task of news broadcasts. A new hybridization combining differential evolution (DE) algorithm and K-means algorithm is proposed and tested on TV news database (TVND). To optimize the classification of speakers, two criteria, namely trace within criterion (TRW) and variance ratio criterion (VRC), were used as clustering validity indices, correcting every possible grouping of speakers’ segments. Concerning the encoding of the classification of clusters to be optimized, it is performed by the cluster centers in DE algorithm. Therefore, a problem of rearrangement of centers in the populations can be generated, which cannot ensure an efficient search by applying evolutionary operators. For this purpose, an efficient heuristic was also proposed for this rearrangement. Non-hybrid DE variants were applied with and without the rearrangement of cluster centers, and compared with the corresponding hybrid K-means variants. The experimental results have showed the high-efficiency of hybrid K-means variants with the rearrangement of cluster centers compared with those without the rearrangement of cluster centers and non-hybrid DE variants. Also, the obtained results using hybrid and non-hybrid DE variants with the rearrangement of cluster centers were quite similar using both TWR and VRC criteria. Moreover, the best efficiency was acquired using hybrid DE variants thanks to these two criteria from which a value of 13.05% of DER has been reached by hybrid b6e6rl variant.</description><identifier>ISSN: 1381-2416</identifier><identifier>EISSN: 1572-8110</identifier><identifier>DOI: 10.1007/s10772-019-09633-6</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Algorithms ; Artificial Intelligence ; Classification ; Cluster analysis ; Clustering ; Criteria ; Engineering ; Evolutionary algorithms ; Evolutionary computation ; News ; Optimization ; Signal,Image and Speech Processing ; Social Sciences ; Speech</subject><ispartof>International journal of speech technology, 2019-12, Vol.22 (4), p.893-909</ispartof><rights>Springer Science+Business Media, LLC, part of Springer Nature 2019</rights><rights>Copyright Springer Nature B.V. 2019</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-bb28cdad199c2b43be87d389597991107d021693206e6cf9d6f3bbbc138db4183</citedby><cites>FETCH-LOGICAL-c319t-bb28cdad199c2b43be87d389597991107d021693206e6cf9d6f3bbbc138db4183</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s10772-019-09633-6$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s10772-019-09633-6$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Karim, Dabbabi</creatorcontrib><creatorcontrib>Salah, Hajji</creatorcontrib><creatorcontrib>Adnen, Cherif</creatorcontrib><title>Hybridization DE with K-means for speaker clustering in speaker diarization of broadcasts news</title><title>International journal of speech technology</title><addtitle>Int J Speech Technol</addtitle><description>In this paper, we address the problem of optimal non-hierarchical clustering in the speaker clustering phase for the speaker diarization task of news broadcasts. A new hybridization combining differential evolution (DE) algorithm and K-means algorithm is proposed and tested on TV news database (TVND). To optimize the classification of speakers, two criteria, namely trace within criterion (TRW) and variance ratio criterion (VRC), were used as clustering validity indices, correcting every possible grouping of speakers’ segments. Concerning the encoding of the classification of clusters to be optimized, it is performed by the cluster centers in DE algorithm. Therefore, a problem of rearrangement of centers in the populations can be generated, which cannot ensure an efficient search by applying evolutionary operators. For this purpose, an efficient heuristic was also proposed for this rearrangement. Non-hybrid DE variants were applied with and without the rearrangement of cluster centers, and compared with the corresponding hybrid K-means variants. The experimental results have showed the high-efficiency of hybrid K-means variants with the rearrangement of cluster centers compared with those without the rearrangement of cluster centers and non-hybrid DE variants. Also, the obtained results using hybrid and non-hybrid DE variants with the rearrangement of cluster centers were quite similar using both TWR and VRC criteria. Moreover, the best efficiency was acquired using hybrid DE variants thanks to these two criteria from which a value of 13.05% of DER has been reached by hybrid b6e6rl variant.</description><subject>Algorithms</subject><subject>Artificial Intelligence</subject><subject>Classification</subject><subject>Cluster analysis</subject><subject>Clustering</subject><subject>Criteria</subject><subject>Engineering</subject><subject>Evolutionary algorithms</subject><subject>Evolutionary computation</subject><subject>News</subject><subject>Optimization</subject><subject>Signal,Image and Speech Processing</subject><subject>Social Sciences</subject><subject>Speech</subject><issn>1381-2416</issn><issn>1572-8110</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNp9UMtOwzAQtBBIlMIPcLLE2eC1Wyc-olJaRCUucMXyK-DSJsFOVZWvxyU8bmgPuxrN7O4MQudAL4HS4ioBLQpGKEhCpeCciAM0gHGGSgB6mGdeAmEjEMfoJKUlpVQWkg3Q83xnYnDhQ3ehqfHNFG9D94rvydrrOuGqiTi1Xr_5iO1qkzofQ_2CQ_2LuqDjj7qpsImNdlanLuHab9MpOqr0Kvmz7z5ET7fTx8mcLB5md5PrBbEcZEeMYaV12oGUlpkRN74sHC_lOD8ps4HCUQZCckaFF7aSTlTcGGOzK2dGUPIhuuj3trF53_jUqWWziXU-qRgHVuYSRWaxnmVjk1L0lWpjWOu4U0DVPkfV56hyjuorRyWyiPei1O69-_i3-h_VJ5Yrdis</recordid><startdate>20191201</startdate><enddate>20191201</enddate><creator>Karim, Dabbabi</creator><creator>Salah, Hajji</creator><creator>Adnen, Cherif</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7T9</scope></search><sort><creationdate>20191201</creationdate><title>Hybridization DE with K-means for speaker clustering in speaker diarization of broadcasts news</title><author>Karim, Dabbabi ; Salah, Hajji ; Adnen, Cherif</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-bb28cdad199c2b43be87d389597991107d021693206e6cf9d6f3bbbc138db4183</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Algorithms</topic><topic>Artificial Intelligence</topic><topic>Classification</topic><topic>Cluster analysis</topic><topic>Clustering</topic><topic>Criteria</topic><topic>Engineering</topic><topic>Evolutionary algorithms</topic><topic>Evolutionary computation</topic><topic>News</topic><topic>Optimization</topic><topic>Signal,Image and Speech Processing</topic><topic>Social Sciences</topic><topic>Speech</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Karim, Dabbabi</creatorcontrib><creatorcontrib>Salah, Hajji</creatorcontrib><creatorcontrib>Adnen, Cherif</creatorcontrib><collection>CrossRef</collection><collection>Linguistics and Language Behavior Abstracts (LLBA)</collection><jtitle>International journal of speech technology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Karim, Dabbabi</au><au>Salah, Hajji</au><au>Adnen, Cherif</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Hybridization DE with K-means for speaker clustering in speaker diarization of broadcasts news</atitle><jtitle>International journal of speech technology</jtitle><stitle>Int J Speech Technol</stitle><date>2019-12-01</date><risdate>2019</risdate><volume>22</volume><issue>4</issue><spage>893</spage><epage>909</epage><pages>893-909</pages><issn>1381-2416</issn><eissn>1572-8110</eissn><abstract>In this paper, we address the problem of optimal non-hierarchical clustering in the speaker clustering phase for the speaker diarization task of news broadcasts. A new hybridization combining differential evolution (DE) algorithm and K-means algorithm is proposed and tested on TV news database (TVND). To optimize the classification of speakers, two criteria, namely trace within criterion (TRW) and variance ratio criterion (VRC), were used as clustering validity indices, correcting every possible grouping of speakers’ segments. Concerning the encoding of the classification of clusters to be optimized, it is performed by the cluster centers in DE algorithm. Therefore, a problem of rearrangement of centers in the populations can be generated, which cannot ensure an efficient search by applying evolutionary operators. For this purpose, an efficient heuristic was also proposed for this rearrangement. Non-hybrid DE variants were applied with and without the rearrangement of cluster centers, and compared with the corresponding hybrid K-means variants. The experimental results have showed the high-efficiency of hybrid K-means variants with the rearrangement of cluster centers compared with those without the rearrangement of cluster centers and non-hybrid DE variants. Also, the obtained results using hybrid and non-hybrid DE variants with the rearrangement of cluster centers were quite similar using both TWR and VRC criteria. Moreover, the best efficiency was acquired using hybrid DE variants thanks to these two criteria from which a value of 13.05% of DER has been reached by hybrid b6e6rl variant.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s10772-019-09633-6</doi><tpages>17</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1381-2416 |
ispartof | International journal of speech technology, 2019-12, Vol.22 (4), p.893-909 |
issn | 1381-2416 1572-8110 |
language | eng |
recordid | cdi_proquest_journals_2312828267 |
source | SpringerLink Journals |
subjects | Algorithms Artificial Intelligence Classification Cluster analysis Clustering Criteria Engineering Evolutionary algorithms Evolutionary computation News Optimization Signal,Image and Speech Processing Social Sciences Speech |
title | Hybridization DE with K-means for speaker clustering in speaker diarization of broadcasts news |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T09%3A46%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Hybridization%20DE%20with%20K-means%20for%20speaker%20clustering%20in%20speaker%20diarization%20of%20broadcasts%20news&rft.jtitle=International%20journal%20of%20speech%20technology&rft.au=Karim,%20Dabbabi&rft.date=2019-12-01&rft.volume=22&rft.issue=4&rft.spage=893&rft.epage=909&rft.pages=893-909&rft.issn=1381-2416&rft.eissn=1572-8110&rft_id=info:doi/10.1007/s10772-019-09633-6&rft_dat=%3Cproquest_cross%3E2312828267%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2312828267&rft_id=info:pmid/&rfr_iscdi=true |