Comparing Two Partitions of Non-Equal Sets of Units

Rand (1971) proposed what has since become a well-known index for comparing two partitions obtained on the same set of units. The index takes a value on the interval between 0 and 1, where a higher value indicates more similar partitions. Sometimes, e.g. when the units are observed in two time perio...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Metodološki zvezki (Spletna izd.) 2018-01, Vol.15 (1), p.1-21
Hauptverfasser: Cugmas, Marjan, Ferligoj, Anuška
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 21
container_issue 1
container_start_page 1
container_title Metodološki zvezki (Spletna izd.)
container_volume 15
creator Cugmas, Marjan
Ferligoj, Anuška
description Rand (1971) proposed what has since become a well-known index for comparing two partitions obtained on the same set of units. The index takes a value on the interval between 0 and 1, where a higher value indicates more similar partitions. Sometimes, e.g. when the units are observed in two time periods, the splitting and merging of clusters should be considered differently, according to the operationalization of the stability of clusters. The Rand Index is symmetric in the sense that both the splitting and merging of clusters lower the value of the index. In such a nonsymmetric case, one of the Wallace indexes (Wallace, 1983) can be used. Further, there are several cases when one wants to compare two partitions obtained on different sets of units, where the intersection of these sets of units is a non-empty set of units. In this instance, the new units and units which leave the clusters from the first partition can be considered as a factor lowering the value of the index. Therefore, a modified Rand index is presented. Because the splitting and merging of clusters have to be considered differently in some situations, an asymmetric modified Wallace Index is also proposed. For all presented indices, the correction for chance is described, which allows different values of a selected index to be compared.
format Article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2117823504</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2117823504</sourcerecordid><originalsourceid>FETCH-LOGICAL-p98t-dca1da8859f0ad3d286dac12562de6da6d51bbf12979ee051b03be3e32238c0f3</originalsourceid><addsrcrecordid>eNo9jU1LxDAURYMoOIzzHwKuAy_vNZ1kKWX8gEEF63pIm0Q6jEmnSfHvW1Rc3XPP4t4LtpJaVQKA5OU_I12zTc5HWNAQVNqsGDXpc7TTED94-5X4q53KUIYUM0-BP6codufZnvibLz_mPQ4l37CrYE_Zb_5yzdr7Xds8iv3Lw1Nztxej0UW43kpntVYmgHXkUNfO9hJVjc4vWDsluy5INFvjPSwFqPPkCZF0D4HW7PZ3dpzSefa5HI5pnuLyeEAptxpJQUXf6a9Bug</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2117823504</pqid></control><display><type>article</type><title>Comparing Two Partitions of Non-Equal Sets of Units</title><source>Alma/SFX Local Collection</source><creator>Cugmas, Marjan ; Ferligoj, Anuška</creator><creatorcontrib>Cugmas, Marjan ; Ferligoj, Anuška</creatorcontrib><description>Rand (1971) proposed what has since become a well-known index for comparing two partitions obtained on the same set of units. The index takes a value on the interval between 0 and 1, where a higher value indicates more similar partitions. Sometimes, e.g. when the units are observed in two time periods, the splitting and merging of clusters should be considered differently, according to the operationalization of the stability of clusters. The Rand Index is symmetric in the sense that both the splitting and merging of clusters lower the value of the index. In such a nonsymmetric case, one of the Wallace indexes (Wallace, 1983) can be used. Further, there are several cases when one wants to compare two partitions obtained on different sets of units, where the intersection of these sets of units is a non-empty set of units. In this instance, the new units and units which leave the clusters from the first partition can be considered as a factor lowering the value of the index. Therefore, a modified Rand index is presented. Because the splitting and merging of clusters have to be considered differently in some situations, an asymmetric modified Wallace Index is also proposed. For all presented indices, the correction for chance is described, which allows different values of a selected index to be compared.</description><identifier>ISSN: 1854-0023</identifier><identifier>EISSN: 1854-0031</identifier><language>eng</language><publisher>Ljubljana: Anuska Ferligoj</publisher><subject>Classification ; Cluster analysis ; Clustering ; Confidence intervals ; Expected values ; Methods ; Statistical analysis ; Statistics</subject><ispartof>Metodološki zvezki (Spletna izd.), 2018-01, Vol.15 (1), p.1-21</ispartof><rights>Copyright Anuska Ferligoj 2018</rights><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784</link.rule.ids></links><search><creatorcontrib>Cugmas, Marjan</creatorcontrib><creatorcontrib>Ferligoj, Anuška</creatorcontrib><title>Comparing Two Partitions of Non-Equal Sets of Units</title><title>Metodološki zvezki (Spletna izd.)</title><description>Rand (1971) proposed what has since become a well-known index for comparing two partitions obtained on the same set of units. The index takes a value on the interval between 0 and 1, where a higher value indicates more similar partitions. Sometimes, e.g. when the units are observed in two time periods, the splitting and merging of clusters should be considered differently, according to the operationalization of the stability of clusters. The Rand Index is symmetric in the sense that both the splitting and merging of clusters lower the value of the index. In such a nonsymmetric case, one of the Wallace indexes (Wallace, 1983) can be used. Further, there are several cases when one wants to compare two partitions obtained on different sets of units, where the intersection of these sets of units is a non-empty set of units. In this instance, the new units and units which leave the clusters from the first partition can be considered as a factor lowering the value of the index. Therefore, a modified Rand index is presented. Because the splitting and merging of clusters have to be considered differently in some situations, an asymmetric modified Wallace Index is also proposed. For all presented indices, the correction for chance is described, which allows different values of a selected index to be compared.</description><subject>Classification</subject><subject>Cluster analysis</subject><subject>Clustering</subject><subject>Confidence intervals</subject><subject>Expected values</subject><subject>Methods</subject><subject>Statistical analysis</subject><subject>Statistics</subject><issn>1854-0023</issn><issn>1854-0031</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>8G5</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>GUQSH</sourceid><sourceid>M2O</sourceid><recordid>eNo9jU1LxDAURYMoOIzzHwKuAy_vNZ1kKWX8gEEF63pIm0Q6jEmnSfHvW1Rc3XPP4t4LtpJaVQKA5OU_I12zTc5HWNAQVNqsGDXpc7TTED94-5X4q53KUIYUM0-BP6codufZnvibLz_mPQ4l37CrYE_Zb_5yzdr7Xds8iv3Lw1Nztxej0UW43kpntVYmgHXkUNfO9hJVjc4vWDsluy5INFvjPSwFqPPkCZF0D4HW7PZ3dpzSefa5HI5pnuLyeEAptxpJQUXf6a9Bug</recordid><startdate>20180101</startdate><enddate>20180101</enddate><creator>Cugmas, Marjan</creator><creator>Ferligoj, Anuška</creator><general>Anuska Ferligoj</general><scope>3V.</scope><scope>7XB</scope><scope>8FK</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BYOGL</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>M2O</scope><scope>MBDVC</scope><scope>PADUT</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope></search><sort><creationdate>20180101</creationdate><title>Comparing Two Partitions of Non-Equal Sets of Units</title><author>Cugmas, Marjan ; Ferligoj, Anuška</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p98t-dca1da8859f0ad3d286dac12562de6da6d51bbf12979ee051b03be3e32238c0f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Classification</topic><topic>Cluster analysis</topic><topic>Clustering</topic><topic>Confidence intervals</topic><topic>Expected values</topic><topic>Methods</topic><topic>Statistical analysis</topic><topic>Statistics</topic><toplevel>online_resources</toplevel><creatorcontrib>Cugmas, Marjan</creatorcontrib><creatorcontrib>Ferligoj, Anuška</creatorcontrib><collection>ProQuest Central (Corporate)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>East Europe, Central Europe Database</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>Research Library</collection><collection>Research Library (Corporate)</collection><collection>Research Library China</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><jtitle>Metodološki zvezki (Spletna izd.)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Cugmas, Marjan</au><au>Ferligoj, Anuška</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Comparing Two Partitions of Non-Equal Sets of Units</atitle><jtitle>Metodološki zvezki (Spletna izd.)</jtitle><date>2018-01-01</date><risdate>2018</risdate><volume>15</volume><issue>1</issue><spage>1</spage><epage>21</epage><pages>1-21</pages><issn>1854-0023</issn><eissn>1854-0031</eissn><abstract>Rand (1971) proposed what has since become a well-known index for comparing two partitions obtained on the same set of units. The index takes a value on the interval between 0 and 1, where a higher value indicates more similar partitions. Sometimes, e.g. when the units are observed in two time periods, the splitting and merging of clusters should be considered differently, according to the operationalization of the stability of clusters. The Rand Index is symmetric in the sense that both the splitting and merging of clusters lower the value of the index. In such a nonsymmetric case, one of the Wallace indexes (Wallace, 1983) can be used. Further, there are several cases when one wants to compare two partitions obtained on different sets of units, where the intersection of these sets of units is a non-empty set of units. In this instance, the new units and units which leave the clusters from the first partition can be considered as a factor lowering the value of the index. Therefore, a modified Rand index is presented. Because the splitting and merging of clusters have to be considered differently in some situations, an asymmetric modified Wallace Index is also proposed. For all presented indices, the correction for chance is described, which allows different values of a selected index to be compared.</abstract><cop>Ljubljana</cop><pub>Anuska Ferligoj</pub><tpages>21</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1854-0023
ispartof Metodološki zvezki (Spletna izd.), 2018-01, Vol.15 (1), p.1-21
issn 1854-0023
1854-0031
language eng
recordid cdi_proquest_journals_2117823504
source Alma/SFX Local Collection
subjects Classification
Cluster analysis
Clustering
Confidence intervals
Expected values
Methods
Statistical analysis
Statistics
title Comparing Two Partitions of Non-Equal Sets of Units
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-14T07%3A14%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Comparing%20Two%20Partitions%20of%20Non-Equal%20Sets%20of%20Units&rft.jtitle=Metodolo%C5%A1ki%20zvezki%20(Spletna%20izd.)&rft.au=Cugmas,%20Marjan&rft.date=2018-01-01&rft.volume=15&rft.issue=1&rft.spage=1&rft.epage=21&rft.pages=1-21&rft.issn=1854-0023&rft.eissn=1854-0031&rft_id=info:doi/&rft_dat=%3Cproquest%3E2117823504%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2117823504&rft_id=info:pmid/&rfr_iscdi=true