Comparing Two Partitions of Non-Equal Sets of Units
Rand (1971) proposed what has since become a well-known index for comparing two partitions obtained on the same set of units. The index takes a value on the interval between 0 and 1, where a higher value indicates more similar partitions. Sometimes, e.g. when the units are observed in two time perio...
Published in: | Metodološki zvezki (Spletna izd.) 2018-01, Vol.15 (1), p.1-21 |
---|---|
Main authors: | Cugmas, Marjan ; Ferligoj, Anuška |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
container_end_page | 21 |
---|---|
container_issue | 1 |
container_start_page | 1 |
container_title | Metodološki zvezki (Spletna izd.) |
container_volume | 15 |
creator | Cugmas, Marjan ; Ferligoj, Anuška |
description | Rand (1971) proposed what has since become a well-known index for comparing two partitions obtained on the same set of units. The index takes a value on the interval between 0 and 1, where a higher value indicates more similar partitions. Sometimes, e.g. when the units are observed in two time periods, the splitting and merging of clusters should be considered differently, according to the operationalization of the stability of clusters. The Rand Index is symmetric in the sense that both the splitting and merging of clusters lower the value of the index. In such a nonsymmetric case, one of the Wallace indexes (Wallace, 1983) can be used. Further, there are several cases when one wants to compare two partitions obtained on different sets of units, where the intersection of these sets of units is a non-empty set of units. In this instance, the new units and units which leave the clusters from the first partition can be considered as a factor lowering the value of the index. Therefore, a modified Rand index is presented. Because the splitting and merging of clusters have to be considered differently in some situations, an asymmetric modified Wallace Index is also proposed. For all presented indices, the correction for chance is described, which allows different values of a selected index to be compared. |
format | Article |
publisher | Ljubljana: Anuska Ferligoj |
rights | Copyright Anuska Ferligoj 2018 |
fulltext | fulltext |
identifier | ISSN: 1854-0023 |
ispartof | Metodološki zvezki (Spletna izd.), 2018-01, Vol.15 (1), p.1-21 |
issn | 1854-0023 1854-0031 |
language | eng |
recordid | cdi_proquest_journals_2117823504 |
source | Alma/SFX Local Collection |
subjects | Classification ; Cluster analysis ; Clustering ; Confidence intervals ; Expected values ; Methods ; Statistical analysis ; Statistics |
title | Comparing Two Partitions of Non-Equal Sets of Units |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-14T07%3A14%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Comparing%20Two%20Partitions%20of%20Non-Equal%20Sets%20of%20Units&rft.jtitle=Metodolo%C5%A1ki%20zvezki%20(Spletna%20izd.)&rft.au=Cugmas,%20Marjan&rft.date=2018-01-01&rft.volume=15&rft.issue=1&rft.spage=1&rft.epage=21&rft.pages=1-21&rft.issn=1854-0023&rft.eissn=1854-0031&rft_id=info:doi/&rft_dat=%3Cproquest%3E2117823504%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2117823504&rft_id=info:pmid/&rfr_iscdi=true |
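
The description above contrasts the symmetric Rand index with the asymmetric Wallace indices. A minimal sketch of the classical definitions may make that contrast concrete. It assumes both partitions label the same units in the same order, and it covers only the original Rand (1971) and Wallace (1983) indices, not the modified indices or the correction for chance proposed in the article. The function names `pair_counts`, `rand_index` and `wallace_indices` are hypothetical, chosen for this illustration.

```python
from itertools import combinations


def pair_counts(u, v):
    """Classify every pair of units by co-membership in partitions u and v.

    Returns (both, only_u, only_v, neither): pairs placed together in both
    partitions, only in u, only in v, and in neither.
    """
    if len(u) != len(v):
        raise ValueError("both partitions must label the same units")
    both = only_u = only_v = neither = 0
    for i, j in combinations(range(len(u)), 2):
        same_u = u[i] == u[j]
        same_v = v[i] == v[j]
        if same_u and same_v:
            both += 1
        elif same_u:
            only_u += 1
        elif same_v:
            only_v += 1
        else:
            neither += 1
    return both, only_u, only_v, neither


def rand_index(u, v):
    """Share of unit pairs on which the two partitions agree."""
    both, only_u, only_v, neither = pair_counts(u, v)
    total = both + only_u + only_v + neither
    return (both + neither) / total if total else float("nan")


def wallace_indices(u, v):
    """Return (W1, W2).

    W1: share of pairs together in u that are also together in v.
    W2: share of pairs together in v that are also together in u.
    """
    both, only_u, only_v, _ = pair_counts(u, v)
    w1 = both / (both + only_u) if both + only_u else float("nan")
    w2 = both / (both + only_v) if both + only_v else float("nan")
    return w1, w2


# Illustrative data (an assumption, not from the article): the first cluster
# of `first` splits into two clusters in `second`.
first = [0, 0, 0, 0, 1, 1]
second = [0, 0, 1, 1, 2, 2]
ri = rand_index(first, second)
w1, w2 = wallace_indices(first, second)
```

In this example the split lowers the Rand index and W1 but leaves W2 at 1, which is the asymmetry the description refers to: one Wallace index penalizes splitting, the other penalizes merging.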