Peer groups for organisational learning: clustering with practical constraints

Peer-grouping is used in many sectors for organisational learning, policy implementation, and benchmarking. Clustering provides a statistical, data-driven method for constructing meaningful peer groups, but peer groups must be compatible with business constraints such as size and stability considera...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	arXiv.org 2020-11
Hauptverfasser:	Kennedy, Daniel William, Cameron, Jessica, Paul Pao-Yen Wu, Mengersen, Kerrie
Format:	Artikel
Sprache:	eng
Schlagworte:	Business Case studies Clustering Constraints Decision making Goodness of fit Learning Organizational learning Stability Statistics - Applications
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Kennedy, Daniel William Cameron, Jessica Paul Pao-Yen Wu Mengersen, Kerrie
description	Peer-grouping is used in many sectors for organisational learning, policy implementation, and benchmarking. Clustering provides a statistical, data-driven method for constructing meaningful peer groups, but peer groups must be compatible with business constraints such as size and stability considerations. Additionally, statistical peer groups are constructed from many different variables, and can be difficult to understand, especially for non-statistical audiences. We developed methodology to apply business constraints to clustering solutions and allow the decision-maker to choose the balance between statistical goodness-of-fit and conformity to business constraints. Several tools were utilised to identify complex distinguishing features in peer groups, and a number of visualisations are developed to explain high-dimensional clusters for non-statistical audiences. In a case study where peer group size was required to be small ($\leq 100$ members), we applied constrained clustering to a noisy high-dimensional data-set over two subsequent years, ensuring that the clusters were sufficiently stable between years. Our approach not only satisfied clustering constraints on the test data, but maintained an almost monotonic negative relationship between goodness-of-fit and stability between subsequent years. We demonstrated in the context of the case study how distinguishing features between clusters can be communicated clearly to different stakeholders with substantial and limited statistical knowledge.
doi_str_mv	10.48550/arxiv.2011.08405
format	Article
fullrecord	<record><control><sourceid>proquest_arxiv</sourceid><recordid>TN_cdi_arxiv_primary_2011_08405</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2461807129</sourcerecordid><originalsourceid>FETCH-LOGICAL-a529-643481086c4273784dbb8ead633295c1999d137eb50674b0b1ab70bfbb747bc73</originalsourceid><addsrcrecordid>eNotz0tLxDAUhuEgCA7j_ABXBlx3PLk1qTsZvMGgLmZfkjStGWpTk9TLv7fOuDpn8fLBg9AFgTVXQsC1jt_-c02BkDUoDuIELShjpFCc0jO0SmkPALSUVAi2QM-vzkXcxTCNCbch4hA7Pfiksw-D7nHvdBz80N1g208puzj_-MvnNzxGbbO3c2PDkHLUfsjpHJ22uk9u9X-XaHd_t9s8FtuXh6fN7bbQglZFyRlXBFRpOZVMKt4Yo5xuSsZoJSypqqohTDojoJTcgCHaSDCtMZJLYyVbosvj7AFbj9G_6_hT_6HrA3ouro7FGMPH5FKu92GKsyjVlJdEgSS0Yr8R1VpW</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2461807129</pqid></control><display><type>article</type><title>Peer groups for organisational learning: clustering with practical constraints</title><source>arXiv.org</source><source>Free E- Journals</source><creator>Kennedy, Daniel William ; Cameron, Jessica ; Paul Pao-Yen Wu ; Mengersen, Kerrie</creator><creatorcontrib>Kennedy, Daniel William ; Cameron, Jessica ; Paul Pao-Yen Wu ; Mengersen, Kerrie</creatorcontrib><description>Peer-grouping is used in many sectors for organisational learning, policy implementation, and benchmarking. Clustering provides a statistical, data-driven method for constructing meaningful peer groups, but peer groups must be compatible with business constraints such as size and stability considerations. Additionally, statistical peer groups are constructed from many different variables, and can be difficult to understand, especially for non-statistical audiences. We developed methodology to apply business constraints to clustering solutions and allow the decision-maker to choose the balance between statistical goodness-of-fit and conformity to business constraints. Several tools were utilised to identify complex distinguishing features in peer groups, and a number of visualisations are developed to explain high-dimensional clusters for non-statistical audiences. In a case study where peer group size was required to be small ($\leq 100$ members), we applied constrained clustering to a noisy high-dimensional data-set over two subsequent years, ensuring that the clusters were sufficiently stable between years. Our approach not only satisfied clustering constraints on the test data, but maintained an almost monotonic negative relationship between goodness-of-fit and stability between subsequent years. We demonstrated in the context of the case study how distinguishing features between clusters can be communicated clearly to different stakeholders with substantial and limited statistical knowledge.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.2011.08405</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Business ; Case studies ; Clustering ; Constraints ; Decision making ; Goodness of fit ; Learning ; Organizational learning ; Stability ; Statistics - Applications</subject><ispartof>arXiv.org, 2020-11</ispartof><rights>2020. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,784,885,27925</link.rule.ids><backlink>$$Uhttps://doi.org/10.1371/journal.pone.0251723$$DView published paper (Access to full text may be restricted)$$Hfree_for_read</backlink><backlink>$$Uhttps://doi.org/10.48550/arXiv.2011.08405$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Kennedy, Daniel William</creatorcontrib><creatorcontrib>Cameron, Jessica</creatorcontrib><creatorcontrib>Paul Pao-Yen Wu</creatorcontrib><creatorcontrib>Mengersen, Kerrie</creatorcontrib><title>Peer groups for organisational learning: clustering with practical constraints</title><title>arXiv.org</title><description>Peer-grouping is used in many sectors for organisational learning, policy implementation, and benchmarking. Clustering provides a statistical, data-driven method for constructing meaningful peer groups, but peer groups must be compatible with business constraints such as size and stability considerations. Additionally, statistical peer groups are constructed from many different variables, and can be difficult to understand, especially for non-statistical audiences. We developed methodology to apply business constraints to clustering solutions and allow the decision-maker to choose the balance between statistical goodness-of-fit and conformity to business constraints. Several tools were utilised to identify complex distinguishing features in peer groups, and a number of visualisations are developed to explain high-dimensional clusters for non-statistical audiences. In a case study where peer group size was required to be small ($\leq 100$ members), we applied constrained clustering to a noisy high-dimensional data-set over two subsequent years, ensuring that the clusters were sufficiently stable between years. Our approach not only satisfied clustering constraints on the test data, but maintained an almost monotonic negative relationship between goodness-of-fit and stability between subsequent years. We demonstrated in the context of the case study how distinguishing features between clusters can be communicated clearly to different stakeholders with substantial and limited statistical knowledge.</description><subject>Business</subject><subject>Case studies</subject><subject>Clustering</subject><subject>Constraints</subject><subject>Decision making</subject><subject>Goodness of fit</subject><subject>Learning</subject><subject>Organizational learning</subject><subject>Stability</subject><subject>Statistics - Applications</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GOX</sourceid><recordid>eNotz0tLxDAUhuEgCA7j_ABXBlx3PLk1qTsZvMGgLmZfkjStGWpTk9TLv7fOuDpn8fLBg9AFgTVXQsC1jt_-c02BkDUoDuIELShjpFCc0jO0SmkPALSUVAi2QM-vzkXcxTCNCbch4hA7Pfiksw-D7nHvdBz80N1g208puzj_-MvnNzxGbbO3c2PDkHLUfsjpHJ22uk9u9X-XaHd_t9s8FtuXh6fN7bbQglZFyRlXBFRpOZVMKt4Yo5xuSsZoJSypqqohTDojoJTcgCHaSDCtMZJLYyVbosvj7AFbj9G_6_hT_6HrA3ouro7FGMPH5FKu92GKsyjVlJdEgSS0Yr8R1VpW</recordid><startdate>20201117</startdate><enddate>20201117</enddate><creator>Kennedy, Daniel William</creator><creator>Cameron, Jessica</creator><creator>Paul Pao-Yen Wu</creator><creator>Mengersen, Kerrie</creator><general>Cornell University Library, arXiv.org</general><scope>7X5</scope><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>K6~</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>EPD</scope><scope>GOX</scope></search><sort><creationdate>20201117</creationdate><title>Peer groups for organisational learning: clustering with practical constraints</title><author>Kennedy, Daniel William ; Cameron, Jessica ; Paul Pao-Yen Wu ; Mengersen, Kerrie</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a529-643481086c4273784dbb8ead633295c1999d137eb50674b0b1ab70bfbb747bc73</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Business</topic><topic>Case studies</topic><topic>Clustering</topic><topic>Constraints</topic><topic>Decision making</topic><topic>Goodness of fit</topic><topic>Learning</topic><topic>Organizational learning</topic><topic>Stability</topic><topic>Statistics - Applications</topic><toplevel>online_resources</toplevel><creatorcontrib>Kennedy, Daniel William</creatorcontrib><creatorcontrib>Cameron, Jessica</creatorcontrib><creatorcontrib>Paul Pao-Yen Wu</creatorcontrib><creatorcontrib>Mengersen, Kerrie</creatorcontrib><collection>Entrepreneurship Database (ProQuest)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Business Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Access via ProQuest (Open Access)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><collection>arXiv Statistics</collection><collection>arXiv.org</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kennedy, Daniel William</au><au>Cameron, Jessica</au><au>Paul Pao-Yen Wu</au><au>Mengersen, Kerrie</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Peer groups for organisational learning: clustering with practical constraints</atitle><jtitle>arXiv.org</jtitle><date>2020-11-17</date><risdate>2020</risdate><eissn>2331-8422</eissn><abstract>Peer-grouping is used in many sectors for organisational learning, policy implementation, and benchmarking. Clustering provides a statistical, data-driven method for constructing meaningful peer groups, but peer groups must be compatible with business constraints such as size and stability considerations. Additionally, statistical peer groups are constructed from many different variables, and can be difficult to understand, especially for non-statistical audiences. We developed methodology to apply business constraints to clustering solutions and allow the decision-maker to choose the balance between statistical goodness-of-fit and conformity to business constraints. Several tools were utilised to identify complex distinguishing features in peer groups, and a number of visualisations are developed to explain high-dimensional clusters for non-statistical audiences. In a case study where peer group size was required to be small ($\leq 100$ members), we applied constrained clustering to a noisy high-dimensional data-set over two subsequent years, ensuring that the clusters were sufficiently stable between years. Our approach not only satisfied clustering constraints on the test data, but maintained an almost monotonic negative relationship between goodness-of-fit and stability between subsequent years. We demonstrated in the context of the case study how distinguishing features between clusters can be communicated clearly to different stakeholders with substantial and limited statistical knowledge.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.2011.08405</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2020-11
issn	2331-8422
language	eng
recordid	cdi_arxiv_primary_2011_08405
source	arXiv.org; Free E- Journals
subjects	Business Case studies Clustering Constraints Decision making Goodness of fit Learning Organizational learning Stability Statistics - Applications
title	Peer groups for organisational learning: clustering with practical constraints
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-25T01%3A48%3A11IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_arxiv&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Peer%20groups%20for%20organisational%20learning:%20clustering%20with%20practical%20constraints&rft.jtitle=arXiv.org&rft.au=Kennedy,%20Daniel%20William&rft.date=2020-11-17&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.2011.08405&rft_dat=%3Cproquest_arxiv%3E2461807129%3C/proquest_arxiv%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2461807129&rft_id=info:pmid/&rfr_iscdi=true