Correspondence measures for assessing replication success

Given recent evidence challenging the replicability of results in the social and behavioral sciences, critical questions have been raised about appropriate measures for determining replication success in comparing effect estimates across studies. At issue is the fact that conclusions about replicati...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Psychological methods 2023-07
Hauptverfasser:	Steiner, Peter M, Sheehan, Patrick, Wong, Vivian C
Format:	Artikel
Sprache:	eng
Schlagworte:	Causal Analysis Experimental Replication Human Probability
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title	Psychological methods
container_volume
creator	Steiner, Peter M Sheehan, Patrick Wong, Vivian C
description	Given recent evidence challenging the replicability of results in the social and behavioral sciences, critical questions have been raised about appropriate measures for determining replication success in comparing effect estimates across studies. At issue is the fact that conclusions about replication success often depend on the measure used for evaluating correspondence in results. Despite the importance of choosing an appropriate measure, there is still no widespread agreement about which measures should be used. This article addresses these questions by describing formally the most commonly used measures for assessing replication success, and by comparing their performance in different contexts according to their replication probabilities-that is, the probability of obtaining replication success given study-specific settings. The measures may be characterized broadly as conclusion-based approaches, which assess the congruence of two independent studies' conclusions about the presence of an effect, and distance-based approaches, which test for a significant difference or equivalence of two effect estimates. We also introduce a new measure for assessing replication success called the correspondence test, which combines a difference and equivalence test in the same framework. To help researchers plan prospective replication efforts, we provide closed formulas for power calculations that can be used to determine the minimum detectable effect size (and thus, sample sizes) for each study so that a predetermined minimum replication probability can be achieved. Finally, we use a replication data set from the Open Science Collaboration (2015) to demonstrate the extent to which conclusions about replication success depend on the correspondence measure selected. (PsycInfo Database Record (c) 2023 APA, all rights reserved).
doi_str_mv	10.1037/met0000597
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2854436390</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2854436390</sourcerecordid><originalsourceid>FETCH-LOGICAL-c245t-6b099d2fc9fe5fe43a8c3c1cb8e914c6c9c0d37246cbb9b6353761b0582ae6e53</originalsourceid><addsrcrecordid>eNpdkE1LxDAQhoMo7rp68QdIwYsI1aT5aOYoxS9Y8KLgraTpVLq0TU3ag__eLLsqOJcZXh5ehoeQc0ZvGOX5bY8TjSMhPyBLBhxSJhQ_jDfVWQoa3hfkJIQNpUxwLY7JgucCtAK-JFA47zGMbqhxsJj0aMIcg6RxPjEhYAjt8JF4HLvWmql1QxJma2N8So4a0wU82-8VeXu4fy2e0vXL43Nxt05tJuSUqooC1FljoUHZoOBGW26ZrTQCE1ZZsLTmeSaUrSqoFJc8V6yiUmcGFUq-Ile73tG7zxnDVPZtsNh1ZkA3hzLTUgiuONCIXv5DN272Q_wuUiKTAAq2hdc7ynoXgsemHH3bG_9VMlpuhZZ_QiN8sa-cqx7rX_THIP8GVxRwlg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2842599695</pqid></control><display><type>article</type><title>Correspondence measures for assessing replication success</title><source>EBSCOhost APA PsycARTICLES</source><creator>Steiner, Peter M ; Sheehan, Patrick ; Wong, Vivian C</creator><creatorcontrib>Steiner, Peter M ; Sheehan, Patrick ; Wong, Vivian C</creatorcontrib><description>Given recent evidence challenging the replicability of results in the social and behavioral sciences, critical questions have been raised about appropriate measures for determining replication success in comparing effect estimates across studies. At issue is the fact that conclusions about replication success often depend on the measure used for evaluating correspondence in results. Despite the importance of choosing an appropriate measure, there is still no widespread agreement about which measures should be used. This article addresses these questions by describing formally the most commonly used measures for assessing replication success, and by comparing their performance in different contexts according to their replication probabilities-that is, the probability of obtaining replication success given study-specific settings. The measures may be characterized broadly as conclusion-based approaches, which assess the congruence of two independent studies' conclusions about the presence of an effect, and distance-based approaches, which test for a significant difference or equivalence of two effect estimates. We also introduce a new measure for assessing replication success called the correspondence test, which combines a difference and equivalence test in the same framework. To help researchers plan prospective replication efforts, we provide closed formulas for power calculations that can be used to determine the minimum detectable effect size (and thus, sample sizes) for each study so that a predetermined minimum replication probability can be achieved. Finally, we use a replication data set from the Open Science Collaboration (2015) to demonstrate the extent to which conclusions about replication success depend on the correspondence measure selected. (PsycInfo Database Record (c) 2023 APA, all rights reserved).</description><identifier>ISSN: 1082-989X</identifier><identifier>EISSN: 1939-1463</identifier><identifier>DOI: 10.1037/met0000597</identifier><identifier>PMID: 37498693</identifier><language>eng</language><publisher>United States: American Psychological Association</publisher><subject>Causal Analysis ; Experimental Replication ; Human ; Probability</subject><ispartof>Psychological methods, 2023-07</ispartof><rights>2023, American Psychological Association</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c245t-6b099d2fc9fe5fe43a8c3c1cb8e914c6c9c0d37246cbb9b6353761b0582ae6e53</citedby><orcidid>0000-0001-7063-6897</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27923,27924</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/37498693$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Steiner, Peter M</creatorcontrib><creatorcontrib>Sheehan, Patrick</creatorcontrib><creatorcontrib>Wong, Vivian C</creatorcontrib><title>Correspondence measures for assessing replication success</title><title>Psychological methods</title><addtitle>Psychol Methods</addtitle><description>Given recent evidence challenging the replicability of results in the social and behavioral sciences, critical questions have been raised about appropriate measures for determining replication success in comparing effect estimates across studies. At issue is the fact that conclusions about replication success often depend on the measure used for evaluating correspondence in results. Despite the importance of choosing an appropriate measure, there is still no widespread agreement about which measures should be used. This article addresses these questions by describing formally the most commonly used measures for assessing replication success, and by comparing their performance in different contexts according to their replication probabilities-that is, the probability of obtaining replication success given study-specific settings. The measures may be characterized broadly as conclusion-based approaches, which assess the congruence of two independent studies' conclusions about the presence of an effect, and distance-based approaches, which test for a significant difference or equivalence of two effect estimates. We also introduce a new measure for assessing replication success called the correspondence test, which combines a difference and equivalence test in the same framework. To help researchers plan prospective replication efforts, we provide closed formulas for power calculations that can be used to determine the minimum detectable effect size (and thus, sample sizes) for each study so that a predetermined minimum replication probability can be achieved. Finally, we use a replication data set from the Open Science Collaboration (2015) to demonstrate the extent to which conclusions about replication success depend on the correspondence measure selected. (PsycInfo Database Record (c) 2023 APA, all rights reserved).</description><subject>Causal Analysis</subject><subject>Experimental Replication</subject><subject>Human</subject><subject>Probability</subject><issn>1082-989X</issn><issn>1939-1463</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNpdkE1LxDAQhoMo7rp68QdIwYsI1aT5aOYoxS9Y8KLgraTpVLq0TU3ag__eLLsqOJcZXh5ehoeQc0ZvGOX5bY8TjSMhPyBLBhxSJhQ_jDfVWQoa3hfkJIQNpUxwLY7JgucCtAK-JFA47zGMbqhxsJj0aMIcg6RxPjEhYAjt8JF4HLvWmql1QxJma2N8So4a0wU82-8VeXu4fy2e0vXL43Nxt05tJuSUqooC1FljoUHZoOBGW26ZrTQCE1ZZsLTmeSaUrSqoFJc8V6yiUmcGFUq-Ile73tG7zxnDVPZtsNh1ZkA3hzLTUgiuONCIXv5DN272Q_wuUiKTAAq2hdc7ynoXgsemHH3bG_9VMlpuhZZ_QiN8sa-cqx7rX_THIP8GVxRwlg</recordid><startdate>20230727</startdate><enddate>20230727</enddate><creator>Steiner, Peter M</creator><creator>Sheehan, Patrick</creator><creator>Wong, Vivian C</creator><general>American Psychological Association</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7RZ</scope><scope>PSYQQ</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0001-7063-6897</orcidid></search><sort><creationdate>20230727</creationdate><title>Correspondence measures for assessing replication success</title><author>Steiner, Peter M ; Sheehan, Patrick ; Wong, Vivian C</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c245t-6b099d2fc9fe5fe43a8c3c1cb8e914c6c9c0d37246cbb9b6353761b0582ae6e53</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Causal Analysis</topic><topic>Experimental Replication</topic><topic>Human</topic><topic>Probability</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Steiner, Peter M</creatorcontrib><creatorcontrib>Sheehan, Patrick</creatorcontrib><creatorcontrib>Wong, Vivian C</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>APA PsycArticles®</collection><collection>ProQuest One Psychology</collection><collection>MEDLINE - Academic</collection><jtitle>Psychological methods</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Steiner, Peter M</au><au>Sheehan, Patrick</au><au>Wong, Vivian C</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Correspondence measures for assessing replication success</atitle><jtitle>Psychological methods</jtitle><addtitle>Psychol Methods</addtitle><date>2023-07-27</date><risdate>2023</risdate><issn>1082-989X</issn><eissn>1939-1463</eissn><abstract>Given recent evidence challenging the replicability of results in the social and behavioral sciences, critical questions have been raised about appropriate measures for determining replication success in comparing effect estimates across studies. At issue is the fact that conclusions about replication success often depend on the measure used for evaluating correspondence in results. Despite the importance of choosing an appropriate measure, there is still no widespread agreement about which measures should be used. This article addresses these questions by describing formally the most commonly used measures for assessing replication success, and by comparing their performance in different contexts according to their replication probabilities-that is, the probability of obtaining replication success given study-specific settings. The measures may be characterized broadly as conclusion-based approaches, which assess the congruence of two independent studies' conclusions about the presence of an effect, and distance-based approaches, which test for a significant difference or equivalence of two effect estimates. We also introduce a new measure for assessing replication success called the correspondence test, which combines a difference and equivalence test in the same framework. To help researchers plan prospective replication efforts, we provide closed formulas for power calculations that can be used to determine the minimum detectable effect size (and thus, sample sizes) for each study so that a predetermined minimum replication probability can be achieved. Finally, we use a replication data set from the Open Science Collaboration (2015) to demonstrate the extent to which conclusions about replication success depend on the correspondence measure selected. (PsycInfo Database Record (c) 2023 APA, all rights reserved).</abstract><cop>United States</cop><pub>American Psychological Association</pub><pmid>37498693</pmid><doi>10.1037/met0000597</doi><orcidid>https://orcid.org/0000-0001-7063-6897</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 1082-989X
ispartof	Psychological methods, 2023-07
issn	1082-989X 1939-1463
language	eng
recordid	cdi_proquest_miscellaneous_2854436390
source	EBSCOhost APA PsycARTICLES
subjects	Causal Analysis Experimental Replication Human Probability
title	Correspondence measures for assessing replication success
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-11T16%3A02%3A52IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Correspondence%20measures%20for%20assessing%20replication%20success&rft.jtitle=Psychological%20methods&rft.au=Steiner,%20Peter%20M&rft.date=2023-07-27&rft.issn=1082-989X&rft.eissn=1939-1463&rft_id=info:doi/10.1037/met0000597&rft_dat=%3Cproquest_cross%3E2854436390%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2842599695&rft_id=info:pmid/37498693&rfr_iscdi=true