Correspondence measures for assessing replication success
Given recent evidence challenging the replicability of results in the social and behavioral sciences, critical questions have been raised about appropriate measures for determining replication success in comparing effect estimates across studies. At issue is the fact that conclusions about replicati...
Gespeichert in:
Veröffentlicht in: | Psychological methods 2023-07 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | Psychological methods |
container_volume | |
creator | Steiner, Peter M Sheehan, Patrick Wong, Vivian C |
description | Given recent evidence challenging the replicability of results in the social and behavioral sciences, critical questions have been raised about appropriate measures for determining replication success in comparing effect estimates across studies. At issue is the fact that conclusions about replication success often depend on the measure used for evaluating correspondence in results. Despite the importance of choosing an appropriate measure, there is still no widespread agreement about which measures should be used. This article addresses these questions by describing formally the most commonly used measures for assessing replication success, and by comparing their performance in different contexts according to their replication probabilities-that is, the probability of obtaining replication success given study-specific settings. The measures may be characterized broadly as conclusion-based approaches, which assess the congruence of two independent studies' conclusions about the presence of an effect, and distance-based approaches, which test for a significant difference or equivalence of two effect estimates. We also introduce a new measure for assessing replication success called the correspondence test, which combines a difference and equivalence test in the same framework. To help researchers plan prospective replication efforts, we provide closed formulas for power calculations that can be used to determine the minimum detectable effect size (and thus, sample sizes) for each study so that a predetermined minimum replication probability can be achieved. Finally, we use a replication data set from the Open Science Collaboration (2015) to demonstrate the extent to which conclusions about replication success depend on the correspondence measure selected. (PsycInfo Database Record (c) 2023 APA, all rights reserved). |
doi_str_mv | 10.1037/met0000597 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2854436390</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2854436390</sourcerecordid><originalsourceid>FETCH-LOGICAL-c245t-6b099d2fc9fe5fe43a8c3c1cb8e914c6c9c0d37246cbb9b6353761b0582ae6e53</originalsourceid><addsrcrecordid>eNpdkE1LxDAQhoMo7rp68QdIwYsI1aT5aOYoxS9Y8KLgraTpVLq0TU3ag__eLLsqOJcZXh5ehoeQc0ZvGOX5bY8TjSMhPyBLBhxSJhQ_jDfVWQoa3hfkJIQNpUxwLY7JgucCtAK-JFA47zGMbqhxsJj0aMIcg6RxPjEhYAjt8JF4HLvWmql1QxJma2N8So4a0wU82-8VeXu4fy2e0vXL43Nxt05tJuSUqooC1FljoUHZoOBGW26ZrTQCE1ZZsLTmeSaUrSqoFJc8V6yiUmcGFUq-Ile73tG7zxnDVPZtsNh1ZkA3hzLTUgiuONCIXv5DN272Q_wuUiKTAAq2hdc7ynoXgsemHH3bG_9VMlpuhZZ_QiN8sa-cqx7rX_THIP8GVxRwlg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2842599695</pqid></control><display><type>article</type><title>Correspondence measures for assessing replication success</title><source>EBSCOhost APA PsycARTICLES</source><creator>Steiner, Peter M ; Sheehan, Patrick ; Wong, Vivian C</creator><creatorcontrib>Steiner, Peter M ; Sheehan, Patrick ; Wong, Vivian C</creatorcontrib><description>Given recent evidence challenging the replicability of results in the social and behavioral sciences, critical questions have been raised about appropriate measures for determining replication success in comparing effect estimates across studies. At issue is the fact that conclusions about replication success often depend on the measure used for evaluating correspondence in results. Despite the importance of choosing an appropriate measure, there is still no widespread agreement about which measures should be used. This article addresses these questions by describing formally the most commonly used measures for assessing replication success, and by comparing their performance in different contexts according to their replication probabilities-that is, the probability of obtaining replication success given study-specific settings. The measures may be characterized broadly as conclusion-based approaches, which assess the congruence of two independent studies' conclusions about the presence of an effect, and distance-based approaches, which test for a significant difference or equivalence of two effect estimates. We also introduce a new measure for assessing replication success called the correspondence test, which combines a difference and equivalence test in the same framework. To help researchers plan prospective replication efforts, we provide closed formulas for power calculations that can be used to determine the minimum detectable effect size (and thus, sample sizes) for each study so that a predetermined minimum replication probability can be achieved. Finally, we use a replication data set from the Open Science Collaboration (2015) to demonstrate the extent to which conclusions about replication success depend on the correspondence measure selected. (PsycInfo Database Record (c) 2023 APA, all rights reserved).</description><identifier>ISSN: 1082-989X</identifier><identifier>EISSN: 1939-1463</identifier><identifier>DOI: 10.1037/met0000597</identifier><identifier>PMID: 37498693</identifier><language>eng</language><publisher>United States: American Psychological Association</publisher><subject>Causal Analysis ; Experimental Replication ; Human ; Probability</subject><ispartof>Psychological methods, 2023-07</ispartof><rights>2023, American Psychological Association</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c245t-6b099d2fc9fe5fe43a8c3c1cb8e914c6c9c0d37246cbb9b6353761b0582ae6e53</citedby><orcidid>0000-0001-7063-6897</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27923,27924</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/37498693$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Steiner, Peter M</creatorcontrib><creatorcontrib>Sheehan, Patrick</creatorcontrib><creatorcontrib>Wong, Vivian C</creatorcontrib><title>Correspondence measures for assessing replication success</title><title>Psychological methods</title><addtitle>Psychol Methods</addtitle><description>Given recent evidence challenging the replicability of results in the social and behavioral sciences, critical questions have been raised about appropriate measures for determining replication success in comparing effect estimates across studies. At issue is the fact that conclusions about replication success often depend on the measure used for evaluating correspondence in results. Despite the importance of choosing an appropriate measure, there is still no widespread agreement about which measures should be used. This article addresses these questions by describing formally the most commonly used measures for assessing replication success, and by comparing their performance in different contexts according to their replication probabilities-that is, the probability of obtaining replication success given study-specific settings. The measures may be characterized broadly as conclusion-based approaches, which assess the congruence of two independent studies' conclusions about the presence of an effect, and distance-based approaches, which test for a significant difference or equivalence of two effect estimates. We also introduce a new measure for assessing replication success called the correspondence test, which combines a difference and equivalence test in the same framework. To help researchers plan prospective replication efforts, we provide closed formulas for power calculations that can be used to determine the minimum detectable effect size (and thus, sample sizes) for each study so that a predetermined minimum replication probability can be achieved. Finally, we use a replication data set from the Open Science Collaboration (2015) to demonstrate the extent to which conclusions about replication success depend on the correspondence measure selected. (PsycInfo Database Record (c) 2023 APA, all rights reserved).</description><subject>Causal Analysis</subject><subject>Experimental Replication</subject><subject>Human</subject><subject>Probability</subject><issn>1082-989X</issn><issn>1939-1463</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNpdkE1LxDAQhoMo7rp68QdIwYsI1aT5aOYoxS9Y8KLgraTpVLq0TU3ag__eLLsqOJcZXh5ehoeQc0ZvGOX5bY8TjSMhPyBLBhxSJhQ_jDfVWQoa3hfkJIQNpUxwLY7JgucCtAK-JFA47zGMbqhxsJj0aMIcg6RxPjEhYAjt8JF4HLvWmql1QxJma2N8So4a0wU82-8VeXu4fy2e0vXL43Nxt05tJuSUqooC1FljoUHZoOBGW26ZrTQCE1ZZsLTmeSaUrSqoFJc8V6yiUmcGFUq-Ile73tG7zxnDVPZtsNh1ZkA3hzLTUgiuONCIXv5DN272Q_wuUiKTAAq2hdc7ynoXgsemHH3bG_9VMlpuhZZ_QiN8sa-cqx7rX_THIP8GVxRwlg</recordid><startdate>20230727</startdate><enddate>20230727</enddate><creator>Steiner, Peter M</creator><creator>Sheehan, Patrick</creator><creator>Wong, Vivian C</creator><general>American Psychological Association</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7RZ</scope><scope>PSYQQ</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0001-7063-6897</orcidid></search><sort><creationdate>20230727</creationdate><title>Correspondence measures for assessing replication success</title><author>Steiner, Peter M ; Sheehan, Patrick ; Wong, Vivian C</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c245t-6b099d2fc9fe5fe43a8c3c1cb8e914c6c9c0d37246cbb9b6353761b0582ae6e53</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Causal Analysis</topic><topic>Experimental Replication</topic><topic>Human</topic><topic>Probability</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Steiner, Peter M</creatorcontrib><creatorcontrib>Sheehan, Patrick</creatorcontrib><creatorcontrib>Wong, Vivian C</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>APA PsycArticles®</collection><collection>ProQuest One Psychology</collection><collection>MEDLINE - Academic</collection><jtitle>Psychological methods</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Steiner, Peter M</au><au>Sheehan, Patrick</au><au>Wong, Vivian C</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Correspondence measures for assessing replication success</atitle><jtitle>Psychological methods</jtitle><addtitle>Psychol Methods</addtitle><date>2023-07-27</date><risdate>2023</risdate><issn>1082-989X</issn><eissn>1939-1463</eissn><abstract>Given recent evidence challenging the replicability of results in the social and behavioral sciences, critical questions have been raised about appropriate measures for determining replication success in comparing effect estimates across studies. At issue is the fact that conclusions about replication success often depend on the measure used for evaluating correspondence in results. Despite the importance of choosing an appropriate measure, there is still no widespread agreement about which measures should be used. This article addresses these questions by describing formally the most commonly used measures for assessing replication success, and by comparing their performance in different contexts according to their replication probabilities-that is, the probability of obtaining replication success given study-specific settings. The measures may be characterized broadly as conclusion-based approaches, which assess the congruence of two independent studies' conclusions about the presence of an effect, and distance-based approaches, which test for a significant difference or equivalence of two effect estimates. We also introduce a new measure for assessing replication success called the correspondence test, which combines a difference and equivalence test in the same framework. To help researchers plan prospective replication efforts, we provide closed formulas for power calculations that can be used to determine the minimum detectable effect size (and thus, sample sizes) for each study so that a predetermined minimum replication probability can be achieved. Finally, we use a replication data set from the Open Science Collaboration (2015) to demonstrate the extent to which conclusions about replication success depend on the correspondence measure selected. (PsycInfo Database Record (c) 2023 APA, all rights reserved).</abstract><cop>United States</cop><pub>American Psychological Association</pub><pmid>37498693</pmid><doi>10.1037/met0000597</doi><orcidid>https://orcid.org/0000-0001-7063-6897</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1082-989X |
ispartof | Psychological methods, 2023-07 |
issn | 1082-989X 1939-1463 |
language | eng |
recordid | cdi_proquest_miscellaneous_2854436390 |
source | EBSCOhost APA PsycARTICLES |
subjects | Causal Analysis Experimental Replication Human Probability |
title | Correspondence measures for assessing replication success |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-11T16%3A02%3A52IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Correspondence%20measures%20for%20assessing%20replication%20success&rft.jtitle=Psychological%20methods&rft.au=Steiner,%20Peter%20M&rft.date=2023-07-27&rft.issn=1082-989X&rft.eissn=1939-1463&rft_id=info:doi/10.1037/met0000597&rft_dat=%3Cproquest_cross%3E2854436390%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2842599695&rft_id=info:pmid/37498693&rfr_iscdi=true |