Assessing Reproducibility of Data Obtained With Instruments Based on Continuous Measurements

Data obtained with any research tool must be reproducible, a concept referred to as reliability. Three techniques are often used to evaluate reliability of tools using continuous data in aging research: intraclass correlation coefficients (ICC), Pearson correlations, and paired t tests. These are of...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Experimental aging research 2000-10, Vol.26 (4), p.353-365
Hauptverfasser:	BEDARD, Michel, MARTIN, Nancy J, KRUEGER, Paul, BRAZIL, Kevin
Format:	Artikel
Sprache:	eng
Schlagworte:	Aging Biological and medical sciences Data Interpretation, Statistical Databases, Factual Fundamental and applied biological sciences. Psychology Humans Psychology. Psychoanalysis. Psychiatry Psychology. Psychophysiology Psychometrics Psychometrics. Statistics. Methodology Reproducibility of Results
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	365
container_issue	4
container_start_page	353
container_title	Experimental aging research
container_volume	26
creator	BEDARD, Michel MARTIN, Nancy J KRUEGER, Paul BRAZIL, Kevin
description	Data obtained with any research tool must be reproducible, a concept referred to as reliability. Three techniques are often used to evaluate reliability of tools using continuous data in aging research: intraclass correlation coefficients (ICC), Pearson correlations, and paired t tests. These are often construed as equivalent when applied to reliability. This is not correct, and may lead researchers to select instruments based on statistics that may not reflect actual reliability. The purpose of this paper is to compare the reliability estimates produced by these three techniques and determine the preferable technique. A hypothetical dataset was produced to evaluate the reliability estimates obtained with ICC, Pearson correlations, and paired t tests in three different situations. For each situation two sets of 20 observations were created to simulate an intrarater or inter-rater paradigm, based on 20 participants with two observations per participant. Situations were designed to demonstrate good agreement, systematic bias, or substantial random measurement error. In the situation demonstrating good agreement, all three techniques supported the conclusionthat the data were reliable. In the situation demonstrating systematic bias, the ICC and t test suggested the data were not reliable, whereas the Pearson correlation suggested high reliability despite the systematic discrepancy. In the situation representing substantial randommeasurement error where low reliability was expected, the ICC and Pearson coefficient accurately illustrated this. The t test suggested the data were reliable. The ICC is the preferred technique to measure reliability. Although there are some limitations associated with the use of this technique, they can be overcome.
doi_str_mv	10.1080/036107300750015741
format	Article
fullrecord	<record><control><sourceid>proquest_pasca</sourceid><recordid>TN_cdi_pascalfrancis_primary_798129</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>70778490</sourcerecordid><originalsourceid>FETCH-LOGICAL-c430t-693a235c3b976a4a026021b00ad7a44507bad93504f03ad9e74e99a5e5bb8f693</originalsourceid><addsrcrecordid>eNqFkE2LFDEQhoMo7rj6BzxIQPDWWukknc7Bwzp-LawsiKIHoanuTmukO1lTaXT-vRln1MOCnioUz1O8eRm7L-CxgBaegGwEGAlgNIDQRokbbCPANpVqtLnJNnugKsTHE3aH6CsAaCnkbXYiCiWsEhv26YzIEfnwmb91VymO6-B7P_u843HizzEjv-wz-uBG_sHnL_w8UE7r4kIm_gyprGPg2xiyD2tcib9xSGtyv4C77NaEM7l7x3nK3r988W77urq4fHW-PbuoBiUhV42VWEs9yN6aBhVC3UAtegAcDSqlwfQ4WqlBTSDLyxnlrEXtdN-3U7FP2aPD3fKBb6uj3C2eBjfPGFzJ1BkwplUWClgfwCFFouSm7ir5BdOuE9DtO-2ud1qkB8fra7-48a9yLLEAD48A0oDzlDAMnv5wxrai3od8eqB8mGJa8HtM89hl3M0x_VbkP2O0__Wva13-keVPuoWj0g</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>70778490</pqid></control><display><type>article</type><title>Assessing Reproducibility of Data Obtained With Instruments Based on Continuous Measurements</title><source>MEDLINE</source><source>Taylor & Francis Journals Complete</source><creator>BEDARD, Michel ; MARTIN, Nancy J ; KRUEGER, Paul ; BRAZIL, Kevin</creator><creatorcontrib>BEDARD, Michel ; MARTIN, Nancy J ; KRUEGER, Paul ; BRAZIL, Kevin</creatorcontrib><description>Data obtained with any research tool must be reproducible, a concept referred to as reliability. Three techniques are often used to evaluate reliability of tools using continuous data in aging research: intraclass correlation coefficients (ICC), Pearson correlations, and paired t tests. These are often construed as equivalent when applied to reliability. This is not correct, and may lead researchers to select instruments based on statistics that may not reflect actual reliability. The purpose of this paper is to compare the reliability estimates produced by these three techniques and determine the preferable technique. A hypothetical dataset was produced to evaluate the reliability estimates obtained with ICC, Pearson correlations, and paired t tests in three different situations. For each situation two sets of 20 observations were created to simulate an intrarater or inter-rater paradigm, based on 20 participants with two observations per participant. Situations were designed to demonstrate good agreement, systematic bias, or substantial random measurement error. In the situation demonstrating good agreement, all three techniques supported the conclusionthat the data were reliable. In the situation demonstrating systematic bias, the ICC and t test suggested the data were not reliable, whereas the Pearson correlation suggested high reliability despite the systematic discrepancy. In the situation representing substantial randommeasurement error where low reliability was expected, the ICC and Pearson coefficient accurately illustrated this. The t test suggested the data were reliable. The ICC is the preferred technique to measure reliability. Although there are some limitations associated with the use of this technique, they can be overcome.</description><identifier>ISSN: 0361-073X</identifier><identifier>EISSN: 1096-4657</identifier><identifier>DOI: 10.1080/036107300750015741</identifier><identifier>PMID: 11091941</identifier><identifier>CODEN: EAGRDS</identifier><language>eng</language><publisher>Philadelphia, PA: Informa UK Ltd</publisher><subject>Aging ; Biological and medical sciences ; Data Interpretation, Statistical ; Databases, Factual ; Fundamental and applied biological sciences. Psychology ; Humans ; Psychology. Psychoanalysis. Psychiatry ; Psychology. Psychophysiology ; Psychometrics ; Psychometrics. Statistics. Methodology ; Reproducibility of Results</subject><ispartof>Experimental aging research, 2000-10, Vol.26 (4), p.353-365</ispartof><rights>Copyright Taylor & Francis Group, LLC 2000</rights><rights>2001 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c430t-693a235c3b976a4a026021b00ad7a44507bad93504f03ad9e74e99a5e5bb8f693</citedby><cites>FETCH-LOGICAL-c430t-693a235c3b976a4a026021b00ad7a44507bad93504f03ad9e74e99a5e5bb8f693</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.tandfonline.com/doi/pdf/10.1080/036107300750015741$$EPDF$$P50$$Ginformaworld$$H</linktopdf><linktohtml>$$Uhttps://www.tandfonline.com/doi/full/10.1080/036107300750015741$$EHTML$$P50$$Ginformaworld$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,59620,60409</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=798129$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/11091941$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>BEDARD, Michel</creatorcontrib><creatorcontrib>MARTIN, Nancy J</creatorcontrib><creatorcontrib>KRUEGER, Paul</creatorcontrib><creatorcontrib>BRAZIL, Kevin</creatorcontrib><title>Assessing Reproducibility of Data Obtained With Instruments Based on Continuous Measurements</title><title>Experimental aging research</title><addtitle>Exp Aging Res</addtitle><description>Data obtained with any research tool must be reproducible, a concept referred to as reliability. Three techniques are often used to evaluate reliability of tools using continuous data in aging research: intraclass correlation coefficients (ICC), Pearson correlations, and paired t tests. These are often construed as equivalent when applied to reliability. This is not correct, and may lead researchers to select instruments based on statistics that may not reflect actual reliability. The purpose of this paper is to compare the reliability estimates produced by these three techniques and determine the preferable technique. A hypothetical dataset was produced to evaluate the reliability estimates obtained with ICC, Pearson correlations, and paired t tests in three different situations. For each situation two sets of 20 observations were created to simulate an intrarater or inter-rater paradigm, based on 20 participants with two observations per participant. Situations were designed to demonstrate good agreement, systematic bias, or substantial random measurement error. In the situation demonstrating good agreement, all three techniques supported the conclusionthat the data were reliable. In the situation demonstrating systematic bias, the ICC and t test suggested the data were not reliable, whereas the Pearson correlation suggested high reliability despite the systematic discrepancy. In the situation representing substantial randommeasurement error where low reliability was expected, the ICC and Pearson coefficient accurately illustrated this. The t test suggested the data were reliable. The ICC is the preferred technique to measure reliability. Although there are some limitations associated with the use of this technique, they can be overcome.</description><subject>Aging</subject><subject>Biological and medical sciences</subject><subject>Data Interpretation, Statistical</subject><subject>Databases, Factual</subject><subject>Fundamental and applied biological sciences. Psychology</subject><subject>Humans</subject><subject>Psychology. Psychoanalysis. Psychiatry</subject><subject>Psychology. Psychophysiology</subject><subject>Psychometrics</subject><subject>Psychometrics. Statistics. Methodology</subject><subject>Reproducibility of Results</subject><issn>0361-073X</issn><issn>1096-4657</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2000</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkE2LFDEQhoMo7rj6BzxIQPDWWukknc7Bwzp-LawsiKIHoanuTmukO1lTaXT-vRln1MOCnioUz1O8eRm7L-CxgBaegGwEGAlgNIDQRokbbCPANpVqtLnJNnugKsTHE3aH6CsAaCnkbXYiCiWsEhv26YzIEfnwmb91VymO6-B7P_u843HizzEjv-wz-uBG_sHnL_w8UE7r4kIm_gyprGPg2xiyD2tcib9xSGtyv4C77NaEM7l7x3nK3r988W77urq4fHW-PbuoBiUhV42VWEs9yN6aBhVC3UAtegAcDSqlwfQ4WqlBTSDLyxnlrEXtdN-3U7FP2aPD3fKBb6uj3C2eBjfPGFzJ1BkwplUWClgfwCFFouSm7ir5BdOuE9DtO-2ud1qkB8fra7-48a9yLLEAD48A0oDzlDAMnv5wxrai3od8eqB8mGJa8HtM89hl3M0x_VbkP2O0__Wva13-keVPuoWj0g</recordid><startdate>20001001</startdate><enddate>20001001</enddate><creator>BEDARD, Michel</creator><creator>MARTIN, Nancy J</creator><creator>KRUEGER, Paul</creator><creator>BRAZIL, Kevin</creator><general>Informa UK Ltd</general><general>Taylor & Francis</general><scope>IQODW</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>20001001</creationdate><title>Assessing Reproducibility of Data Obtained With Instruments Based on Continuous Measurements</title><author>BEDARD, Michel ; MARTIN, Nancy J ; KRUEGER, Paul ; BRAZIL, Kevin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c430t-693a235c3b976a4a026021b00ad7a44507bad93504f03ad9e74e99a5e5bb8f693</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2000</creationdate><topic>Aging</topic><topic>Biological and medical sciences</topic><topic>Data Interpretation, Statistical</topic><topic>Databases, Factual</topic><topic>Fundamental and applied biological sciences. Psychology</topic><topic>Humans</topic><topic>Psychology. Psychoanalysis. Psychiatry</topic><topic>Psychology. Psychophysiology</topic><topic>Psychometrics</topic><topic>Psychometrics. Statistics. Methodology</topic><topic>Reproducibility of Results</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>BEDARD, Michel</creatorcontrib><creatorcontrib>MARTIN, Nancy J</creatorcontrib><creatorcontrib>KRUEGER, Paul</creatorcontrib><creatorcontrib>BRAZIL, Kevin</creatorcontrib><collection>Pascal-Francis</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Experimental aging research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>BEDARD, Michel</au><au>MARTIN, Nancy J</au><au>KRUEGER, Paul</au><au>BRAZIL, Kevin</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Assessing Reproducibility of Data Obtained With Instruments Based on Continuous Measurements</atitle><jtitle>Experimental aging research</jtitle><addtitle>Exp Aging Res</addtitle><date>2000-10-01</date><risdate>2000</risdate><volume>26</volume><issue>4</issue><spage>353</spage><epage>365</epage><pages>353-365</pages><issn>0361-073X</issn><eissn>1096-4657</eissn><coden>EAGRDS</coden><abstract>Data obtained with any research tool must be reproducible, a concept referred to as reliability. Three techniques are often used to evaluate reliability of tools using continuous data in aging research: intraclass correlation coefficients (ICC), Pearson correlations, and paired t tests. These are often construed as equivalent when applied to reliability. This is not correct, and may lead researchers to select instruments based on statistics that may not reflect actual reliability. The purpose of this paper is to compare the reliability estimates produced by these three techniques and determine the preferable technique. A hypothetical dataset was produced to evaluate the reliability estimates obtained with ICC, Pearson correlations, and paired t tests in three different situations. For each situation two sets of 20 observations were created to simulate an intrarater or inter-rater paradigm, based on 20 participants with two observations per participant. Situations were designed to demonstrate good agreement, systematic bias, or substantial random measurement error. In the situation demonstrating good agreement, all three techniques supported the conclusionthat the data were reliable. In the situation demonstrating systematic bias, the ICC and t test suggested the data were not reliable, whereas the Pearson correlation suggested high reliability despite the systematic discrepancy. In the situation representing substantial randommeasurement error where low reliability was expected, the ICC and Pearson coefficient accurately illustrated this. The t test suggested the data were reliable. The ICC is the preferred technique to measure reliability. Although there are some limitations associated with the use of this technique, they can be overcome.</abstract><cop>Philadelphia, PA</cop><pub>Informa UK Ltd</pub><pmid>11091941</pmid><doi>10.1080/036107300750015741</doi><tpages>13</tpages></addata></record>
fulltext	fulltext
identifier	ISSN: 0361-073X
ispartof	Experimental aging research, 2000-10, Vol.26 (4), p.353-365
issn	0361-073X 1096-4657
language	eng
recordid	cdi_pascalfrancis_primary_798129
source	MEDLINE; Taylor & Francis Journals Complete
subjects	Aging Biological and medical sciences Data Interpretation, Statistical Databases, Factual Fundamental and applied biological sciences. Psychology Humans Psychology. Psychoanalysis. Psychiatry Psychology. Psychophysiology Psychometrics Psychometrics. Statistics. Methodology Reproducibility of Results
title	Assessing Reproducibility of Data Obtained With Instruments Based on Continuous Measurements
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-08T00%3A08%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pasca&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Assessing%20Reproducibility%20of%20Data%20Obtained%20With%20Instruments%20Based%20on%20Continuous%20Measurements&rft.jtitle=Experimental%20aging%20research&rft.au=BEDARD,%20Michel&rft.date=2000-10-01&rft.volume=26&rft.issue=4&rft.spage=353&rft.epage=365&rft.pages=353-365&rft.issn=0361-073X&rft.eissn=1096-4657&rft.coden=EAGRDS&rft_id=info:doi/10.1080/036107300750015741&rft_dat=%3Cproquest_pasca%3E70778490%3C/proquest_pasca%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=70778490&rft_id=info:pmid/11091941&rfr_iscdi=true