Bayesian approaches to the weighted kappa-like inter-rater agreement measures

Inter-rater agreement measures are used to estimate the degree of agreement between two or more assessors. When the agreement table is ordinal, different weight functions that incorporate row and column scores are used along with the agreement measures. The selection of row and column scores is effe...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Statistical methods in medical research 2021-10, Vol.30 (10), p.2329-2351
Hauptverfasser: Tran, Quoc Duyet, Demirhan, Haydar, Dolgun, Anil
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 2351
container_issue 10
container_start_page 2329
container_title Statistical methods in medical research
container_volume 30
creator Tran, Quoc Duyet
Demirhan, Haydar
Dolgun, Anil
description Inter-rater agreement measures are used to estimate the degree of agreement between two or more assessors. When the agreement table is ordinal, different weight functions that incorporate row and column scores are used along with the agreement measures. The selection of row and column scores is effectual on the estimated degree of agreement. The weighted measures are prone to the anomalies frequently seen in agreement tables such as unbalanced table structures or grey zones due to the assessment behaviour of the raters. In this study, Bayesian approaches for the estimation of inter-rater agreement measures are proposed. The Bayesian approaches make it possible to include prior information on the assessment behaviour of the raters in the analysis and impose order restrictions on the row and column scores. In this way, we improve the accuracy of the agreement measures and mitigate the impact of the anomalies in the estimation of the strength of agreement between the raters. The elicitation of prior distributions is described theoretically and practically for the Bayesian estimation of five agreement measures with three different weights using an agreement table having two grey zones. A Monte Carlo simulation study is conducted to assess the classification accuracy of the Bayesian and classical approaches for the considered agreement measures for a given level of agreement. Recommendations for the selection of the highest performing agreement measure and weight combination are made in the breakdown of the table structure and sample size.
doi_str_mv 10.1177/09622802211037068
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2566039792</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sage_id>10.1177_09622802211037068</sage_id><sourcerecordid>2566039792</sourcerecordid><originalsourceid>FETCH-LOGICAL-c368t-52b4c6c42a9c7be94a7a8a6f8ddeb5a2c62167f555d6f4f283819ae0ed4cb2473</originalsourceid><addsrcrecordid>eNp1kE1Lw0AQhhdRbK3-AC8S8OIldb-yuzlq8QsqXvQcJptJm7ZJ6m6C9N-7pVVBkYGZw_vMO8NLyDmjY8a0vqap4txQzhmjQlNlDsiQSa1jKoQ8JMOtHm-BATnxfkEp1VSmx2QgpJRGCTEkz7ewQV9BE8F67Vqwc_RR10bdHKMPrGbzDotoGTSIV9USo6rp0MUOQo9g5hBrbLqoRvC9Q39KjkpYeTzbzxF5u797nTzG05eHp8nNNLZCmS5OeC6tspJDanWOqQQNBlRpigLzBLhVnCldJklSqFKW3AjDUkCKhbQ5l1qMyNXON7z83qPvsrryFlcraLDtfcYTpahIdcoDevkLXbS9a8J3gTKMMyFDjQjbUda13jsss7WranCbjNFsm3X2J-uwc7F37vMai--Nr3ADMN4BHmb4c_Z_x0_qhIX5</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2581213434</pqid></control><display><type>article</type><title>Bayesian approaches to the weighted kappa-like inter-rater agreement measures</title><source>MEDLINE</source><source>Applied Social Sciences Index &amp; Abstracts (ASSIA)</source><source>SAGE Complete A-Z List</source><creator>Tran, Quoc Duyet ; Demirhan, Haydar ; Dolgun, Anil</creator><creatorcontrib>Tran, Quoc Duyet ; Demirhan, Haydar ; Dolgun, Anil</creatorcontrib><description>Inter-rater agreement measures are used to estimate the degree of agreement between two or more assessors. When the agreement table is ordinal, different weight functions that incorporate row and column scores are used along with the agreement measures. The selection of row and column scores is effectual on the estimated degree of agreement. The weighted measures are prone to the anomalies frequently seen in agreement tables such as unbalanced table structures or grey zones due to the assessment behaviour of the raters. In this study, Bayesian approaches for the estimation of inter-rater agreement measures are proposed. The Bayesian approaches make it possible to include prior information on the assessment behaviour of the raters in the analysis and impose order restrictions on the row and column scores. In this way, we improve the accuracy of the agreement measures and mitigate the impact of the anomalies in the estimation of the strength of agreement between the raters. The elicitation of prior distributions is described theoretically and practically for the Bayesian estimation of five agreement measures with three different weights using an agreement table having two grey zones. A Monte Carlo simulation study is conducted to assess the classification accuracy of the Bayesian and classical approaches for the considered agreement measures for a given level of agreement. Recommendations for the selection of the highest performing agreement measure and weight combination are made in the breakdown of the table structure and sample size.</description><identifier>ISSN: 0962-2802</identifier><identifier>EISSN: 1477-0334</identifier><identifier>DOI: 10.1177/09622802211037068</identifier><identifier>PMID: 34448633</identifier><language>eng</language><publisher>London, England: SAGE Publications</publisher><subject>Agreements ; Anomalies ; Assessors ; Bayes Theorem ; Bayesian analysis ; Classification ; Computer Simulation ; Elicitation ; Humans ; Monte Carlo Method ; Monte Carlo simulation ; Observer Variation ; Reproducibility of Results ; Simulation ; Weighting functions</subject><ispartof>Statistical methods in medical research, 2021-10, Vol.30 (10), p.2329-2351</ispartof><rights>The Author(s) 2021</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c368t-52b4c6c42a9c7be94a7a8a6f8ddeb5a2c62167f555d6f4f283819ae0ed4cb2473</citedby><cites>FETCH-LOGICAL-c368t-52b4c6c42a9c7be94a7a8a6f8ddeb5a2c62167f555d6f4f283819ae0ed4cb2473</cites><orcidid>0000-0002-8565-4710</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://journals.sagepub.com/doi/pdf/10.1177/09622802211037068$$EPDF$$P50$$Gsage$$H</linktopdf><linktohtml>$$Uhttps://journals.sagepub.com/doi/10.1177/09622802211037068$$EHTML$$P50$$Gsage$$H</linktohtml><link.rule.ids>314,776,780,21799,27903,27904,30978,43600,43601</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/34448633$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Tran, Quoc Duyet</creatorcontrib><creatorcontrib>Demirhan, Haydar</creatorcontrib><creatorcontrib>Dolgun, Anil</creatorcontrib><title>Bayesian approaches to the weighted kappa-like inter-rater agreement measures</title><title>Statistical methods in medical research</title><addtitle>Stat Methods Med Res</addtitle><description>Inter-rater agreement measures are used to estimate the degree of agreement between two or more assessors. When the agreement table is ordinal, different weight functions that incorporate row and column scores are used along with the agreement measures. The selection of row and column scores is effectual on the estimated degree of agreement. The weighted measures are prone to the anomalies frequently seen in agreement tables such as unbalanced table structures or grey zones due to the assessment behaviour of the raters. In this study, Bayesian approaches for the estimation of inter-rater agreement measures are proposed. The Bayesian approaches make it possible to include prior information on the assessment behaviour of the raters in the analysis and impose order restrictions on the row and column scores. In this way, we improve the accuracy of the agreement measures and mitigate the impact of the anomalies in the estimation of the strength of agreement between the raters. The elicitation of prior distributions is described theoretically and practically for the Bayesian estimation of five agreement measures with three different weights using an agreement table having two grey zones. A Monte Carlo simulation study is conducted to assess the classification accuracy of the Bayesian and classical approaches for the considered agreement measures for a given level of agreement. Recommendations for the selection of the highest performing agreement measure and weight combination are made in the breakdown of the table structure and sample size.</description><subject>Agreements</subject><subject>Anomalies</subject><subject>Assessors</subject><subject>Bayes Theorem</subject><subject>Bayesian analysis</subject><subject>Classification</subject><subject>Computer Simulation</subject><subject>Elicitation</subject><subject>Humans</subject><subject>Monte Carlo Method</subject><subject>Monte Carlo simulation</subject><subject>Observer Variation</subject><subject>Reproducibility of Results</subject><subject>Simulation</subject><subject>Weighting functions</subject><issn>0962-2802</issn><issn>1477-0334</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>7QJ</sourceid><recordid>eNp1kE1Lw0AQhhdRbK3-AC8S8OIldb-yuzlq8QsqXvQcJptJm7ZJ6m6C9N-7pVVBkYGZw_vMO8NLyDmjY8a0vqap4txQzhmjQlNlDsiQSa1jKoQ8JMOtHm-BATnxfkEp1VSmx2QgpJRGCTEkz7ewQV9BE8F67Vqwc_RR10bdHKMPrGbzDotoGTSIV9USo6rp0MUOQo9g5hBrbLqoRvC9Q39KjkpYeTzbzxF5u797nTzG05eHp8nNNLZCmS5OeC6tspJDanWOqQQNBlRpigLzBLhVnCldJklSqFKW3AjDUkCKhbQ5l1qMyNXON7z83qPvsrryFlcraLDtfcYTpahIdcoDevkLXbS9a8J3gTKMMyFDjQjbUda13jsss7WranCbjNFsm3X2J-uwc7F37vMai--Nr3ADMN4BHmb4c_Z_x0_qhIX5</recordid><startdate>202110</startdate><enddate>202110</enddate><creator>Tran, Quoc Duyet</creator><creator>Demirhan, Haydar</creator><creator>Dolgun, Anil</creator><general>SAGE Publications</general><general>Sage Publications Ltd</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QJ</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>K9.</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0002-8565-4710</orcidid></search><sort><creationdate>202110</creationdate><title>Bayesian approaches to the weighted kappa-like inter-rater agreement measures</title><author>Tran, Quoc Duyet ; Demirhan, Haydar ; Dolgun, Anil</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c368t-52b4c6c42a9c7be94a7a8a6f8ddeb5a2c62167f555d6f4f283819ae0ed4cb2473</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Agreements</topic><topic>Anomalies</topic><topic>Assessors</topic><topic>Bayes Theorem</topic><topic>Bayesian analysis</topic><topic>Classification</topic><topic>Computer Simulation</topic><topic>Elicitation</topic><topic>Humans</topic><topic>Monte Carlo Method</topic><topic>Monte Carlo simulation</topic><topic>Observer Variation</topic><topic>Reproducibility of Results</topic><topic>Simulation</topic><topic>Weighting functions</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Tran, Quoc Duyet</creatorcontrib><creatorcontrib>Demirhan, Haydar</creatorcontrib><creatorcontrib>Dolgun, Anil</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Applied Social Sciences Index &amp; Abstracts (ASSIA)</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>MEDLINE - Academic</collection><jtitle>Statistical methods in medical research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Tran, Quoc Duyet</au><au>Demirhan, Haydar</au><au>Dolgun, Anil</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Bayesian approaches to the weighted kappa-like inter-rater agreement measures</atitle><jtitle>Statistical methods in medical research</jtitle><addtitle>Stat Methods Med Res</addtitle><date>2021-10</date><risdate>2021</risdate><volume>30</volume><issue>10</issue><spage>2329</spage><epage>2351</epage><pages>2329-2351</pages><issn>0962-2802</issn><eissn>1477-0334</eissn><abstract>Inter-rater agreement measures are used to estimate the degree of agreement between two or more assessors. When the agreement table is ordinal, different weight functions that incorporate row and column scores are used along with the agreement measures. The selection of row and column scores is effectual on the estimated degree of agreement. The weighted measures are prone to the anomalies frequently seen in agreement tables such as unbalanced table structures or grey zones due to the assessment behaviour of the raters. In this study, Bayesian approaches for the estimation of inter-rater agreement measures are proposed. The Bayesian approaches make it possible to include prior information on the assessment behaviour of the raters in the analysis and impose order restrictions on the row and column scores. In this way, we improve the accuracy of the agreement measures and mitigate the impact of the anomalies in the estimation of the strength of agreement between the raters. The elicitation of prior distributions is described theoretically and practically for the Bayesian estimation of five agreement measures with three different weights using an agreement table having two grey zones. A Monte Carlo simulation study is conducted to assess the classification accuracy of the Bayesian and classical approaches for the considered agreement measures for a given level of agreement. Recommendations for the selection of the highest performing agreement measure and weight combination are made in the breakdown of the table structure and sample size.</abstract><cop>London, England</cop><pub>SAGE Publications</pub><pmid>34448633</pmid><doi>10.1177/09622802211037068</doi><tpages>23</tpages><orcidid>https://orcid.org/0000-0002-8565-4710</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0962-2802
ispartof Statistical methods in medical research, 2021-10, Vol.30 (10), p.2329-2351
issn 0962-2802
1477-0334
language eng
recordid cdi_proquest_miscellaneous_2566039792
source MEDLINE; Applied Social Sciences Index & Abstracts (ASSIA); SAGE Complete A-Z List
subjects Agreements
Anomalies
Assessors
Bayes Theorem
Bayesian analysis
Classification
Computer Simulation
Elicitation
Humans
Monte Carlo Method
Monte Carlo simulation
Observer Variation
Reproducibility of Results
Simulation
Weighting functions
title Bayesian approaches to the weighted kappa-like inter-rater agreement measures
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-25T16%3A17%3A11IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Bayesian%20approaches%20to%20the%20weighted%20kappa-like%20inter-rater%20agreement%20measures&rft.jtitle=Statistical%20methods%20in%20medical%20research&rft.au=Tran,%20Quoc%20Duyet&rft.date=2021-10&rft.volume=30&rft.issue=10&rft.spage=2329&rft.epage=2351&rft.pages=2329-2351&rft.issn=0962-2802&rft.eissn=1477-0334&rft_id=info:doi/10.1177/09622802211037068&rft_dat=%3Cproquest_cross%3E2566039792%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2581213434&rft_id=info:pmid/34448633&rft_sage_id=10.1177_09622802211037068&rfr_iscdi=true