Comparing the methods of measuring multi-rater agreement on an ordinal rating scale: a simulation study with an application to real data

Bibliographic details
Published in:	Journal of applied statistics 2013-07, Vol.40 (7), p.1506-1519
Main authors:	Sertdemir, Y., Burgut, H. R., Alparslan, Z. N., Unal, I., Gunasti, S.
Format:	Article
Language:	eng
Subjects:	agreement; Agreements; Applied statistics; Bias; bounded ordinal scale; Comparative analysis; Data simulation; Dermatology; Education; multi-rater; normal distribution; Psychology; Ratings; Ratings & rankings; Simulation; skewed distribution; Studies
Online access:	Full text

Description
Abstract:	Agreement among raters is an important issue in medicine, as well as in education and psychology. The agreement between two raters on a nominal or ordinal rating scale has been investigated in many articles. The multi-rater case with normally distributed ratings has also been explored at length. However, there is a lack of research on multiple raters using an ordinal rating scale. In this simulation study, several methods were compared to analyze rater agreement. The special case that was focused on was the multi-rater case using a bounded ordinal rating scale. The proposed methods for agreement were compared within different settings. Three main ordinal data simulation settings were used (normal, skewed and shifted data). In addition, the proposed methods were applied to a real data set from dermatology. The simulation results showed that Kendall's W and mean gamma highly overestimated the agreement in data sets with shifts in data. ICC 4 for bounded data should be avoided in agreement studies with rating scales
ISSN:	0266-4763 1360-0532
DOI:	10.1080/02664763.2013.788617