The Effectiveness of Machine Score-Ability Ratings in Predicting Automated Scoring Performance
Published in: | Applied Measurement in Education, 2018-07, Vol. 31 (3), p. 215-232 |
---|---|
Main authors: | , , |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
Abstract: | This study sought to provide a framework for evaluating the machine score-ability of items using a new score-ability rating scale, and to determine the extent to which the ratings were predictive of observed automated scoring performance. The study listed and described a set of factors thought to influence machine score-ability; these factors informed the score-ability ratings applied by expert raters. Five Reading items, six Science items, and ten Math items were examined. Experts in automated scoring served as reviewers, providing independent score-ability ratings before engine calibration. Following the ratings, engines were calibrated and their performance was evaluated against common industry criteria. Three criteria were derived from the engine evaluations: the score-ability value on the rating scale implied by the empirical results, the number of industry evaluation criteria met by the engine, and the engine's approval status based on the number of criteria met. The results indicated that the ratings were moderately correlated with Science score-ability, weakly correlated with Math score-ability, and not correlated with Reading score-ability. |
ISSN: | 0895-7347, 1532-4818 |
DOI: | 10.1080/08957347.2018.1464452 |
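
The abstract reports per-subject correlations between expert score-ability ratings and criteria derived from the engine evaluations. Below is a minimal sketch of that kind of analysis, assuming Spearman's rank correlation as the agreement measure (the abstract does not name the coefficient used) and treating the number of industry evaluation criteria met as the performance criterion; the item counts follow the abstract, but every rating and criterion value is hypothetical.

```python
# Illustrative sketch only: the study's data and analysis code are not
# part of this record, so all values below are hypothetical.
from scipy.stats import spearmanr

# (expert score-ability rating, number of industry evaluation criteria
# met by the calibrated engine) per item, grouped by subject.
items = {
    "Reading": [(2, 1), (3, 2), (1, 3), (4, 1), (2, 2)],           # 5 items
    "Science": [(1, 1), (2, 2), (3, 2), (3, 3), (4, 4), (2, 1)],   # 6 items
    "Math":    [(1, 2), (2, 1), (2, 3), (3, 2), (3, 4),
                (4, 3), (1, 1), (4, 4), (2, 2), (3, 3)],           # 10 items
}

for subject, pairs in items.items():
    ratings, criteria_met = zip(*pairs)
    # Spearman's rho suits ordinal data such as rating-scale values
    # and counts of criteria met.
    rho, p = spearmanr(ratings, criteria_met)
    print(f"{subject}: rho = {rho:.2f} (p = {p:.3f}, n = {len(pairs)})")
```

With only five to ten items per subject, as in the study, such correlations carry wide sampling uncertainty, which is worth keeping in mind when reading the moderate/weak/null pattern the abstract reports.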