Monitoring the performance of human and automated scores for spoken responses

Bibliographic details
Published in: Language Testing 2018-01, Vol. 35 (1), pp. 101-120
Main authors: Wang, Zhen; Zechner, Klaus; Sun, Yu
Format: Article
Language: English
Online access: Full text
Description
Summary: As automated scoring systems for spoken responses are increasingly used in language assessments, testing organizations need to analyze their performance, compared with that of human raters, across several dimensions, for example, on individual items or for subgroups of test takers. In addition, testing organizations need to establish rigorous procedures for monitoring the performance of both human and automated scoring processes during operational administrations. This paper provides an overview of the automated speech scoring system SpeechRaterSM and shows how to use charts and evaluation statistics to monitor and evaluate automated scores and human rater scores of spoken constructed responses.
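The evaluation statistics themselves are not reproduced in this record. As a minimal sketch, the kinds of statistics typically used to compare human and automated scores (Pearson correlation, quadratic weighted kappa, exact agreement, and a standardized mean difference) could be computed as below; the 1-4 score scale and the data are hypothetical, not taken from the paper.

```python
# Sketch of common human-machine agreement statistics (illustrative only).
import numpy as np

def quadratic_weighted_kappa(a, b, min_score, max_score):
    """Quadratic weighted kappa between two integer score vectors."""
    n_cat = max_score - min_score + 1
    observed = np.zeros((n_cat, n_cat))
    for x, y in zip(a, b):
        observed[x - min_score, y - min_score] += 1
    observed /= observed.sum()
    # Expected matrix under chance agreement, from the marginals.
    expected = np.outer(observed.sum(axis=1), observed.sum(axis=0))
    # Quadratic disagreement weights: 0 on the diagonal, growing with distance.
    idx = np.arange(n_cat)
    weights = (idx[:, None] - idx[None, :]) ** 2 / (n_cat - 1) ** 2
    return 1.0 - (weights * observed).sum() / (weights * expected).sum()

# Hypothetical human and machine scores on a 1-4 scale.
human = np.array([3, 2, 4, 3, 1, 2, 3, 4, 2, 3])
machine = np.array([3, 2, 3, 3, 2, 2, 4, 4, 2, 3])

pearson_r = np.corrcoef(human, machine)[0, 1]
qwk = quadratic_weighted_kappa(human, machine, 1, 4)
exact_agreement = np.mean(human == machine)
# Standardized mean difference flags systematic leniency or severity drift.
std_mean_diff = (machine.mean() - human.mean()) / np.sqrt(
    (machine.var(ddof=1) + human.var(ddof=1)) / 2
)

print(f"Pearson r: {pearson_r:.3f}")
print(f"Quadratic weighted kappa: {qwk:.3f}")
print(f"Exact agreement: {exact_agreement:.2%}")
print(f"Standardized mean difference: {std_mean_diff:.3f}")
```

Tracking such statistics per item and per test-taker subgroup over operational administrations is one way to surface drift between human and automated scores.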
ISSN: 0265-5322 (print); 1477-0946 (online)
DOI: 10.1177/0265532216679451