Developing a corpus of plagiarised short answers

Plagiarism is widely acknowledged to be a significant and increasing problem for higher education institutions (McCabe 2005; Judge 2008). A wide range of solutions, including several commercial systems, have been proposed to assist the educator in the task of identifying plagiarised work, or even to...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Language Resources and Evaluation 2011-03, Vol.45 (1), p.5-24
Hauptverfasser:	Clough, Paul, Stevenson, Mark
Format:	Artikel
Sprache:	eng
Schlagworte:	Authoring Authorship Computational Linguistics Computer Science Construction Education Higher education Higher education institutions Identification Information retrieval Language and Literature Legal proceedings Linguistics Natural resources Object oriented programming Paraphrase Plagiarism Question answer sequences Search engines Simulation Social Sciences Students Tasks Text analysis Wikipedia
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Plagiarism is widely acknowledged to be a significant and increasing problem for higher education institutions (McCabe 2005; Judge 2008). A wide range of solutions, including several commercial systems, have been proposed to assist the educator in the task of identifying plagiarised work, or even to detect them automatically. Direct comparison of these systems is made difficult by the problems in obtaining genuine examples of plagiarised student work. We describe our initial experiences with constructing a corpus consisting of answers to short questions in which plagiarism has been simulated. This corpus is designed to represent types of plagiarism that are not included in existing corpora and will be a useful addition to the set of resources available for the evaluation of plagiarism detection systems.
ISSN:	1574-020X 1572-8412 1574-0218
DOI:	10.1007/s10579-009-9112-1