The Swell Language Learner Corpus: From Design to Annotation

The article presents a new language learner corpus for Swedish, SweLL, and the methodology from collection and pesudonymisation to protect personal information of learners to annotation adapted to second language learning. The main aim is to deliver a well-annotated corpus of essays written by secon...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Northern European Journal of Language Technology 2019, Vol.6, p.67
Hauptverfasser: Volodina, Elena, Granstedt, Lena, Matsson, Arild, Megyesi, Beáta, Pilán, Ildikó, Prentice, Julia, Rosén, Dan, Rudebeck, Lisa, Schenström, Carl-Johan, Sundberg, Gunlög, Wirén, Mats
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The article presents a new language learner corpus for Swedish, SweLL, and the methodology from collection and pesudonymisation to protect personal information of learners to annotation adapted to second language learning. The main aim is to deliver a well-annotated corpus of essays written by second language learners of Swedish and make it available for research through a browsable environment. To that end, a new annotation tool and a new project management tool have been implemented, – both with the main purpose to ensure reliability and quality of the final corpus. In the article we discuss reasoning behind metadata selection, principles of gold corpus compilation and argue for separation of normalization from correction annotation.
ISSN:2000-1533
DOI:10.3384/nejlt.2000-1533.19667