The Chinese and English Learner Language Corpus (CELL Corpus)
The Chinese and English Learner Language Corpus (referred to as ‘the CELL Corpus’ hereafter) is designed as a learner language corpus. The CELL Corpus, as a learner language corpus, is thus designed as a collection of text data chiefly composed of Chinese and English academic essay-type assignments...
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Dataset |
Sprache: | eng |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The Chinese and English Learner Language Corpus (referred to as ‘the CELL Corpus’ hereafter) is designed as a learner language corpus. The CELL Corpus, as a learner language corpus, is thus designed as a collection of text data chiefly composed of Chinese and English academic essay-type assignments written by university undergraduate students. The students submitted their academic essays as assignments for the assessment purpose of the courses they enrolled for, which suggests the authenticity of the text data collected. In addition to the text data, the CELL Corpus is also designed to include the meta data of the students whose academic essays were collected. The meta data collected represent five types of demographic information of the students, which are namely: age, gender, place of birth, first language and public examination results for Chinese Language and English Language. The two datasets (i.e. text data and meta data) of the CELL Corpus are delineated in the following sub-sections.
The datasets uploaded on this webpage are solely utilized for the establishment of the CELL Corpus (https://cellcorpusouhk.com/). Approval from the authors must be sought if anyone would like to download the datasets for research purposes. |
---|---|
DOI: | 10.17632/gs4ppd7sz3.2 |