LUS: Mizo Monolingual Corpus

Bibliographic details
Main authors: Candy Lalrempuii, Badal Soni
Format: Dataset
Language: English
Description
Summary: Mizo, also known as Lushai, is the official language of Mizoram, a state in north-eastern India. It is an under-resourced, highly tonal language of the Tibeto-Burman family. The LUS dataset comprises a monolingual corpus crawled from Mizo news websites such as Zalen (https://zalen.in/) and Times of Mizoram (https://www.timesofmizoram.com/). It contains a total of 101,827 Mizo sentences intended for research and academic purposes.
DOI: 10.21227/4kx5-wc43