Word frequencies and bigrams in bahasa Melayu

This paper reports on some preliminary findings on word frequency in the MALEX database. The most frequent words are described, attention being paid to the position of content words in the frequency list. Nouns emerge as the most problematic to a particular set of genres, or even to a particular tex...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of Modern Languages=Jurnal Bahasa Moden 2017-07, Vol.15 (1)
Hauptverfasser: Zuraidah Mohd. Don, Gerry Knowles
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:This paper reports on some preliminary findings on word frequency in the MALEX database. The most frequent words are described, attention being paid to the position of content words in the frequency list. Nouns emerge as the most problematic to a particular set of genres, or even to a particular text. Bigrams are studied both as sequences of individual words and as sequences of grammatical tags. Whereas the tag sequences reflect syntactic rules and thus the hierarchical struture of syntax, sequence of individual words reflect quite a different kind of linear structure which has begun to emerge in recent years in corpus linguistcs.
ISSN:1675-526X
2462-1986