A Corpus Investigation on the Journal of Social Sciences of the Turkic World
Published in: | Universal Journal of Educational Research (Print), 2018-06, Vol. 6 (6), pp. 1199-1206 |
---|---|
Main author: | |
Format: | Article |
Language: | English |
Subjects: | |
Abstract: | In recent years, computer technologies have developed rapidly and data access has become easier. Today, storing documents, or data in general, and transferring them to interested parties are routine tasks. The number of stored documents has also grown quickly, and this growth has required new technologies for building knowledge from large data sets. Basic applications of text mining include gathering and processing text to extract information from raw data. Such applications can therefore help researchers obtain valuable knowledge from a mass of documents. This study investigated academic articles published in "bilig" ("Journal of Social Sciences of the Turkic World") between 1996 and 2017 to find the frequencies of words and letters used in academic Turkish. Basic text mining of 4,850,817 words on 19,437 pages from 81 "bilig" issues was completed using Zemberek, a natural language processing library, and the programming language R. |
---|---|
ISSN: | 2332-3205 2332-3213 |
DOI: | 10.13189/ujer.2018.060610 |
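
The abstract describes counting word and letter frequencies in Turkish text. The study itself used Zemberek and R; the following is only a minimal Python sketch of the counting step, with no morphological analysis. The function names, the regular expression used for tokenization, and the sample sentence are illustrative assumptions, not taken from the article.

```python
import re
from collections import Counter

# Assumption: a "word" is a maximal run of Unicode letters.
# The study's actual tokenization (via Zemberek) may differ.
WORD_RE = re.compile(r"[^\W\d_]+")


def turkish_lower(text):
    """Lowercase text using Turkish casing rules for the letter I.

    Python's str.lower() maps "I" to "i", but Turkish maps
    "I" to dotless "ı" and "İ" to dotted "i", so both are
    replaced explicitly before the generic lowercasing.
    """
    return text.replace("I", "ı").replace("İ", "i").lower()


def word_and_letter_frequencies(text):
    """Return (word_counts, letter_counts) as Counter objects."""
    words = WORD_RE.findall(turkish_lower(text))
    word_counts = Counter(words)
    # Letters are counted only inside tokens, so punctuation,
    # digits, and whitespace are excluded.
    letter_counts = Counter(ch for word in words for ch in word)
    return word_counts, letter_counts


# Hypothetical sample sentence, not drawn from the corpus.
sample = "Bilig dergisi ve bilig yazıları; IŞIK ve İZ."
word_counts, letter_counts = word_and_letter_frequencies(sample)

print(word_counts.most_common(3))
print(letter_counts.most_common(5))
```

Handling the dotted and dotless I explicitly matters for letter statistics in Turkish, since generic lowercasing would merge or mangle two distinct letters.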