Impact of Stemming on Telugu Text Classification

In Text categorization, Information retrieval and document clustering stemming is absolutely necessary especially for morphological rich languages like Indian. The process of stemming is, reducing the inflected or resultant terms to their stem word, root or origin form. However, stemming is a tricky...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International journal of recent technology and engineering 2019-07, Vol.8 (2), p.2767-2769
Hauptverfasser: Swapna, Dr. Narla, Subhashini, Dr. Peneti, Rani, Dr. B Padmaja
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In Text categorization, Information retrieval and document clustering stemming is absolutely necessary especially for morphological rich languages like Indian. The process of stemming is, reducing the inflected or resultant terms to their stem word, root or origin form. However, stemming is a tricky task - particularly for extremely inflected natural languages having a lot of words for the same normalized word form. In Text classification, stemming tries to cut off details like either suffix or prefix of a word and produce basic word. In this paper, we apply various stemming methods on Telugu text classification and ensure the performance of the classifier is effect by stemming. Telugu is suffix oriented language, so we have performed number of experiments on erratically selected Telugu text documents and finally we conceive that the performance of the classifier is improved.
ISSN:2277-3878
2277-3878
DOI:10.35940/ijrte.B2338.078219