To stop or not to stop - Experiments on stopword elimination for information retrieval of Gujarati text documents

Bibliographic details
Main authors: Joshi, H., Pareek, J., Patel, R., Chauhan, K.
Format: Conference proceedings
Language: English
Description
Abstract: Words that occur frequently in a document but carry little meaning are called stopwords. Identifying and removing stopwords can make document indexing more effective. Mean average precision (MAP) is a standard metric for measuring the effectiveness of information retrieval (IR) tasks. In this paper, we experiment with eliminating Gujarati stopwords to measure the improvement in ad hoc monolingual information retrieval of Gujarati text documents. Results show that eliminating stopwords improves the MAP values of Gujarati IR.
ISSN: 2375-1282
DOI: 10.1109/NUICONE.2012.6493219
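
Illustration: The abstract relies on two ideas, filtering stopwords before indexing and scoring retrieval runs with MAP. The following is a minimal Python sketch of both, not the authors' implementation. The function names, the two-word stopword set, and the document and query identifiers are all hypothetical and chosen only for illustration; the paper's actual stopword list and evaluation data are not reproduced here.

```python
from typing import Dict, Iterable, List, Sequence, Set


def remove_stopwords(tokens: Iterable[str], stopwords: Set[str]) -> List[str]:
    """Return the tokens that are not in the stopword set, preserving order."""
    return [token for token in tokens if token not in stopwords]


def average_precision(ranked: Sequence[str], relevant: Set[str]) -> float:
    """Average precision for one query.

    Precision is taken at each rank where a relevant document appears,
    and the sum is divided by the total number of relevant documents.
    """
    if not relevant:
        return 0.0
    hits = 0
    precision_sum = 0.0
    for rank, doc_id in enumerate(ranked, start=1):
        if doc_id in relevant:
            hits += 1
            precision_sum += hits / rank
    return precision_sum / len(relevant)


def mean_average_precision(
    runs: Dict[str, Sequence[str]], qrels: Dict[str, Set[str]]
) -> float:
    """Mean of the per-query average precision over all judged queries."""
    if not qrels:
        return 0.0
    total = sum(
        average_precision(runs.get(query_id, []), relevant)
        for query_id, relevant in qrels.items()
    )
    return total / len(qrels)


# Hypothetical stopword set: two common Gujarati function words
# ("and", "is"), used here only as an illustration.
example_stopwords = {"અને", "છે"}
example_tokens = ["ગુજરાત", "અને", "ભારત"]
filtered_tokens = remove_stopwords(example_tokens, example_stopwords)

# Hypothetical ranked results and relevance judgments for two queries.
example_runs = {"q1": ["d1", "d2", "d3"], "q2": ["d4", "d5"]}
example_qrels = {"q1": {"d1", "d3"}, "q2": {"d5"}}
example_map = mean_average_precision(example_runs, example_qrels)
```

In an experiment of the kind the abstract describes, the same queries would be run against an index built with and without stopword filtering, and the two MAP values compared.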