Processing of Documents and Queries in a Slovene Language Free Text Retrieval System

This paper considers language processing techniques necessary for the implementation of a document retrieval system for Slovenian text data bases After a brief introduction to the main characteristics of the Slovene language, the main body of the paper discusses the development of a stopword list an...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Literary and linguistic computing 1990, Vol.5 (2), p.182-190
Hauptverfasser:	POPOVIČ, MIRKO, WILLETT, PETER
Format:	Artikel
Sprache:	eng
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	This paper considers language processing techniques necessary for the implementation of a document retrieval system for Slovenian text data bases After a brief introduction to the main characteristics of the Slovene language, the main body of the paper discusses the development of a stopword list and of a stemming a algorithm that are to be used for the Processing of natural language documents and queries Two stemming algorithms are described, one context free and the other context sensitive, the latter is found to be far more effective in operation, owing to the large number of context-sensitive and recording rules that are required to reflect fully the morphology of Slovene
ISSN:	0268-1145 1477-4615
DOI:	10.1093/llc/5.2.182