Processing of Documents and Queries in a Slovene Language Free Text Retrieval System
This paper considers language processing techniques necessary for the implementation of a document retrieval system for Slovenian text data bases After a brief introduction to the main characteristics of the Slovene language, the main body of the paper discusses the development of a stopword list an...
Gespeichert in:
Veröffentlicht in: | Literary and linguistic computing 1990, Vol.5 (2), p.182-190 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | This paper considers language processing techniques necessary for the implementation of a document retrieval system for Slovenian text data bases After a brief introduction to the main characteristics of the Slovene language, the main body of the paper discusses the development of a stopword list and of a stemming a algorithm that are to be used for the Processing of natural language documents and queries Two stemming algorithms are described, one context free and the other context sensitive, the latter is found to be far more effective in operation, owing to the large number of context-sensitive and recording rules that are required to reflect fully the morphology of Slovene |
---|---|
ISSN: | 0268-1145 1477-4615 |
DOI: | 10.1093/llc/5.2.182 |