A METHOD FOR AUTOMATICALLY INDEXING DOCUMENTS
A method for retrieving based on a search term together with its corresponding meaning from a set of base documents those documents which contain the search term and in which the search term has the certain meaning to enable the building of an index on the retrieved documents. The method searches fo...
Gespeichert in:
Hauptverfasser: | , |
---|---|
Format: | Patent |
Sprache: | eng ; fre |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | A method for retrieving based on a search term together with its corresponding meaning from a set of base documents those documents which contain the search term and in which the search term has the certain meaning to enable the building of an index on the retrieved documents. The method searches for those base documents among the set of base documents which contain the certain search term and evaluates the found base documents as to whether the search term contained in the found base documents, respectively, has a certain meaning. The evaluation comprises generating a text document to represent elements surrounding the search term and their corresponding absolute or relative position with respect to the search term. The elements of the text document codes the absolute or relative position of the surrounding elements by corresponding text strings. The text document is inputted into a trainable classifying apparatus which has been trained to recognize whether an inputted text document belongs to a certain classification category or not. The training has been performed based on a training sample of text documents which have been generated for documents in which the term surrounded by the surrounding elements has the meaning inputted by the user. The inputted text document is classified to judge whether the search team has the inputted meaning.
L'invention concerne un procédé permettant de récupérer des documents sur la base d'un terme de recherche et de son sens correspondant, dans un ensemble de documents de base contenant ledit terme de recherche et dans lequel le terme de recherche à un sens donnée, pour créer un index à partir desdits documents récupérés. Ce procédé consiste à chercher, dans ledit ensemble de documents, les documents de base qui contiennent ledit terme de recherche et à analyser les documents de base trouvés pour déterminer si le terme de recherche contenu dans lesdits documents de base trouvés, respectivement, a un sens donné. Cette analyse consiste à générer un document-texte pour représenter les éléments entourant le terme de recherche et leur position absolue ou relative correspondante par rapport audit terme de recherche, les éléments dudit document-texte codant lesdites positions absolues ou relatives desdits éléments entourant le terme de recherche à l'aide de chaînes de textes correspondantes ; à entrer ledit document-texte dans un dispositif de classement formé pour déterminer si un document-texte entré appartient ou non à une c |
---|