METHOD FOR RETRIEVING TEXT BLOCKS IN DOCUMENTS

The invention relates to a method for retrieving text blocks in documents, preferably for postal mailings that are to be sorted, e.g. mass mailings. Th e aim of the invention is to retrieve or identify reference text blocks in all types of documents with the aid of distinctive characteristic data re...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
1. Verfasser: WORM, KATJA
Format: Patent
Sprache:eng ; fre
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The invention relates to a method for retrieving text blocks in documents, preferably for postal mailings that are to be sorted, e.g. mass mailings. Th e aim of the invention is to retrieve or identify reference text blocks in all types of documents with the aid of distinctive characteristic data records o f said reference text blocks. According to said method, structure-related characteristics of the text block are extracted as distinctive characteristi cs and compared with characteristics of a characteristic data record of a reference text block, allowing a simple recognition of similar characteristi cs in several text blocks to take place. A first extraction of structure-relate d characteristics can be carried out by the division of a text block into several lines, whose height or spacing is saved to a characteristic data record of a mailing. Different text blocks can be analysed for their similarities by comparing the characteristic data records.