METHOD FOR RETRIEVING TEXT BLOCKS IN DOCUMENTS
The invention relates to a method for retrieving text blocks in documents, preferably for postal mailings that are to be sorted, e.g. mass mailings. Th e aim of the invention is to retrieve or identify reference text blocks in all types of documents with the aid of distinctive characteristic data re...
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Patent |
Sprache: | eng ; fre |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The invention relates to a method for retrieving text blocks in documents, preferably for postal mailings that are to be sorted, e.g. mass mailings. Th e aim of the invention is to retrieve or identify reference text blocks in all types of documents with the aid of distinctive characteristic data records o f said reference text blocks. According to said method, structure-related characteristics of the text block are extracted as distinctive characteristi cs and compared with characteristics of a characteristic data record of a reference text block, allowing a simple recognition of similar characteristi cs in several text blocks to take place. A first extraction of structure-relate d characteristics can be carried out by the division of a text block into several lines, whose height or spacing is saved to a characteristic data record of a mailing. Different text blocks can be analysed for their similarities by comparing the characteristic data records. |
---|