APPARATUS AND METHOD FOR SEPARATION OF OPTICAL CHARACTER RECOGNITION DATA
An apparatus and method is described for the separation of data from adjacent characters of standard type fonts, some of which character pairs may kern or touch. Characters which do not kern or touch are separated by white column detection. Characters which do kern are first detected by a kerning te...
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Patent |
Sprache: | eng ; fre ; ger |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | An apparatus and method is described for the separation of data from adjacent characters of standard type fonts, some of which character pairs may kern or touch. Characters which do not kern or touch are separated by white column detection. Characters which do kern are first detected by a kerning test, which consists of locating white bits which separate the characters while meeting pre-established standards of contiguity. Touching characters are detected by failure to pass the white column test, followed by failure to pass the kerning test. Characters which touch are separated by a statistical analysis, which involves determination of which of several probable vertical data columns has the least number of character bits. Following separation, the characters are compared with pre-established character patterns. |
---|