Method and system for labeling text segment

Bibliographic details
Main authors: CHAO, SHIH-LUNG, LIU, YIN-LI, LIN, TZUUAN, LIN, YIN, SHEN, SHENG-SYUN, HUANG, SHIHNG
Format: Patent
Language: chi ; eng
Description
Summary: A method and system for labeling text segments. The method includes the following steps. First, a document to be recognized is provided; the document includes multiple text images. Then, at least one text segment is recognized, and the text images in the text segment are converted into editable text. Thereafter, at least one item of first correlation information between the text segment and the document is evaluated, and the editable text and the first correlation information are converted into a first feature matrix. Furthermore, a plurality of items of second correlation information between each text segment and the other text segments is evaluated, and the first feature matrix is converted into a second feature matrix using the second correlation information. Then, the second feature matrix is converted into a third feature matrix, which represents the confidence level. The third feature matrix is converted into a one-dimensional matrix, and each element of the one-dimensional matrix represents a label
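
The chain of matrix conversions in the summary can be illustrated with a minimal Python sketch. The abstract does not say how any of the conversions is computed, so everything concrete here is an assumption made for illustration: the hand-picked text and position features, the distance-based weighting between segments, the fixed `projection` matrix, the softmax used as a confidence measure, and all function names. Text recognition is taken as already done, so each segment arrives as editable text plus a hypothetical bounding box `(x, y, width, height)`.

```python
import math

def first_features(segments, page_w, page_h):
    """First feature matrix: one row per segment, combining text features
    with the segment's position relative to the whole document."""
    rows = []
    for text, (x, y, w, h) in segments:
        digit_ratio = sum(c.isdigit() for c in text) / max(len(text), 1)
        rows.append([digit_ratio, len(text) / 50.0,
                     (x + w / 2) / page_w, (y + h / 2) / page_h])
    return rows

def pairwise_weights(segments):
    """Second correlation information (assumed): row-normalized weights
    between segments, larger for segments that lie closer together.
    Each segment also keeps a weight for itself."""
    centers = [(x + w / 2, y + h / 2) for _, (x, y, w, h) in segments]
    weights = []
    for cx, cy in centers:
        row = [1.0 / (1.0 + math.hypot(cx - ox, cy - oy)) for ox, oy in centers]
        total = sum(row)
        weights.append([v / total for v in row])
    return weights

def matmul(a, b):
    """Plain matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def softmax_rows(m):
    """Turn each row of scores into confidences that sum to 1."""
    out = []
    for row in m:
        peak = max(row)
        exps = [math.exp(v - peak) for v in row]
        total = sum(exps)
        out.append([e / total for e in exps])
    return out

def label_segments(segments, page_w, page_h, projection):
    f1 = first_features(segments, page_w, page_h)   # first feature matrix
    f2 = matmul(pairwise_weights(segments), f1)     # second feature matrix
    f3 = softmax_rows(matmul(f2, projection))       # third matrix: confidences
    # One-dimensional result: index of the most confident label per segment.
    return [max(range(len(row)), key=row.__getitem__) for row in f3]

# Hypothetical input: two recognized segments on a 600 x 800 page.
segments = [("Invoice No. 12345", (50, 40, 200, 20)),
            ("Total: 980.00", (50, 700, 150, 20))]
# Hypothetical 4 x 2 projection; in practice such weights would be learned.
projection = [[1.0, -1.0], [0.5, 0.2], [0.0, 0.3], [-0.8, 0.9]]
labels = label_segments(segments, 600, 800, projection)
```

Each entry of `labels` is an index into whatever label set the projection was built for. A real system would presumably learn the features and weights from annotated documents instead of fixing them by hand.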