TEXT DATA STRUCTURING METHOD AND APPARATUS USING LINE INFORMATION

A text data structuring apparatus according to the present invention includes: a data extraction unit which extracts text included in an image and position information of the text on the basis of OCR; a data processing unit which extracts line information included in the image by using the text, the...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: KWON, Ki Beom, MOON, Da Hea, KWON, You Kyung, KIM, Dong Hwan, LIM, Yeo Sol, KO, So Young
Format: Patent
Sprache:eng ; fre ; ger
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A text data structuring apparatus according to the present invention includes: a data extraction unit which extracts text included in an image and position information of the text on the basis of OCR; a data processing unit which extracts line information included in the image by using the text, the position information, and the image; a labeling unit which labels the text as keys or values; and a relationship identification unit which acquires a mapping candidate group including first text, second text, and third text labeled on the basis of the line information, calculates a first similarity score representing meaning similarity between the first text and the third text and a second similarity score representing meaning similarity between the second text and the third text, and decides text to be mapped with the third text among of the first text and the second text.