CONVERTING TABLE DATA INTO COMPONENT PARTS

A system and method of processing source data that includes table data by converting the table data into machine encoded text data having associated therewith text coordinate data having a Y-axis component and an X-axis component, and then generating from the machine encoded text data a plurality of...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: MURALIDHARAN, Vishnu, SRINIVAS, Raghuram, HARSH, Seth, BOSTON, Marisa Ferrara
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A system and method of processing source data that includes table data by converting the table data into machine encoded text data having associated therewith text coordinate data having a Y-axis component and an X-axis component, and then generating from the machine encoded text data a plurality of pixels along the Y-axis component and the X-axis component. The system then performs a clustering technique on the plurality of pixels to generate a plurality of clusters of pixels based on similar attributes, and classifying each of the plurality of clusters of pixels as a selected row of the table and as a selected column of the table, thus making available the information encoded in the table for subsequent processing.