Method and apparatus for shaping data using signature recognition

Bibliographic details
Main authors: Chan, Wing-Leung; Grosset, Robin Neil; Yanofsky, Corey Matthew; Smith, Michael Thomas
Format: Patent
Language: English
Description
Summary: Methods are provided for semantic processing of data files, including detecting the formats of data embedded in the data files and converting the data to formats compatible with a data analysis tool. The method may comprise determining whether the data file comprises signature characteristics associated with a known data format and, if so, determining a set of data manipulation operations associated with that format to convert the data file to a format compatible with the data analysis tool. The method may further comprise semantically analyzing components of the data files to assess their formatting against a set of criteria required by the data analysis tool, and determining sets of data manipulation operations to perform to convert the data file to a compatible format.
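
The two-stage flow in the summary can be illustrated with a minimal Python sketch. It is not the patented implementation. The "titled report" signature, the blank-row criterion, and every name in it (`KNOWN_FORMATS`, `has_title_row`, `plan_operations`, `shape`) are hypothetical, chosen only to show a signature lookup followed by a fallback criteria check.

```python
from typing import Callable, List, Optional, Tuple

Rows = List[List[str]]
Operation = Callable[[Rows], Rows]


def drop_blank_rows(rows: Rows) -> Rows:
    """Data manipulation operation: remove rows whose cells are all empty."""
    return [r for r in rows if any(cell.strip() for cell in r)]


def drop_title_row(rows: Rows) -> Rows:
    """Data manipulation operation: remove a leading report-title row."""
    return rows[1:]


def has_title_row(rows: Rows) -> bool:
    """Hypothetical signature: one filled cell in row 0, several in row 1."""
    if len(rows) < 2:
        return False
    filled_first = sum(1 for c in rows[0] if c.strip())
    filled_second = sum(1 for c in rows[1] if c.strip())
    return filled_first == 1 and filled_second > 1


# Assumed registry: (format name, signature test, operations for that format).
KNOWN_FORMATS: List[Tuple[str, Callable[[Rows], bool], List[Operation]]] = [
    ("titled_report", has_title_row, [drop_title_row, drop_blank_rows]),
]


def plan_operations(rows: Rows) -> Tuple[Optional[str], List[Operation]]:
    """Return the matched format name (or None) and the operations to apply."""
    # Stage 1: signature recognition against known formats.
    for name, matches, ops in KNOWN_FORMATS:
        if matches(rows):
            return name, list(ops)
    # Stage 2: no signature matched, so check individual criteria instead.
    ops: List[Operation] = []
    if any(not any(c.strip() for c in r) for r in rows):
        ops.append(drop_blank_rows)  # assumed criterion: no blank rows
    return None, ops


def shape(rows: Rows) -> Rows:
    """Apply the planned operations in order and return the converted rows."""
    _, ops = plan_operations(rows)
    for op in ops:
        rows = op(rows)
    return rows
```

Calling `shape` on rows that begin with a single-cell title line takes the signature path. Rows without that signature fall through to the criteria check, which here only tests for blank rows.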