Data matching for column-oriented data tables
Main authors: | , , , |
---|---|
Format: | Patent |
Language: | eng |
Summary: | A computer-implemented method includes receiving a column-oriented table comprising data for a column family, wherein the data for the column family comprises column names and corresponding column values, receiving a set of anonymous column names for the column family, receiving a set of synonymous column names for the column family, determining a weighting for each column name that is not an anonymous column name based on the count or frequency of occurrence of the column name and the synonymous column names within the column-oriented table, and processing the column-oriented table with a probabilistic matching engine using the weighting for each column name. A corresponding computer program product and computer system are also disclosed herein. |
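The weighting step described in the summary can be illustrated with a minimal Python sketch. It is not the patented implementation: the record does not give a formula, so the sketch assumes the weight of a column name is the relative frequency of that name together with its synonyms among all non-anonymous column-name occurrences. The function name `column_name_weights`, the row-as-dictionary representation of the column family, and the sample data are all hypothetical. The probabilistic matching engine itself is not specified in the record and is left out.

```python
from collections import Counter


def column_name_weights(rows, anonymous_names, synonym_groups):
    """Return {column_name: weight} for every non-anonymous column name.

    Assumption (not from the record): weight = occurrences of the name and
    its synonyms, divided by all non-anonymous column-name occurrences.
    """
    # Map each synonymous name to one representative of its group.
    canonical = {}
    for group in synonym_groups:
        representative = min(group)  # deterministic choice
        for name in group:
            canonical[name] = representative

    # Count occurrences per synonym group, skipping anonymous names.
    counts = Counter()
    for row in rows:
        for name in row:
            if name not in anonymous_names:
                counts[canonical.get(name, name)] += 1

    total = sum(counts.values())
    if total == 0:
        return {}

    # Every observed non-anonymous name gets the weight of its group.
    weights = {}
    for row in rows:
        for name in row:
            if name not in anonymous_names:
                weights[name] = counts[canonical.get(name, name)] / total
    return weights


# Hypothetical column family: each row maps column names to column values.
rows = [
    {"phone": "555-0100", "col1": "x"},
    {"telephone": "555-0100", "email": "a@example.com"},
    {"phone": "555-0199", "email": "b@example.com"},
]
weights = column_name_weights(
    rows,
    anonymous_names={"col1"},
    synonym_groups=[{"phone", "telephone"}],
)
print(weights)
```

In this sample, "phone" and "telephone" are counted together as one group, and the anonymous name "col1" receives no weight. A matching engine could then use these weights when comparing rows, although how it does so is outside what the record describes.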