System and Method for Joining Datasets
Main authors: | , , , , , |
---|---|
Format: | Patent |
Language: | eng |
Subjects: | |
Summary: | A computer-implemented method includes receiving a first dataset including a first table having a first number of parts, where each part of the first number of parts represents either a row or a column of the first table, and receiving a second dataset including a second table having a second number of parts, where each part of the second number of parts represents either a row or a column of the second table. For each part of the first number of parts, the method includes forming a string representation of the part from a number of values associated with the part and determining a numerical representation of the string representation of the part. For each part of the second number of parts, the method includes forming a string representation of the part from a number of values associated with the part and determining a numerical representation of the string representation of the part. The method includes determining a mapping between at least some parts of the first table and the second table, including determining that a first set of one or more parts of the first number of parts corresponds to a second set of one or more parts of the second number of parts based at least in part on a similarity between numerical representations of the parts. |
---|
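The steps named in the summary (string representation of each part, numerical representation of that string, mapping by similarity) can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and is not taken from the patent: the helper names `part_to_string`, `string_to_vector`, `cosine` and `map_parts`, the use of character trigram counts as the numerical representation, cosine similarity as the similarity measure, and the `threshold` value. The record does not say which representation or measure the method actually uses.

```python
from collections import Counter
from math import sqrt


def part_to_string(values):
    """Form a string representation of one part (a row or a column)."""
    return " ".join(str(v).strip().lower() for v in values)


def string_to_vector(text, n=3):
    """Assumed numerical representation: counts of character n-grams."""
    padded = f" {text} "
    return Counter(padded[i:i + n] for i in range(len(padded) - n + 1))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm_a = sqrt(sum(c * c for c in a.values()))
    norm_b = sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def map_parts(first_parts, second_parts, threshold=0.5):
    """Map each part of the first table to its most similar part of the second.

    Parts whose best similarity falls below the (assumed) threshold stay unmapped.
    """
    second_vecs = [string_to_vector(part_to_string(p)) for p in second_parts]
    mapping = {}
    for i, part in enumerate(first_parts):
        vec = string_to_vector(part_to_string(part))
        scores = [cosine(vec, other) for other in second_vecs]
        if not scores:
            continue
        best = max(range(len(scores)), key=scores.__getitem__)
        if scores[best] >= threshold:
            mapping[i] = best
    return mapping


# Hypothetical example data: each part is a column, given as a list of values.
first_columns = [
    ["Alice Smith", "Bob Jones"],
    ["alice@example.com", "bob@example.com"],
]
second_columns = [
    ["bob@example.com", "alice@example.com"],
    ["Smith, Alice", "Jones, Bob"],
]

column_mapping = map_parts(first_columns, second_columns)
print(column_mapping)
```

In this toy input the second table holds the same two columns in swapped order and with differently formatted names, so the sketch pairs the name column with the name column and the e-mail column with the e-mail column. A one-to-one greedy match per part is the simplest choice; the summary also allows sets of several parts to correspond, which this sketch does not attempt.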