System and Method for Joining Datasets

A computer-implemented method includes receiving a first dataset including a first table having a first number of parts, where each part of the first number of parts represents either a row or a column of the first table and receiving a second dataset including a second table having a second number...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Sim, Jaehyun, Shah, Amar Himansu, Lanier, Nathaniel Clinton, Ramesh, Vinayak, Shah, Devavrat Dilitkumar, Dharaskar, Arth
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A computer-implemented method includes receiving a first dataset including a first table having a first number of parts, where each part of the first number of parts represents either a row or a column of the first table and receiving a second dataset including a second table having a second number of parts, where each part of the second number of parts represents either a row or a column of the second table. For each part of the first number of parts the method includes forming a string representation of the part from a number of values associated with the part and determining a numerical representation of the string representation of the part. For each part of the second number of parts, the method includes forming a string representation of the part from a number of values associated with the part and determining a numerical representation of the string representation of the part. The method includes determining a mapping between at least some parts of the first table and second table including determining that a first set of one or more parts of the first number of parts correspond a second set of one or more parts of the second number of parts based at least in part on a similarity between numerical representations of the parts.