System and Method for Joining Datasets

A computer-implemented method comprising: receiving, with a computer, first and second datasets; performing, with the computer, column discovery on the first and second datasets using a first trained machine-learning model to produce a column map that indexes one or more columns in the first dataset...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Sim, Jaehyun, Shah, Amar Himansu, Lanier, Nathaniel Clinton, Ramesh, Vinayak, Shah, Devavrat Dilitkumar, Dharaskar, Arth
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A computer-implemented method comprising: receiving, with a computer, first and second datasets; performing, with the computer, column discovery on the first and second datasets using a first trained machine-learning model to produce a column map that indexes one or more columns in the first dataset to one or more columns in the second dataset; performing, with the computer, row discovery on the first and second datasets using a second trained machine-learning model, a trained approximate nearest neighbor index, and the column discovery to produce a row map that indexes one or more rows in the first dataset to one or more rows in the second dataset; combining, with the computer, the first and second datasets using the column map and the row map to form a combined dataset; and performing one or more actions with the combined dataset.