Machine learning to tune probabilistic matching in entity resolution systems

Techniques for data evaluation are provided. A plurality of data records is received, and a first comparison vector is generated by comparing a first and a second data record of the plurality of data records, where the first comparison vector indicates differences between the first and second data r...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: O'Neill, Jr., James Albert, Khatibi, Mohammad, Seth, Abhishek, Naganna, Soma Shekar, Schumacher, Scott
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Techniques for data evaluation are provided. A plurality of data records is received, and a first comparison vector is generated by comparing a first and a second data record of the plurality of data records, where the first comparison vector indicates differences between the first and second data records. A machine learning model is trained based at least in part on the first comparison vector. The plurality of data records is evaluated using the machine learning model, and at least two of the plurality of data records are linked based on the evaluation.