SELF-HEALING DATA CLUSTERS

Disclosed are various embodiments for self-healing data clusters. One or more candidates are determined from the candidate pool to be evaluated with the new record. A unique pair combination is generated for each one of the candidates of the candidate pool and the new record. Next, candidate data fo...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Dhawan, Mahasweta, D'Souza, Alan Wilson, Handa, Anmol, Modadugu, Rajendra Prasad, Jangra, Shipali, Kamil, Hen I, Sharma, Bhupesh, Dutta, Sayantan, Alvarez Silverstein, Karina I, Prajapati, Sumit
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Disclosed are various embodiments for self-healing data clusters. One or more candidates are determined from the candidate pool to be evaluated with the new record. A unique pair combination is generated for each one of the candidates of the candidate pool and the new record. Next, candidate data for the one or more candidates is identified from the existing record based at least in part on one or more matching rules. A weight is assigned to one or more matching rules. Then, the candidate data of the one or more candidates and the new record is evaluated for a data linkage. A distance is calculated between each of the unique pair combinations. Finally, the candidates of the existing record and the new record are clustered into groups.