Automated identification of duplicate information objects

Systems and methods are configured to determine whether a particular information object is a duplicate of an object found in separate information objects. In various embodiments, the particular information object and each separate information object includes a set of data fields for storing data val...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Browne, Gillian S, Gonzalez, Sergio Moreno, Keating, Diarmuid, Hines, Ryan A
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Systems and methods are configured to determine whether a particular information object is a duplicate of an object found in separate information objects. In various embodiments, the particular information object and each separate information object includes a set of data fields for storing data values that allows identical values to be stored in different fields for the objects. The data values for the particular information object are combined to form a data structure that includes a data element for each value. A determination as to whether the particular information object is an exact or partial match of a separate information object is made by performing a function on the data structure for the particular information object and a data structure for the separate information object to identify an intersection that includes data values for the particular information object that have an identical match with values for the separate information object.