Automated identification of duplicate information objects
Main authors: | , , , |
---|---|
Format: | Patent |
Language: | eng |
Summary: | Systems and methods are configured to determine whether a particular information object duplicates any of a set of separate information objects. In various embodiments, the particular information object and each separate information object include a set of data fields for storing data values, where identical values may be stored in different fields in different objects. The data values for the particular information object are combined to form a data structure that includes a data element for each value. Whether the particular information object is an exact or partial match of a separate information object is determined by applying a function to the data structure for the particular information object and the data structure for the separate information object. The function identifies an intersection: the data values of the particular information object that are identical to values of the separate information object. |
---|
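
The comparison described in the summary can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes that each information object is a mapping from field names to hashable values, that the combined data structure is a multiset (`collections.Counter`), and that the function applied is multiset intersection. The names `value_counts` and `classify_match`, the sample records, and the rule separating "exact" from "partial" are hypothetical choices made for illustration.

```python
from collections import Counter
from typing import Any, Mapping, Tuple


def value_counts(obj: Mapping[str, Any]) -> Counter:
    """Combine an object's values into one structure, one element per value.

    Field names are deliberately discarded so that identical values stored
    in different fields still compare as equal. Empty (None) fields are
    skipped; this is an assumption of the sketch.
    """
    return Counter(v for v in obj.values() if v is not None)


def classify_match(candidate: Mapping[str, Any],
                   other: Mapping[str, Any]) -> Tuple[str, Counter]:
    """Return a match label and the intersection of the two value structures."""
    a, b = value_counts(candidate), value_counts(other)
    common = a & b  # multiset intersection: values present in both objects
    if not common:
        return "none", common
    # Assumed rule: "exact" means every value on both sides is in the
    # intersection; any smaller non-empty overlap counts as "partial".
    if common == a and common == b:
        return "exact", common
    return "partial", common


# Hypothetical sample records.
candidate = {"first_name": "Ada", "last_name": "Lovelace",
             "email": "ada@example.com"}
swapped = {"first_name": "Lovelace", "last_name": "Ada",
           "email": "ada@example.com"}
overlapping = {"first_name": "Ada", "last_name": "Byron", "email": None}
unrelated = {"first_name": "Grace", "last_name": "Hopper",
             "email": "grace@example.com"}

label_swapped, _ = classify_match(candidate, swapped)
label_overlapping, shared = classify_match(candidate, overlapping)
label_unrelated, _ = classify_match(candidate, unrelated)
```

Because field names are dropped before the comparison, the record with first and last name swapped is still treated as the same set of values. A multiset is used rather than a plain set so that a value stored twice in one object is counted twice.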