Systems And Methods For Identifying Potential Duplicate Entries In A Database

Bibliographic details
Main author: RINEER BRIAN CARL
Format: Patent
Language: English
Description
Summary: In accordance with the teachings described herein, systems and methods are provided for identifying potential duplicate entries in a database. Matchcodes are generated for a plurality of records, wherein a matchcode for a record may be generated by: receiving a character string from the record; determining whether the character string includes a non-essential character substring; if the non-essential character substring is missing from the character string, then generating the matchcode from the character string and adding a wildcard character to the matchcode in place of the missing non-essential character substring. The matchcodes for the plurality of records may be compared to identify matching pairs of matchcodes, wherein for the purpose of identifying a matching pair of matchcodes, two characters are considered the same if they are equal or if one or both are wildcard characters.
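
The matching scheme in the summary can be illustrated with a minimal Python sketch. It is not the patented implementation: the matchcode layout (three letters of the first name, one middle-initial slot, four letters of the last name), the choice of the middle name as the non-essential substring, the `*` wildcard symbol, and all function names are assumptions made for illustration only.

```python
from itertools import combinations

WILDCARD = "*"  # assumed symbol; the summary does not name a specific wildcard


def make_matchcode(name: str) -> str:
    """Build a fixed-length matchcode from a 'First [Middle] Last' string.

    Hypothetical layout: 3 chars of first name, 1 middle-initial slot,
    4 chars of last name. The middle name is treated as the non-essential
    substring; when it is missing, its slot holds the wildcard.
    """
    parts = name.upper().split()
    if len(parts) < 2:
        raise ValueError("expected at least a first and a last name")
    first, last = parts[0], parts[-1]
    middle = parts[1][0] if len(parts) >= 3 else WILDCARD
    return first[:3].ljust(3, "_") + middle + last[:4].ljust(4, "_")


def chars_match(a: str, b: str) -> bool:
    """Two characters match if they are equal or either one is a wildcard."""
    return a == b or a == WILDCARD or b == WILDCARD


def matchcodes_match(x: str, y: str) -> bool:
    """Compare two matchcodes position by position."""
    return len(x) == len(y) and all(chars_match(a, b) for a, b in zip(x, y))


def find_potential_duplicates(records: list[str]) -> list[tuple[int, int]]:
    """Return index pairs of records whose matchcodes match."""
    codes = [make_matchcode(r) for r in records]
    return [
        (i, j)
        for (i, ci), (j, cj) in combinations(enumerate(codes), 2)
        if matchcodes_match(ci, cj)
    ]


records = ["John Q Smith", "John Smith", "Jane Smith", "John R Smith"]
print([make_matchcode(r) for r in records])
print(find_potential_duplicates(records))
```

In this sketch "John Smith" pairs with both "John Q Smith" and "John R Smith", while the latter two do not pair with each other, so wildcard matching is not transitive. The pairwise loop is quadratic in the number of records and is meant only to show the comparison rule.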