ANOMALOUS DATA IDENTIFICATION FOR TABULAR DATA

Systems and methods identify anomalous data in tabular data. A set of tabular data records is received. Each tabular data record includes data elements for a numbers of attributes, with each data element providing a value for a corresponding attribute. An anomaly score is generated for each data ele...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: SAINI, Shiv Kumar, VADREVU, Keshav, CHOUDHARY, Gautam, TYAGI, Atharv, NARAYANAM, Ramasuri, MUKHERJEE, Koyel, PADALA, Manisha
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Systems and methods identify anomalous data in tabular data. A set of tabular data records is received. Each tabular data record includes data elements for a numbers of attributes, with each data element providing a value for a corresponding attribute. An anomaly score is generated for each data element of each tabular data record. Additionally, an evidence set is defined for each attribute and each tabular data record based on the anomaly scores for the data elements. An anomaly score is generated for each attribute and each tabular data record using the evidence sets. An output is provided that identifies one or more anomalous data subsets determined based on the anomaly scores for the attributes and tabular data records. Each anomalous data subset identifies a subset of attributes and a subset of tabular data records.