Using ontological distance to measure unexpectedness of correlation

A method, system and computer program product for evaluating the interestingness of correlated data. The fields of a dataset are classified by tagging the fields in terms of real world concepts. A correlation analysis on the dataset is performed to generate a correlation coefficient for each pair of...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Pourshahid, Alireza, Wadhwa, Vinay N, Watts, Graham A, Wei, Qing
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A method, system and computer program product for evaluating the interestingness of correlated data. The fields of a dataset are classified by tagging the fields in terms of real world concepts. A correlation analysis on the dataset is performed to generate a correlation coefficient for each pair of fields of correlated data items. An "ontological distance" between the tagged concepts for each pair of fields of correlated data items represented as nodes in the ontology is determined. A score is generated indicating an interestingness of correlation for each pair of fields of correlated data items based on the correlation coefficient and the ontological distance between the tagged concepts for each pair of fields of correlated data items. By utilizing the ontological distance with the correlation analysis to determine the interestingness of correlation, correlations that may not be obvious to users and unexpectedly correlated may be identified.