Efficient clustering of databases induced by local patterns

Many large organizations have multiple large databases as they transact from multiple branches. Most of the previous pieces of work are based on a single database. Thus, it is necessary to study data mining on multiple databases. In this paper, we propose two measures of similarity between a pair of...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Decision Support Systems 2008-03, Vol.44 (4), p.925-943
Hauptverfasser:	Adhikari, Animesh, Rao, P.R.
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Applied sciences Cluster analysis Clustering Computer science control theory systems Data mining Data processing. List processing. Character string processing Decision theory. Utility theory Efficiency Exact sciences and technology IS coding Local pattern analysis Memory organisation. Data processing Multi-database mining Operational research and scientific management Operational research. Management science Similarity between a pair of databases Software Studies
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Many large organizations have multiple large databases as they transact from multiple branches. Most of the previous pieces of work are based on a single database. Thus, it is necessary to study data mining on multiple databases. In this paper, we propose two measures of similarity between a pair of databases. Also, we propose an algorithm for clustering a set of databases. Efficiency of the clustering process has been improved using the following strategies: reducing execution time of clustering algorithm, using more appropriate similarity measure, and storing frequent itemsets space efficiently.
ISSN:	0167-9236 1873-5797
DOI:	10.1016/j.dss.2007.11.001