GREWA Scalable Frequent Subgraph Discovery Algorithm

Existing algorithms that mine graph datasets to discover patterns corresponding to frequently occurring subgraphs can operate efficiently on graphs that are sparse, contain a large number of relatively small connected components, have vertices with low and bounded degrees, and contain well-labeled v...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Kuramochi, Michihiro, Karypis, George
Format:	Report
Sprache:	eng
Schlagworte:	ALGORITHMS CLASSIFICATION CLUSTERING EXPERIMENTAL DATA FREQUENT PATTERN DISCOVERY FREQUENT SUBGRAPH GRAPH MINING GRAPHS GREW HEURISTIC METHODS INPUT Numerical Mathematics Operations Research SCALING FACTOR SPECIFIC CONSTRAINTS
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Existing algorithms that mine graph datasets to discover patterns corresponding to frequently occurring subgraphs can operate efficiently on graphs that are sparse, contain a large number of relatively small connected components, have vertices with low and bounded degrees, and contain well-labeled vertices and edges. However, there are a number of applications that lead to graphs that do not share these characteristics, for which these algorithms highly become unscalable. In this paper we propose a heuristic algorithm called GREW to overcome the limitations of existing complete or heuristic frequent subgraph discovery algorithms. GREW is designed to operate on a large graph and to find patterns corresponding to connected subgraphs that have a large number of vertex-disjoint embeddings. Our experimental evaluation shows that GREW is efficient, can scale to very large graphs, and find non-trivial patterns that cover large portions of the input graph and the lattice of frequent patterns. Sponsored in part by the National Science Foundation grants ACI-0133464 and ACI-0212828.