Finding Maximal Cliques in Massive Networks

Maximal clique enumeration is a fundamental problem in graph theory and has important applications in many areas such as social network analysis and bioinformatics. The problem is extensively studied; however, the best existing algorithms require memory space linear in the size of the input graph. T...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	ACM transactions on database systems 2011-12, Vol.36 (4), p.1-34
Hauptverfasser:	CHENG, James, YIPING KE, ADA WAI-CHEE FU, JEFFREY XU YU, LINHONG ZHU
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Applied sciences Biological and medical sciences Combinatorics Combinatorics. Ordered structures Computer science control theory systems Computer systems and distributed systems. User interface Exact sciences and technology Fundamental and applied biological sciences. Psychology General aspects Graph theory Information retrieval. Graph Mathematics Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects) Sciences and techniques of general use Software Theoretical computing
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Maximal clique enumeration is a fundamental problem in graph theory and has important applications in many areas such as social network analysis and bioinformatics. The problem is extensively studied; however, the best existing algorithms require memory space linear in the size of the input graph. This has become a serious concern in view of the massive volume of today's fast-growing networks. We propose a general framework for designing external-memory algorithms for maximal clique enumeration in large graphs. The general framework enables maximal clique enumeration to be processed recursively in small subgraphs of the input graph, thus allowing in-memory computation of maximal cliques without the costly random disk access. We prove that the set of cliques obtained by the recursive local computation is both correct (i.e., globally maximal) and complete. The subgraph to be processed each time is defined based on a set of base vertices that can be flexibly chosen to achieve different purposes. We discuss the selection of the base vertices to fully utilize the available memory in order to minimize I/O cost in static graphs, and for update maintenance in dynamic graphs. We also apply our framework to design an external-memory algorithm for maximum clique computation in a large graph.
ISSN:	0362-5915 1557-4644
DOI:	10.1145/2043652.2043654