RaftGP: Random Fast Graph Partitioning
Graph partitioning (GP), a.k.a. community detection, is a classic problem that divides the node set of a graph into densely-connected blocks. Following prior work on the IEEE HPEC Graph Challenge benchmark and recent advances in graph machine learning, we propose a novel RAndom FasT Graph Partitioni...
Gespeichert in:
Hauptverfasser: | , , , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Graph partitioning (GP), a.k.a. community detection, is a classic problem
that divides the node set of a graph into densely-connected blocks. Following
prior work on the IEEE HPEC Graph Challenge benchmark and recent advances in
graph machine learning, we propose a novel RAndom FasT Graph Partitioning
(RaftGP) method based on an efficient graph embedding scheme. It uses the
Gaussian random projection to extract community-preserving features from
classic GP objectives. These features are fed into a graph neural network (GNN)
to derive low-dimensional node embeddings. Surprisingly, our experiments
demonstrate that a randomly initialized GNN even without training is enough for
RaftGP to derive informative community-preserving embeddings and support
high-quality GP. To enable the derived embeddings to tackle GP, we introduce a
hierarchical model selection algorithm that simultaneously determines the
number of blocks and the corresponding GP result. We evaluate RaftGP on the
Graph Challenge benchmark and compare the performance with five baselines,
where our method can achieve a better trade-off between quality and efficiency.
In particular, compared to the baseline algorithm of the IEEE HPEC Graph
Challenge, our method is 6.68x -- 23.9x faster on graphs with 1E3 -- 5E4 nodes
and at least 64.5x faster on larger (1E5 node) graphs on which the baseline
takes more than 1E4 seconds. Our method achieves better accuracy on all test
cases. We also develop a new graph generator to address some limitations of the
original generator in the benchmark. |
---|---|
DOI: | 10.48550/arxiv.2312.01560 |