RTGEN++: A Relative Temporal Graph GENerator
Graph management systems have become popular for storing and querying graph-oriented data, and they are often evaluated with benchmarks based on large-scale graphs. However, obtaining such graphs is difficult due to their limited public availability. Several graph generators have been developed to a...
Gespeichert in:
Veröffentlicht in: | Future generation computer systems 2023-09, Vol.146, p.139-155 |
---|---|
Hauptverfasser: | , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Graph management systems have become popular for storing and querying graph-oriented data, and they are often evaluated with benchmarks based on large-scale graphs. However, obtaining such graphs is difficult due to their limited public availability. Several graph generators have been developed to address this challenge, producing synthetic graphs with characteristics similar to real-world graphs, such as degree distribution, community structure, and diameter. Generating synthetic graphs with constantly changing topology has received less attention despite its importance in developing useful benchmarks for temporal graph systems. In this paper, we present RTGEN++, a temporal graph generator that supports two evolution models. The first model generates temporal graphs by controlling the evolution of their degree distributions, using optimal transport methods to minimize the transformation effort. We also extend our method in order to consider the community structure of the generated graphs. The second model allows one to control the number of added and removed graph entities, thus enabling the modeling of the evolution of real-world graphs in many use cases. Our generator also includes a decorator that adds types and time-varying attributes to nodes and edges, enhancing the generated graphs and aligning with data platforms that use the property graph model. We validate our approach with experiments that demonstrate the reliability of the generated graphs in approximating ground-truth parameters.
•Synthetic datasets are useful for benchmarking data platforms.•The characteristics of synthetically generated graphs should mimic those of real-world graphs.•Managing temporal graphs is of great interest for analyzing the evolution of underlying systems.•RTGEN++ produces synthetic temporal graphs based on two evolution models that can cover a wide range of applications.•RTGEN++ is coupled with types and time-varying attributes that can add a thorough description of the graph entities. |
---|---|
ISSN: | 0167-739X 1872-7115 |
DOI: | 10.1016/j.future.2023.03.023 |