Optimizing subgraph retrieval and matching with an efficient indexing scheme

Bibliographic details
Published in:	Knowledge and information systems 2024-11, Vol.66 (11), p.6815-6843
Main authors:	He, Jiezhong, Chen, Yixin, Liu, Zhouyang, Li, Dongsheng
Format:	Article
Language:	English
Subjects:	Computer Science; Data Mining and Knowledge Discovery; Data structures; Database Management; Graph matching; Graph theory; Graphs; Indexing; Information Storage and Retrieval; Information Systems and Communication Service; Information Systems Applications (incl. Internet); IT in Business; Regular Paper; Retrieval; Semantics
Online access:	Full text

Description
Summary:	The graph index is an effective data structure that is widely applied in subgraph retrieval and matching. It records and compares the frequencies of a set of specific features to detect subgraph containment on the fly, which is the foundation of the filtering techniques for subgraph retrieval and matching. However, because subgraph counting is NP-hard, current graph indices are difficult to build on large graphs. Even counting simple path and cycle graphs is NP-hard. We observe that the monotone property of the counting process is crucial for the correctness and precision of the index. Therefore, we introduce an efficient graph indexing scheme that counts path and cycle features monotonically under relaxed semantics. In addition to the filtering techniques, we propose reordering the search candidates via our index. Experimental results show that our index can be constructed 1–3 orders of magnitude faster than existing methods and can handle graphs 1–3 orders of magnitude larger than those handled in previous work. Our index-boosted filtering and ordering techniques prove effective in optimizing the subgraph retrieval and matching process.
ISSN:	0219-1377 0219-3116
DOI:	10.1007/s10115-024-02175-7
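
The summary describes filtering by comparing feature frequencies: a data graph can contain a query only if it has at least as many occurrences of every indexed feature. The sketch below illustrates that idea under stated assumptions and is not the authors' scheme. It assumes undirected, vertex-labeled graphs given as adjacency lists, and it uses label sequences of walks (vertices may repeat) as one possible relaxed feature, since walks can be counted in polynomial time and an injective, edge-preserving embedding maps distinct walks of the query to distinct walks of the data graph, which keeps the counts monotone. The names walk_label_counts and may_contain are hypothetical.

```python
from collections import Counter


def walk_label_counts(adj, labels, max_len):
    """Count walks with at most max_len edges, grouped by label sequence.

    adj maps each vertex to a list of neighbours; labels maps each vertex
    to its label. Walks may revisit vertices (an assumed relaxation of
    simple paths, chosen here only for illustration).
    """
    counts = Counter()
    # frontier maps (end vertex, label sequence) to the number of such walks
    frontier = Counter({(v, (labels[v],)): 1 for v in adj})
    for step in range(max_len + 1):
        for (_, seq), n in frontier.items():
            counts[seq] += n
        if step == max_len:
            break
        extended = Counter()
        for (v, seq), n in frontier.items():
            for w in adj[v]:
                extended[(w, seq + (labels[w],))] += n
        frontier = extended
    return counts


def may_contain(query_counts, data_counts):
    """Return False only when containment is impossible.

    A True result means the data graph survives the filter and still has
    to be verified by an exact subgraph matching algorithm.
    """
    return all(data_counts[f] >= n for f, n in query_counts.items())
```

A retrieval system in this style would compute the counts for each data graph once, store them as the index, and call may_contain per query to prune candidates before running exact matching.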