An Information-Theoretic Bound on p -Values for Detecting Communities Shared between Weighted Labeled Graphs

Extraction of subsets of highly connected nodes ("communities" or modules) is a standard step in the analysis of complex social and biological networks. We here consider the problem of finding a relatively small set of nodes in two labeled weighted graphs that is highly connected in both....

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Entropy (Basel, Switzerland) Switzerland), 2022-09, Vol.24 (10), p.1329
Hauptverfasser: Obradovic, Predrag, Kovačević, Vladimir, Li, Xiqi, Milosavljevic, Aleksandar
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Extraction of subsets of highly connected nodes ("communities" or modules) is a standard step in the analysis of complex social and biological networks. We here consider the problem of finding a relatively small set of nodes in two labeled weighted graphs that is highly connected in both. While many scoring functions and algorithms tackle the problem, the typically high computational cost of permutation testing required to establish the -value for the observed pattern presents a major practical obstacle. To address this problem, we here extend the recently proposed CTD ("Connect the Dots") approach to establish information-theoretic upper bounds on the -values and lower bounds on the size and connectedness of communities that are detectable. This is an innovation on the applicability of CTD, broadening its use to pairs of graphs.
ISSN:1099-4300
1099-4300
DOI:10.3390/e24101329