Bayesian analysis for partially observed network data, missing ties, attributes and actors

•We propose a Bayesian data augmentation scheme for analyzing ERGM with partially observed data.•We demonstrate estimation and predictive distributions for three quarters of ties unobserved.•We demonstrate analysis when a particularly influential attribute is only partially observed. We consider par...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Social networks 2013-10, Vol.35 (4), p.514-527
Hauptverfasser: Koskinen, Johan H., Robins, Garry L., Wang, Peng, Pattison, Philippa E.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•We propose a Bayesian data augmentation scheme for analyzing ERGM with partially observed data.•We demonstrate estimation and predictive distributions for three quarters of ties unobserved.•We demonstrate analysis when a particularly influential attribute is only partially observed. We consider partially observed network data as defined in Handcock and Gile (2010). More specifically we introduce an elaboration of the Bayesian data augmentation scheme of Koskinen et al. (2010) that uses the exchange algorithm (Caimo and Friel, 2011) for inference for the exponential random graph model (ERGM) where tie variables are partly observed. We illustrate the generating of posteriors and unobserved tie-variables with empirical network data where 74% of the tie variables are unobserved under the assumption that some standard assumptions hold true. One of these assumptions is that covariates are fixed and completely observed. A likely scenario is that also covariates might only be partially observed and we propose a further extension of the data augmentation algorithm for missing attributes. We provide an illustrative example of parameter inference with nearly 30% of dyads affected by missing attributes (e.g. homophily effects). The assumption that all actors are known is another assumption that is liable to be violated so that there are “covert actors”. We briefly discuss various aspects of this problem with reference to the Sageman (2004) data set on suspected terrorists. We conclude by identifying some areas in need of further research.
ISSN:0378-8733
1879-2111
DOI:10.1016/j.socnet.2013.07.003