Using machine learning to assess covariate balance in matching studies

In order to assess the effectiveness of matching approaches in observational studies, investigators typically present summary statistics for each observed pre‐intervention covariate, with the objective of showing that matching reduces the difference in means (or proportions) between groups to as clo...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of evaluation in clinical practice 2016-12, Vol.22 (6), p.848-854
Hauptverfasser: Linden, Ariel, Yarnold, Paul R.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In order to assess the effectiveness of matching approaches in observational studies, investigators typically present summary statistics for each observed pre‐intervention covariate, with the objective of showing that matching reduces the difference in means (or proportions) between groups to as close to zero as possible. In this paper, we introduce a new approach to distinguish between study groups based on their distributions of the covariates using a machine‐learning algorithm called optimal discriminant analysis (ODA). Assessing covariate balance using ODA as compared with the conventional method has several key advantages: the ability to ascertain how individuals self‐select based on optimal (maximum‐accuracy) cut‐points on the covariates; the application to any variable metric and number of groups; its insensitivity to skewed data or outliers; and the use of accuracy measures that can be widely applied to all analyses. Moreover, ODA accepts analytic weights, thereby extending the assessment of covariate balance to any study design where weights are used for covariate adjustment. By comparing the two approaches using empirical data, we are able to demonstrate that using measures of classification accuracy as balance diagnostics produces highly consistent results to those obtained via the conventional approach (in our matched‐pairs example, ODA revealed a weak statistically significant relationship not detected by the conventional approach). Thus, investigators should consider ODA as a robust complement, or perhaps alternative, to the conventional approach for assessing covariate balance in matching studies.
ISSN:1356-1294
1365-2753
DOI:10.1111/jep.12538