Predictive Modeling With Psychological Panel Data
Longitudinal panels include several thousand participants and variables. Traditionally, psychologists analyze only a few variables - partly because common unregularized linear models perform poorly when the number of variables (p) approaches the number of observations (N). Predictive modeling method...
Gespeichert in:
Veröffentlicht in: | Zeitschrift für Psychologie 2018, Vol.226 (4), p.246-258 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Longitudinal panels include several thousand participants and variables. Traditionally, psychologists analyze only a few variables - partly because common unregularized linear models perform poorly when the number of variables (p) approaches the number of observations (N). Predictive modeling methods can be used when N p situations arise in psychological research. We illustrate these techniques on exemplary variables from the German GESIS Panel, while describing the choice of preprocessing, model classes, resampling techniques, hyperparameter tuning, and performance measures. In analyses with about 2,000 subjects and variables each, we predict panelists' gender, sick days, an evaluation of US President Trump, income, life satisfaction, and sleep satisfaction. Elastic net and random forest models were compared to dummy predictions in benchmark experiments. While good performance was achieved, the linear elastic net performed similar to the nonlinear random forest. Elastic nets were refitted to extract the ten most important predictors. Their interpretation validates our approach, and further modeling options are discussed. Code can be found at https://osf.io/zpse3/ |
---|---|
ISSN: | 2190-8370 2151-2604 |
DOI: | 10.1027/2151-2604/a000343 |