Machine Learning-Based Predictive Modeling of Anxiety and Depressive Symptoms During 8 Months of the COVID-19 Global Pandemic: Repeated Cross-sectional Survey Study
The COVID-19 global pandemic has increased the burden of mental illness on Canadian adults. However, the complex combination of demographic, economic, and lifestyle factors and perceived health risks contributing to patterns of anxiety and depression has not been explored. The aim of this study is t...
Gespeichert in:
Veröffentlicht in: | JMIR mental health 2021-11, Vol.8 (11), p.e32876 |
---|---|
Hauptverfasser: | , , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The COVID-19 global pandemic has increased the burden of mental illness on Canadian adults. However, the complex combination of demographic, economic, and lifestyle factors and perceived health risks contributing to patterns of anxiety and depression has not been explored.
The aim of this study is to harness flexible machine learning methods to identify constellations of factors related to symptoms of mental illness and to understand their changes over time during the COVID-19 pandemic.
Cross-sectional samples of Canadian adults (aged ≥18 years) completed web-based surveys in 6 waves from May to December 2020 (N=6021), and quota sampling strategies were used to match the English-speaking Canadian population in age, gender, and region. The surveys measured anxiety and depression symptoms, sociodemographic characteristics, substance use, and perceived COVID-19 risks and worries. First, principal component analysis was used to condense highly comorbid anxiety and depression symptoms into a single data-driven measure of emotional distress. Second, eXtreme Gradient Boosting (XGBoost), a machine learning algorithm that can model nonlinear and interactive relationships, was used to regress this measure on all included explanatory variables. Variable importance and effects across time were explored using SHapley Additive exPlanations (SHAP).
Principal component analysis of responses to 9 anxiety and depression questions on an ordinal scale revealed a primary latent factor, termed "emotional distress," that explained 76% of the variation in all 9 measures. Our XGBoost model explained a substantial proportion of variance in emotional distress (r
=0.39). The 3 most important items predicting elevated emotional distress were increased worries about finances (SHAP=0.17), worries about getting COVID-19 (SHAP=0.17), and younger age (SHAP=0.13). Hopefulness was associated with emotional distress and moderated the impacts of several other factors. Predicted emotional distress exhibited a nonlinear pattern over time, with the highest predicted symptoms in May and November and the lowest in June.
Our results highlight factors that may exacerbate emotional distress during the current pandemic and possible future pandemics, including a role of hopefulness in moderating distressing effects of other factors. The pandemic disproportionately affected emotional distress among younger adults and those economically impacted. |
---|---|
ISSN: | 2368-7959 2368-7959 |
DOI: | 10.2196/32876 |