How Your Location Relates to Health: Variable Importance and Interpretable Machine Learning for Environmental and Sociodemographic Data
Health outcomes depend on complex environmental and sociodemographic factors whose effects change over location and time. Only recently has fine-grained spatial and temporal data become available to study these effects, namely the MEDSAT dataset of English health, environmental, and sociodemographic...
Gespeichert in:
Hauptverfasser: | , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Health outcomes depend on complex environmental and sociodemographic factors
whose effects change over location and time. Only recently has fine-grained
spatial and temporal data become available to study these effects, namely the
MEDSAT dataset of English health, environmental, and sociodemographic
information. Leveraging this new resource, we use a variety of variable
importance techniques to robustly identify the most informative predictors
across multiple health outcomes. We then develop an interpretable machine
learning framework based on Generalized Additive Models (GAMs) and Multiscale
Geographically Weighted Regression (MGWR) to analyze both local and global
spatial dependencies of each variable on various health outcomes. Our findings
identify NO2 as a global predictor for asthma, hypertension, and anxiety,
alongside other outcome-specific predictors related to occupation, marriage,
and vegetation. Regional analyses reveal local variations with air pollution
and solar radiation, with notable shifts during COVID. This comprehensive
approach provides actionable insights for addressing health disparities, and
advocates for the integration of interpretable machine learning in public
health. |
---|---|
DOI: | 10.48550/arxiv.2501.02111 |