Development and validation of a machine learning-based diagnostic model for Parkinson's disease in community-dwelling populations: Evidence from the China health and retirement longitudinal study (CHARLS)
Published in: | Parkinsonism & Related Disorders 2025-01, Vol. 130, p. 107182, Article 107182 |
---|---|
Main authors: | , , , , , |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
Abstract: | Parkinson's disease (PD) is a major neurodegenerative disorder in middle-aged and elderly people. There is a pressing need for effective predictive models, particularly in the Chinese population. Objective: This study aims to develop and validate a machine learning-based diagnostic model to identify individuals with PD in community-dwelling populations using data from the China Health and Retirement Longitudinal Study (CHARLS).
We used data from 19,134 individuals aged 45 and above from the CHARLS dataset, of whom 265 were reported to have PD. The external validation cohort included 1,500 individuals, of whom 21 (1.4 %) had PD. The random forest (RF) algorithm was used to develop an interpretable PD prediction model, which was internally validated using 10-fold cross-validation and externally validated with a dataset from Northern Jiangsu People's Hospital. SHapley Additive exPlanation (SHAP) values were used to explain the model's predictions.
The RF model demonstrated robust performance, with an Area Under the Curve (AUC) of 0.884 and high sensitivity, specificity, and F1 scores. In the external validation cohort, the model achieved an AUC of 0.82 and an accuracy of 0.99. The model's performance remained consistent across the internal and external validation cohorts. SHAP analysis provided insights into the importance and interaction of various predictors, enhancing model interpretability.
The study presents a highly accurate and interpretable machine learning-based diagnostic model for identifying PD in middle-aged and older Chinese adults. By combining predictive risk factors with chronic disease information, the model offers valuable insights for early identification and intervention, potentially mitigating PD progression.
• Developed a machine learning model for PD in China using random forest.
• Used SHAP values to explain the importance of predictors in the model, enhancing interpretability.
• Showcased the value of lifestyle and disease data in PD diagnostic models for early detection.
• Created a user-friendly app based on the model for clinical use and acceptance. |
---|---|
ISSN: | 1353-8020, 1873-5126 |
DOI: | 10.1016/j.parkreldis.2024.107182 |
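
The abstract's validation workflow (a random forest scored by 10-fold cross-validation and summarized by AUC) can be illustrated with a minimal sketch. This is not the authors' code: it assumes scikit-learn, uses synthetic data rather than CHARLS, and the hyperparameters, sample size, and class balance are hypothetical. The study explains predictions with SHAP values; the sketch swaps in the random forest's built-in impurity-based importances, which are a coarser measure and are not SHAP.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import StratifiedKFold, cross_val_predict

# Synthetic stand-in for survey data (NOT CHARLS): 2,000 rows, roughly 5% positives.
X, y = make_classification(
    n_samples=2000,
    n_features=12,
    n_informative=5,
    weights=[0.95, 0.05],
    random_state=0,
)

# Hypothetical hyperparameters; the abstract does not report the authors' settings.
model = RandomForestClassifier(
    n_estimators=100,
    class_weight="balanced",  # reweight classes because positives are rare
    random_state=0,
)

# 10-fold cross-validation, stratified so every fold contains positive cases.
cv = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)

# Out-of-fold probabilities: each row is scored by a model that never saw it.
proba = cross_val_predict(model, X, y, cv=cv, method="predict_proba")[:, 1]

auc = roc_auc_score(y, proba)
print(f"Out-of-fold AUC: {auc:.3f}")

# Impurity-based importances from a fit on all rows (a stand-in for SHAP, not SHAP).
model.fit(X, y)
ranking = np.argsort(model.feature_importances_)[::-1]
print("Feature indices ranked by importance:", ranking.tolist())
```

Stratified folds matter when the outcome is rare, as it is here (265 of 19,134 in the development data): an unstratified split could leave a fold with very few positive cases, which makes the per-fold AUC unstable.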