Classifying Alzheimer’s disease and normal subjects using machine learning techniques and genetic-environmental features

Alzheimer's disease (AD) is complicated by multiple environmental and polygenetic factors. The accuracy of artificial neural networks (ANNs) incorporating the common factors for identifying AD has not been evaluated. A total of 184 probable AD patients and 3773 healthy individuals aged 65 and o...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of the Formosan Medical Association 2024-06, Vol.123 (6), p.701-709
Hauptverfasser: Huang, Yu-Hua, Chen, Yi-Chun, Ho, Wei-Min, Lee, Ren-Guey, Chung, Ren-Hua, Liu, Yu-Li, Chang, Pi-Yueh, Chang, Shih-Cheng, Wang, Chaung-Wei, Chung, Wen-Hung, Tsai, Shih-Jen, Kuo, Po-Hsiu, Lee, Yun-Shien, Hsiao, Chun-Chieh
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Alzheimer's disease (AD) is complicated by multiple environmental and polygenetic factors. The accuracy of artificial neural networks (ANNs) incorporating the common factors for identifying AD has not been evaluated. A total of 184 probable AD patients and 3773 healthy individuals aged 65 and over were enrolled. AD-related genes (51 SNPs) and 8 environmental factors were selected as features for multilayer ANN modeling. Random Forest (RF) and Support Vector Machine with RBF kernel (SVM) were also employed for comparison. Model results were verified using traditional statistics. The ANN achieved high accuracy (0.98), sensitivity (0.95), and specificity (0.96) in the intrinsic test for AD classification. Excluding age and genetic data still yielded favorable results (accuracy: 0.97, sensitivity: 0.94, specificity: 0.96). The assigned weights to ANN features highlighted the importance of mental evaluation, years of education, and specific genetic variations (CASS4 rs7274581, PICALM rs3851179, and TOMM40 rs2075650) for AD classification. Receiver operating characteristic analysis revealed AUC values of 0.99 (intrinsic test), 0.60 (TWB-GWA), and 0.72 (CG-WGS), with slightly lower AUC values (0.96, 0.80, 0.52) when excluding age in ANN. The performance of the ANN model in AD classification was comparable to RF, SVM (linear kernel), and SVM (RBF kernel). The ANN model demonstrated good sensitivity, specificity, and accuracy in AD classification. The top-weighted SNPs for AD prediction were CASS4 rs7274581, PICALM rs3851179, and TOMM40 rs2075650. The ANN model performed similarly to RF and SVM, indicating its capability to handle the complexity of AD as a disease entity.
ISSN:0929-6646
1876-0821
DOI:10.1016/j.jfma.2023.10.021