ADHD classification with cross-dataset feature selection for biomarker consistency detection

Attention deficit hyperactivity disorder (ADHD) is a prevalent neurodevelopmental disorder in children. While numerous intelligent methods are applied for its subjective diagnosis, they seldom consider the consistency problem of ADHD biomarkers. In practice, these data-driven approaches lead to vary...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of neural engineering 2024-06, Vol.21 (3), p.36012
Hauptverfasser: Meng, Xiaojing, Chen, Ying, Gao, Yuan, Geng, Deqin, Tang, Yibin
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Attention deficit hyperactivity disorder (ADHD) is a prevalent neurodevelopmental disorder in children. While numerous intelligent methods are applied for its subjective diagnosis, they seldom consider the consistency problem of ADHD biomarkers. In practice, these data-driven approaches lead to varying learned features for ADHD classification across diverse ADHD datasets. This phenomenon significantly undermines the reliability of identified biomarkers and hampers the interpretability of these methods. In this study, we propose a cross-dataset feature selection (FS) module using a grouped SVM-based recursive feature elimination approach (G-SVM-RFE) to enhance biomarker consistency across multiple datasets. Additionally, we employ connectome gradient data for ADHD classification. In details, we introduce the G-SVM-RFE method to effectively concentrate gradient components within a few brain regions, thereby increasing the likelihood of identifying these regions as ADHD biomarkers. The cross-dataset FS module is integrated into an existing binary hypothesis testing (BHT) framework. This module utilizes external datasets to identify global regions that yield stable biomarkers. Meanwhile, given a dataset which waits for implementing the classification task as local dataset, we learn its own specific regions to further improve the performance of accuracy on this dataset. By employing this module, our experiments achieve an average accuracy of 96.7% on diverse datasets. Importantly, the discriminative gradient components primarily originate from the global regions, providing evidence for the significance of these regions. We further identify regions with the high appearance frequencies as biomarkers, where all the used global regions and one local region are recognized. These biomarkers align with existing research on impaired brain regions in children with ADHD. Thus, our method demonstrates its validity by providing enhanced biological explanations derived from ADHD mechanisms.
ISSN:1741-2560
1741-2552
DOI:10.1088/1741-2552/ad48bd